Infogain

https://infogain.com

Founded

Type

Size

Stage

About

Infogain, a Silicon Valley-based human-centered digital platform engineering company, is dedicated to forging a purpose driven future through innovative solutions. Our mission is to accelerate experience-led transformation for Fortune 500 companies and digital natives across diverse sectors, including technology, healthcare, insurance, travel, telecom, and retail/CPG. Leveraging cutting-edge technologies such as cloud, microservices, automation, IoT, and artificial intelligence, we are committed to #EngineeringBusinessOutcomes that make a real impact. Infogain is also a trusted multi-cloud expert, proficiently navigating hyperscale cloud providers like Microsoft Azure, Google Cloud Platform, and Amazon Web Services to deliver solutions that shape a better future. Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, and Singapore, with delivery centers in Seattle, Houston, Montevideo, Kraków, Noida, Bengaluru, Pune, Gurgaon, and Mumbai. Celebrating a global team and our commitment to fostering a diverse and inclusive work environment fuels our collective drive to engineer impactful solutions for a better future. To learn more, visit www.infogain.com.

Jobs at Infogain

Data Steward

at Infogain

Agency job

via Technogen India PvtLtd by RAHUL BATTA

The recruiter has not been active on this job recently. You may apply but please expect a delayed response.

NCR (Delhi | Gurgaon | Noida), Bengaluru (Bangalore), Mumbai, Pune

7 - 8 yrs

₹15L - ₹16L / yr

Data steward

MDM

Tamr

Reltio

Data engineering

+7 more

Data Steward :

Data Steward will collaborate and work closely within the group software engineering and business division. Data Steward has overall accountability for the group's / Divisions overall data and reporting posture by responsibly managing data assets, data lineage, and data access, supporting sound data analysis. This role requires focus on data strategy, execution, and support for projects, programs, application enhancements, and production data fixes. Makes well-thought-out decisions on complex or ambiguous data issues and establishes the data stewardship and information management strategy and direction for the group. Effectively communicates to individuals at various levels of the technical and business communities. This individual will become part of the corporate Data Quality and Data management/entity resolution team supporting various systems across the board.

Primary Responsibilities:

Responsible for data quality and data accuracy across all group/division delivery initiatives.
Responsible for data analysis, data profiling, data modeling, and data mapping capabilities.
Responsible for reviewing and governing data queries and DML.
Accountable for the assessment, delivery, quality, accuracy, and tracking of any production data fixes.
Accountable for the performance, quality, and alignment to requirements for all data query design and development.
Responsible for defining standards and best practices for data analysis, modeling, and queries.
Responsible for understanding end-to-end data flows and identifying data dependencies in support of delivery, release, and change management.
Responsible for the development and maintenance of an enterprise data dictionary that is aligned to data assets and the business glossary for the group responsible for the definition and maintenance of the group's data landscape including overlays with the technology landscape, end-to-end data flow/transformations, and data lineage.
Responsible for rationalizing the group's reporting posture through the definition and maintenance of a reporting strategy and roadmap.
Partners with the data governance team to ensure data solutions adhere to the organization’s data principles and guidelines.
Owns group's data assets including reports, data warehouse, etc.
Understand customer business use cases and be able to translate them to technical specifications and vision on how to implement a solution.
Accountable for defining the performance tuning needs for all group data assets and managing the implementation of those requirements within the context of group initiatives as well as steady-state production.
Partners with others in test data management and masking strategies and the creation of a reusable test data repository.
Responsible for solving data-related issues and communicating resolutions with other solution domains.
Actively and consistently support all efforts to simplify and enhance the Clinical Trial Predication use cases.
Apply knowledge in analytic and statistical algorithms to help customers explore methods to improve their business.
Contribute toward analytical research projects through all stages including concept formulation, determination of appropriate statistical methodology, data manipulation, research evaluation, and final research report.
Visualize and report data findings creatively in a variety of visual formats that appropriately provide insight to the stakeholders.
Achieve defined project goals within customer deadlines; proactively communicate status and escalate issues as needed.

Additional Responsibilities:

Strong understanding of the Software Development Life Cycle (SDLC) with Agile Methodologies
Knowledge and understanding of industry-standard/best practices requirements gathering methodologies.
Knowledge and understanding of Information Technology systems and software development.
Experience with data modeling and test data management tools.
Experience in the data integration project • Good problem solving & decision-making skills.
Good communication skills within the team, site, and with the customer

Knowledge, Skills and Abilities

Technical expertise in data architecture principles and design aspects of various DBMS and reporting concepts.
Solid understanding of key DBMS platforms like SQL Server, Azure SQL
Results-oriented, diligent, and works with a sense of urgency. Assertive, responsible for his/her own work (self-directed), have a strong affinity for defining work in deliverables, and be willing to commit to deadlines.
Experience in MDM tools like MS DQ, SAS DM Studio, Tamr, Profisee, Reltio etc.
Experience in Report and Dashboard development
Statistical and Machine Learning models
Python (sklearn, numpy, pandas, genism)
Nice to Have:
1yr of ETL experience
Natural Language Processing
Neural networks and Deep learning
xperience in keras,tensorflow,spacy, nltk, LightGBM python library

Interaction : Frequently interacts with subordinate supervisors.

Education : Bachelor’s degree, preferably in Computer Science, B.E or other quantitative field related to the area of assignment. Professional certification related to the area of assignment may be required

Experience : 7 years of Pharmaceutical /Biotech/life sciences experience, 5 years of Clinical Trials experience and knowledge, Excellent Documentation, Communication, and Presentation Skills including PowerPoint

Data Steward :

Primary Responsibilities:

Responsible for data quality and data accuracy across all group/division delivery initiatives.
Responsible for data analysis, data profiling, data modeling, and data mapping capabilities.
Responsible for reviewing and governing data queries and DML.
Accountable for the assessment, delivery, quality, accuracy, and tracking of any production data fixes.
Accountable for the performance, quality, and alignment to requirements for all data query design and development.
Responsible for defining standards and best practices for data analysis, modeling, and queries.
Responsible for understanding end-to-end data flows and identifying data dependencies in support of delivery, release, and change management.
Responsible for the development and maintenance of an enterprise data dictionary that is aligned to data assets and the business glossary for the group responsible for the definition and maintenance of the group's data landscape including overlays with the technology landscape, end-to-end data flow/transformations, and data lineage.
Responsible for rationalizing the group's reporting posture through the definition and maintenance of a reporting strategy and roadmap.
Partners with the data governance team to ensure data solutions adhere to the organization’s data principles and guidelines.
Owns group's data assets including reports, data warehouse, etc.
Understand customer business use cases and be able to translate them to technical specifications and vision on how to implement a solution.
Accountable for defining the performance tuning needs for all group data assets and managing the implementation of those requirements within the context of group initiatives as well as steady-state production.
Partners with others in test data management and masking strategies and the creation of a reusable test data repository.
Responsible for solving data-related issues and communicating resolutions with other solution domains.
Actively and consistently support all efforts to simplify and enhance the Clinical Trial Predication use cases.
Apply knowledge in analytic and statistical algorithms to help customers explore methods to improve their business.
Contribute toward analytical research projects through all stages including concept formulation, determination of appropriate statistical methodology, data manipulation, research evaluation, and final research report.
Visualize and report data findings creatively in a variety of visual formats that appropriately provide insight to the stakeholders.
Achieve defined project goals within customer deadlines; proactively communicate status and escalate issues as needed.

Additional Responsibilities:

Strong understanding of the Software Development Life Cycle (SDLC) with Agile Methodologies
Knowledge and understanding of industry-standard/best practices requirements gathering methodologies.
Knowledge and understanding of Information Technology systems and software development.
Experience with data modeling and test data management tools.
Experience in the data integration project • Good problem solving & decision-making skills.
Good communication skills within the team, site, and with the customer

Knowledge, Skills and Abilities

Technical expertise in data architecture principles and design aspects of various DBMS and reporting concepts.
Solid understanding of key DBMS platforms like SQL Server, Azure SQL
Results-oriented, diligent, and works with a sense of urgency. Assertive, responsible for his/her own work (self-directed), have a strong affinity for defining work in deliverables, and be willing to commit to deadlines.
Experience in MDM tools like MS DQ, SAS DM Studio, Tamr, Profisee, Reltio etc.
Experience in Report and Dashboard development
Statistical and Machine Learning models
Python (sklearn, numpy, pandas, genism)
Nice to Have:
1yr of ETL experience
Natural Language Processing
Neural networks and Deep learning
xperience in keras,tensorflow,spacy, nltk, LightGBM python library

Interaction : Frequently interacts with subordinate supervisors.

Sr Data Engineer

at Infogain

Agency job

via Technogen India PvtLtd by RAHUL BATTA

The recruiter has not been active on this job recently. You may apply but please expect a delayed response.

Bengaluru (Bangalore), Pune, Noida, NCR (Delhi | Gurgaon | Noida)

7 - 10 yrs

₹20L - ₹25L / yr

Data engineering

Python

SQL

Spark

PySpark

+10 more

Sr. Data Engineer:

Core Skills – Data Engineering, Big Data, Pyspark, Spark SQL and Python

Candidate with prior Palantir Cloud Foundry OR Clinical Trial Data Model background is preferred

Major accountabilities:

Responsible for Data Engineering, Foundry Data Pipeline Creation, Foundry Analysis & Reporting, Slate Application development, re-usable code development & management and Integrating Internal or External System with Foundry for data ingestion with high quality.
Have good understanding on Foundry Platform landscape and it’s capabilities
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Defines company data assets (data models), Pyspark, spark SQL, jobs to populate data models.
Designs data integrations and data quality framework.
Design & Implement integration with Internal, External Systems, F1 AWS platform using Foundry Data Connector or Magritte Agent
Collaboration with data scientists, data analyst and technology teams to document and leverage their understanding of the Foundry integration with different data sources - Actively participate in agile work practices
Coordinating with Quality Engineer to ensure the all quality controls, naming convention & best practices have been followed

Desired Candidate Profile :

Strong data engineering background
Experience with Clinical Data Model is preferred
Experience in

SQL Server ,Postgres, Cassandra, Hadoop, and Spark for distributed data storage and parallel computing
Java and Groovy for our back-end applications and data integration tools
Python for data processing and analysis
Cloud infrastructure based on AWS EC2 and S3

7+ years IT experience, 2+ years’ experience in Palantir Foundry Platform, 4+ years’ experience in Big Data platform
5+ years of Python and Pyspark development experience
Strong troubleshooting and problem solving skills
BTech or master's degree in computer science or a related technical field
Experience designing, building, and maintaining big data pipelines systems
Hands-on experience on Palantir Foundry Platform and Foundry custom Apps development
Able to design and implement data integration between Palantir Foundry and external Apps based on Foundry data connector framework
Hands-on in programming languages primarily Python, R, Java, Unix shell scripts
Hand-on experience in AWS / Azure cloud platform and stack
Strong in API based architecture and concept, able to do quick PoC using API integration and development
Knowledge of machine learning and AI
Skill and comfort working in a rapidly changing environment with dynamic objectives and iteration with users.

Demonstrated ability to continuously learn, work independently, and make decisions with minimal supervision

Sr. Data Engineer:

Core Skills – Data Engineering, Big Data, Pyspark, Spark SQL and Python

Candidate with prior Palantir Cloud Foundry OR Clinical Trial Data Model background is preferred

Major accountabilities:

Responsible for Data Engineering, Foundry Data Pipeline Creation, Foundry Analysis & Reporting, Slate Application development, re-usable code development & management and Integrating Internal or External System with Foundry for data ingestion with high quality.
Have good understanding on Foundry Platform landscape and it’s capabilities
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Defines company data assets (data models), Pyspark, spark SQL, jobs to populate data models.
Designs data integrations and data quality framework.
Design & Implement integration with Internal, External Systems, F1 AWS platform using Foundry Data Connector or Magritte Agent
Collaboration with data scientists, data analyst and technology teams to document and leverage their understanding of the Foundry integration with different data sources - Actively participate in agile work practices
Coordinating with Quality Engineer to ensure the all quality controls, naming convention & best practices have been followed

Desired Candidate Profile :

Strong data engineering background
Experience with Clinical Data Model is preferred
Experience in

SQL Server ,Postgres, Cassandra, Hadoop, and Spark for distributed data storage and parallel computing
Java and Groovy for our back-end applications and data integration tools
Python for data processing and analysis
Cloud infrastructure based on AWS EC2 and S3

7+ years IT experience, 2+ years’ experience in Palantir Foundry Platform, 4+ years’ experience in Big Data platform
5+ years of Python and Pyspark development experience
Strong troubleshooting and problem solving skills
BTech or master's degree in computer science or a related technical field
Experience designing, building, and maintaining big data pipelines systems
Hands-on experience on Palantir Foundry Platform and Foundry custom Apps development
Able to design and implement data integration between Palantir Foundry and external Apps based on Foundry data connector framework
Hands-on in programming languages primarily Python, R, Java, Unix shell scripts
Hand-on experience in AWS / Azure cloud platform and stack
Strong in API based architecture and concept, able to do quick PoC using API integration and development
Knowledge of machine learning and AI
Skill and comfort working in a rapidly changing environment with dynamic objectives and iteration with users.

Demonstrated ability to continuously learn, work independently, and make decisions with minimal supervision

Did not find a job you were looking for?

Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.

Similar companies

Cutshort

https://cutshort.io

Founded

2015

Type

Product

Size

20-100

Stage

Profitable

About the company

To hire better and faster, companies need rich candidate data, smart software and sound human judgement.

Cutshort is using AI to combine all these 3 to offer a 10x talent sourcing solution that is faster, better and cheaper.

We have 3 AI-powered offerings

Hire using our AI platform: Affordable annual subscriptions
Get only sourcing: 3.5% of annual CTC when you hire
Get full recruiting: 6.99% of annual CTC when you hire

Customers such as Fractal, Sprinto, Shiprocket, Highlevel, ThoughtWorks, Deepintent have built strong engineering teams with Cutshort.

Jobs

RefactorQ

https://refactorq.com

Founded

2021

Type

Services

Size

20-100

Stage

Profitable

About the company

Information Technology company

Jobs

Cutshort Lightning

https://cutshort.io

Founded

2023

Type

Services

Size

10-50

Stage

Bootstrapped

About the company

Jobs

Binocs Labs Pvt Ltd

https://binocs.co

Founded

2022

Type

Product

Size

0-20

Stage

Raised funding

About the company

Binocs is an AI-Driven Portfolio Tracking & Workflow Management System for Private Credit Funds. Binocs exists to improve operational efficiency and risk management for private credit funds, allowing them to scale up faster. Binocs automates document management by integrating of all channels to a single platform. Our platform uses AI and ML solutions to extract and standardize financial data for further analysing Financial Statements. Additionally, Binocs offers streamlined covenant & ESG monitoring; and advanced data analytics & risk management modules. Binocs provides bespoke financial models customized to meet the specific needs and requirements of Portfolio Managers.

Jobs

Autonomize AI

https://autonomize.ai

Founded

2022

Type

Product

Size

20-100

Stage

Raised funding

About the company

AI Copilots for Healthcare & Life Sciences. Empower knowledge workers, reduce administrative burden and improve health outcomes.

Jobs

Arthur Grand Technologies Pvt Ltd

https://arthurgrand.com

Founded

2012

Type

Products & Services

Size

20-100

Stage

Bootstrapped

About the company

Jobs

Sentiaflow

https://sentiaflow.com

Founded

2012

Type

Products & Services

Size

0-20

Stage

Bootstrapped

About the company

Sentiaflow is a New Delhi AI engineering company helping startups and enterprise teams hire AI engineers, build RAG pipelines, integrate LLMs, launch AI agents, and scale MLOps.

Jobs

Move my Stuff

https://movemstuff.com.au

Founded

2008

Type

Services

Size

20-100

Stage

Bootstrapped

About the company

Jobs

NSM Consultant

https://nsm-consultant.com

Founded

2012

Type

Products & Services

Size

20-100

Stage

Bootstrapped

About the company

NSM Consultant offers expert software development, business consulting, and IT solutions. Led by Ningaiah Mahesh with 27+ years experience and $300M sales track record.

Jobs

NumiSpark

https://numispark.com

Founded

2025

Type

Products & Services

Size

0-20

Stage

Bootstrapped

About the company

Numispark, agence digitale, propose des solutions innovantes en développement web & mobile, marketing digital, SEO et design, adaptées à vos besoins à Caen, Bernay et Rennes Normandie.

Jobs

Want to work at Infogain?

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs

Infogain

About

Company social profiles

Jobs at Infogain

Data Steward

Sr Data Engineer

Similar companies

Cutshort

RefactorQ

Cutshort Lightning

Binocs Labs Pvt Ltd

Autonomize AI

Arthur Grand Technologies Pvt Ltd

Sentiaflow

Move my Stuff

NSM Consultant

NumiSpark