Etl jobs

50+ ETL Jobs in India

Apply to 50+ ETL Jobs on CutShort.io. Find your next job, effortlessly. Browse ETL Jobs and apply today!

Data Engineer - GCP (Rotational Shift)

at Quantiphi

3 candid answers

1 video

Posted by Nikita Sinha

Chennai

4 - 8 yrs

Best in industry

Python

SQL

ETL

Google Cloud Platform (GCP)

Work Schedule: 4 days work from office with rotational shifts, including night shifts as per business requirements.

Required Skills:

Bachelor’s degree in Computer Science or similar field or equivalent work experience
3+ years of experience on Data Warehousing, Data Engineering or Data Integration projects
Expert with data warehousing concepts, strategies, and tools
Strong SQL background
Strong knowledge of relational databases like SQL Server, PostgreSQL, MySQL
Strong experience in GCP & Google BigQuery, Cloud SQL, Composer (Airflow), Dataflow, Dataproc, Cloud Function and GCS
Good to have knowledge on SQL Server Reporting Services (SSRS), and SQL Server Integration Services (SSIS)
Good to have a Mainframe skillset
Experience in Informatica Power exchange for Mainframe, Salesforce, and other new-age data sources
Experience in integration using APIs, XML, JSONs etc.
In-depth understanding of database management systems, online analytical processing (OLAP) and ETL (Extract, transform, load) framework, data-warehousing and Data Lakes
Good understanding of SDLC, Agile and Scrum processes
Strong problem-solving, multi-tasking, and organizational skills
Highly proficient in working with large volumes of business data and strong understanding of database design and implementation
Good written and verbal communication skills
Demonstrated experience of leading a team spread across multiple locations

Roles & Responsibilities:

Work with business users and other stakeholders to understand business processes
Ability to design and implement Dimensional and Fact tables
Identify and implement data transformation/cleansing requirements
Develop a highly scalable, reliable, and high-performance data processing pipeline to extract, transform and load data from various systems to the Enterprise Data Warehouse
Develop conceptual, logical, and physical data models with associated metadata including data lineage and technical data definitions
Design, develop and maintain ETL workflows and mappings using the appropriate data load technique
Provide research, high-level design, and estimates for data transformation and data integration from source applications to end-user BI solutions
Provide production support of ETL processes to ensure timely completion and availability of data in the data warehouse for reporting use
Analyze and resolve problems and provide technical assistance as necessary
Partner with the BI team to evaluate, design, develop BI reports and dashboards according to functional specifications while maintaining data integrity and data quality
Work collaboratively with key stakeholders to translate business information needs into well-defined data requirements to implement the BI solutions
Leverage transactional information, data from ERP, CRM, HRIS applications to model, extract and transform into reporting & analytics
Define and document the use of BI through user experience/use cases, prototypes, test, and deploy BI solutions
Develop and support data governance processes, analyze data to identify and articulate trends, patterns, outliers, quality issues, and continuously validate reports, dashboards and suggest improvements
Train business end-users, IT analysts, and developers

Work Schedule: 4 days work from office with rotational shifts, including night shifts as per business requirements.

Required Skills:

Bachelor’s degree in Computer Science or similar field or equivalent work experience
3+ years of experience on Data Warehousing, Data Engineering or Data Integration projects
Expert with data warehousing concepts, strategies, and tools
Strong SQL background
Strong knowledge of relational databases like SQL Server, PostgreSQL, MySQL
Strong experience in GCP & Google BigQuery, Cloud SQL, Composer (Airflow), Dataflow, Dataproc, Cloud Function and GCS
Good to have knowledge on SQL Server Reporting Services (SSRS), and SQL Server Integration Services (SSIS)
Good to have a Mainframe skillset
Experience in Informatica Power exchange for Mainframe, Salesforce, and other new-age data sources
Experience in integration using APIs, XML, JSONs etc.
In-depth understanding of database management systems, online analytical processing (OLAP) and ETL (Extract, transform, load) framework, data-warehousing and Data Lakes
Good understanding of SDLC, Agile and Scrum processes
Strong problem-solving, multi-tasking, and organizational skills
Highly proficient in working with large volumes of business data and strong understanding of database design and implementation
Good written and verbal communication skills
Demonstrated experience of leading a team spread across multiple locations

Roles & Responsibilities:

Work with business users and other stakeholders to understand business processes
Ability to design and implement Dimensional and Fact tables
Identify and implement data transformation/cleansing requirements
Develop a highly scalable, reliable, and high-performance data processing pipeline to extract, transform and load data from various systems to the Enterprise Data Warehouse
Develop conceptual, logical, and physical data models with associated metadata including data lineage and technical data definitions
Design, develop and maintain ETL workflows and mappings using the appropriate data load technique
Provide research, high-level design, and estimates for data transformation and data integration from source applications to end-user BI solutions
Provide production support of ETL processes to ensure timely completion and availability of data in the data warehouse for reporting use
Analyze and resolve problems and provide technical assistance as necessary
Partner with the BI team to evaluate, design, develop BI reports and dashboards according to functional specifications while maintaining data integrity and data quality
Work collaboratively with key stakeholders to translate business information needs into well-defined data requirements to implement the BI solutions
Leverage transactional information, data from ERP, CRM, HRIS applications to model, extract and transform into reporting & analytics
Define and document the use of BI through user experience/use cases, prototypes, test, and deploy BI solutions
Develop and support data governance processes, analyze data to identify and articulate trends, patterns, outliers, quality issues, and continuously validate reports, dashboards and suggest improvements
Train business end-users, IT analysts, and developers

Databricks Engineer with AI/BI

at TECHNAVITAS INFO INDIA PVT LTD

Posted by RAMANA P

Hyderabad

5 - 8 yrs

₹5L - ₹20L / yr

MLFlow

Apache Spark

Python

databricks

Delta Lake

+6 more

Data Engineer — AI / BI

Artificial Intelligence & Business Intelligence | Data & Analytics

Who We Are:

Since our inception back in 2006, Navitas has grown to be an industry leader in the digital transformation space, and we’ve served as trusted advisors supporting our client base within the commercial, federal, and state and local markets.

What We Do:

At our very core, we’re a group of problem solvers providing our award-winning technology solutions to drive digital acceleration for our customers! With proven solutions, award-winning technologies, and a team of expert problem solvers, Navitas has consistently empowered customers to use technology as a competitive advantage and deliver cutting-edge transformative solutions.

Position Overview

We are seeking a Databricks Engineer to design, build, and operate a Data & AI platform with a strong foundation in the Medallion Architecture (raw/bronze, curated/silver, and mart/gold layers). This platform will orchestrate complex data workflows and scalable ELT pipelines to integrate data from enterprise systems such as PeopleSoft, D2L, and Salesforce, delivering high-quality, governed data for machine learning, AI/BI, and analytics at scale.

You will play a critical role in engineering the infrastructure and workflows that enable seamless data flow across the enterprise, ensure operational excellence, and provide the backbone for strategic decision-making, predictive modeling, and innovation

Responsibilities:

Data & AI Platform Engineering (Databricks-Centric):

Design, implement, and optimize end-to-end data pipelines on Databricks, following the Medallion Architecture principles.
Build robust and scalable ETL/ELT pipelines using Apache Spark and Delta Lake to transform raw (bronze) data into trusted curated (silver) and analytics-ready (gold) data layers.
Operationalize Databricks Workflows for orchestration, dependency management, and pipeline automation.
Apply schema evolution and data versioning to support agile data development.

Platform Integration & Data Ingestion:

Connect and ingest data from enterprise systems such as PeopleSoft, D2L, and Salesforce using APIs, JDBC, or other integration frameworks.
Implement connectors and ingestion frameworks that accommodate structured, semi-structured, and unstructured data.
Design standardized data ingestion processes with automated error handling, retries, and alerting.

Data Quality, Monitoring, and Governance:

Develop data quality checks, validation rules, and anomaly detection mechanisms to ensure data integrity across all layers.
Integrate monitoring and observability tools (e.g., Databricks metrics, Grafana) to track ETL performance, latency, and failures.
Implement Unity Catalog or equivalent tools for centralized metadata management, data lineage, and governance policy enforcement.

Security, Privacy, and Compliance:

Enforce data security best practices including row-level security, encryption at rest/in transit, and fine-grained access control via Unity Catalog.
Design and implement data masking, tokenization, and anonymization for compliance with privacy regulations (e.g., GDPR, FERPA).
Work with security teams to audit and certify compliance controls.

AI/ML-Ready Data Foundation:

Enable data scientists by delivering high-quality, feature-rich data sets for model training and inference.
Support AIOps/MLOps lifecycle workflows using MLflow for experiment tracking, model registry, and deployment within Databricks.
Collaborate with AI/ML teams to create reusable feature stores and training pipelines.

Cloud Data Architecture and Storage:

Architect and manage data lakes on Azure Data Lake Storage (ADLS) or Amazon S3, and design ingestion pipelines to feed the bronze layer.
Build data marts and warehousing solutions using platforms like Databricks.
Optimize data storage and access patterns for performance and cost-efficiency.

Documentation & Enablement:

Maintain technical documentation, architecture diagrams, data dictionaries, and runbooks for all pipelines and components.
Provide training and enablement sessions to internal stakeholders on the Databricks platform, Medallion Architecture, and data governance practices.
Conduct code reviews and promote reusable patterns and frameworks across teams.

Reporting and Accountability:

Submit a weekly schedule of hours worked and progress reports outlining completed tasks, upcoming plans, and blockers.
Track deliverables against roadmap milestones and communicate risks or dependencies.

Required Qualifications:

Hands-on experience with Databricks, Delta Lake, and Apache Spark for large-scale data engineering.
Deep understanding of ELT pipeline development, orchestration, and monitoring in cloud-native environments.
Experience implementing Medallion Architecture (Bronze/Silver/Gold) and working with data versioning and schema enforcement in enterprise grade environments.
Strong proficiency in SQL, Python, or Scala for data transformations and workflow logic.
Proven experience integrating enterprise platforms (e.g., PeopleSoft, Salesforce, D2L) into centralized data platforms.
Familiarity with data governance, lineage tracking, and metadata management tools.

Preferred Qualifications:

Prior UMGC or USM experience preferred.
Experience with Databricks Unity Catalog for metadata management and access control.
Experience deploying ML models at scale using MLFlow or similar MLOps tools.
Familiarity with cloud platforms like Azure or AWS, including storage, security, and networking aspects.
Knowledge of data warehouse design and star/snowflake schema modeling.

Equal Employer/Veterans/Disabled

Navitas Business Consulting is an affirmative action and equal opportunity employer. If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please contact Navitas Human Resources.

Navitas is an equal opportunity employer. We provide employment and opportunities for advancement, compensation, training, and growth according to individual merit, without regard to race, color, religion, sex (including pregnancy), national origin, sexual orientation, gender identity or expression, marital status, age, genetic information, disability, veteran-status veteran or military status, or any other characteristic protected under applicable Federal, state, or local law. Our goal is for each staff member to have the opportunity to grow to the limits of their abilities and to achieve personal and organizational objectives. We will support positive programs for equal treatment of all staff and full utilization of all qualified employees at all levels within Navitas.

Data Engineer — AI / BI

Artificial Intelligence & Business Intelligence | Data & Analytics

Who We Are:

What We Do:

Position Overview

Responsibilities:

Data & AI Platform Engineering (Databricks-Centric):

Design, implement, and optimize end-to-end data pipelines on Databricks, following the Medallion Architecture principles.
Build robust and scalable ETL/ELT pipelines using Apache Spark and Delta Lake to transform raw (bronze) data into trusted curated (silver) and analytics-ready (gold) data layers.
Operationalize Databricks Workflows for orchestration, dependency management, and pipeline automation.
Apply schema evolution and data versioning to support agile data development.

Platform Integration & Data Ingestion:

Connect and ingest data from enterprise systems such as PeopleSoft, D2L, and Salesforce using APIs, JDBC, or other integration frameworks.
Implement connectors and ingestion frameworks that accommodate structured, semi-structured, and unstructured data.
Design standardized data ingestion processes with automated error handling, retries, and alerting.

Data Quality, Monitoring, and Governance:

Develop data quality checks, validation rules, and anomaly detection mechanisms to ensure data integrity across all layers.
Integrate monitoring and observability tools (e.g., Databricks metrics, Grafana) to track ETL performance, latency, and failures.
Implement Unity Catalog or equivalent tools for centralized metadata management, data lineage, and governance policy enforcement.

Security, Privacy, and Compliance:

Enforce data security best practices including row-level security, encryption at rest/in transit, and fine-grained access control via Unity Catalog.
Design and implement data masking, tokenization, and anonymization for compliance with privacy regulations (e.g., GDPR, FERPA).
Work with security teams to audit and certify compliance controls.

AI/ML-Ready Data Foundation:

Enable data scientists by delivering high-quality, feature-rich data sets for model training and inference.
Support AIOps/MLOps lifecycle workflows using MLflow for experiment tracking, model registry, and deployment within Databricks.
Collaborate with AI/ML teams to create reusable feature stores and training pipelines.

Cloud Data Architecture and Storage:

Architect and manage data lakes on Azure Data Lake Storage (ADLS) or Amazon S3, and design ingestion pipelines to feed the bronze layer.
Build data marts and warehousing solutions using platforms like Databricks.
Optimize data storage and access patterns for performance and cost-efficiency.

Documentation & Enablement:

Maintain technical documentation, architecture diagrams, data dictionaries, and runbooks for all pipelines and components.
Provide training and enablement sessions to internal stakeholders on the Databricks platform, Medallion Architecture, and data governance practices.
Conduct code reviews and promote reusable patterns and frameworks across teams.

Reporting and Accountability:

Submit a weekly schedule of hours worked and progress reports outlining completed tasks, upcoming plans, and blockers.
Track deliverables against roadmap milestones and communicate risks or dependencies.

Required Qualifications:

Hands-on experience with Databricks, Delta Lake, and Apache Spark for large-scale data engineering.
Deep understanding of ELT pipeline development, orchestration, and monitoring in cloud-native environments.
Experience implementing Medallion Architecture (Bronze/Silver/Gold) and working with data versioning and schema enforcement in enterprise grade environments.
Strong proficiency in SQL, Python, or Scala for data transformations and workflow logic.
Proven experience integrating enterprise platforms (e.g., PeopleSoft, Salesforce, D2L) into centralized data platforms.
Familiarity with data governance, lineage tracking, and metadata management tools.

Preferred Qualifications:

Prior UMGC or USM experience preferred.
Experience with Databricks Unity Catalog for metadata management and access control.
Experience deploying ML models at scale using MLFlow or similar MLOps tools.
Familiarity with cloud platforms like Azure or AWS, including storage, security, and networking aspects.
Knowledge of data warehouse design and star/snowflake schema modeling.

Equal Employer/Veterans/Disabled

Hiring | CloudSufi |Python Architect (PySpark, Fivetran/Airbyte)|Noida

at CLOUDSUFI

3 recruiters

Posted by Lishta Jain

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

8 - 14 yrs

₹30L - ₹45L / yr

Python

PySpark

Fivetran

Airbyte

Data engineering

+3 more

About Us :

CLOUDSUFI, a Google Cloud Premier Partner, a Data Science and Product Engineering organization building Products and Solutions for Technology and Enterprise industries. We firmly believe in the power of data to transform businesses and make better decisions. We combine unmatched experience in business processes with cutting edge infrastructure and cloud services. We partner with our customers to monetize their data and make enterprise data dance.

Our Values :

We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of lives for our family, customers, partners and the community.

Equal Opportunity Statement :

CLOUDSUFI is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified candidates receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, and national origin status. We provide equal opportunities in employment, advancement, and all other areas of our workplace.

Role : Architect/Lead-Python(Hands-on experience with at least one SaaS ingestion platform such as Fivetran ,Airbyte,etc. is mandatory.)

Location : Noida, Delhi/NCR(Hybrid)

Experience : 8- 14 years

Education : BTech / BE / MCA / MSc Computer Science

Must have :

- 6+ years of data engineering; at least 3 years working on connector or integration framework development

- Deep Python expertise including PySpark, pyarrow, and an understanding of Spark's execution model (driver vs executor, serialization constraints, partition fan-out)

- Hands-on experience with at least one SaaS ingestion platform Fivetran, Airbyte, Google DTS, AWS Glue connectors, or equivalent at the connector-build level, not just configuration

- Strong understanding of OAuth 2.0 flows (auth code, PKCE, client credentials, JWT), rate limiting strategies (token bucket, leaky bucket, per-endpoint quotas), and incremental sync patterns (cursor, watermark, CDC)

- Experience designing shared connector frameworks reusable auth managers, rate governors, state stores not just per-connector scripts

- Ability to author and own TDDs and PRDs that can be handed to a junior engineer with minimal back-and-forth

Nice to have :

- Prior exposure to Databricks Asset Bundles / Declarative Automation Bundles or Lakeflow pipelines

- Experience with the Databricks Python Data Source API (DBR 15.4 LTS+) extremely rare, so treat practical Spark DSv2 Java/Scala background as equivalent

- GCP DTS or Cloud Data Fusion connector experience

- Knowledge of the specific source systems, particularly Social Ads APIs (Meta, LinkedIn, X) or enterprise SaaS (Salesforce, Oracle)

About Us :

Our Values :

We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of lives for our family, customers, partners and the community.

Equal Opportunity Statement :

Role : Architect/Lead-Python(Hands-on experience with at least one SaaS ingestion platform such as Fivetran ,Airbyte,etc. is mandatory.)

Location : Noida, Delhi/NCR(Hybrid)

Experience : 8- 14 years

Education : BTech / BE / MCA / MSc Computer Science

Must have :

- 6+ years of data engineering; at least 3 years working on connector or integration framework development

- Deep Python expertise including PySpark, pyarrow, and an understanding of Spark's execution model (driver vs executor, serialization constraints, partition fan-out)

- Hands-on experience with at least one SaaS ingestion platform Fivetran, Airbyte, Google DTS, AWS Glue connectors, or equivalent at the connector-build level, not just configuration

- Experience designing shared connector frameworks reusable auth managers, rate governors, state stores not just per-connector scripts

- Ability to author and own TDDs and PRDs that can be handed to a junior engineer with minimal back-and-forth

Nice to have :

- Prior exposure to Databricks Asset Bundles / Declarative Automation Bundles or Lakeflow pipelines

- Experience with the Databricks Python Data Source API (DBR 15.4 LTS+) extremely rare, so treat practical Spark DSv2 Java/Scala background as equivalent

- GCP DTS or Cloud Data Fusion connector experience

- Knowledge of the specific source systems, particularly Social Ads APIs (Meta, LinkedIn, X) or enterprise SaaS (Salesforce, Oracle)

ETL Developer

Service Co

Agency job

via Vikash Technologies by Rishika Teja

Hyderabad, Bengaluru (Bangalore), Chennai

3 - 5 yrs

₹12L - ₹18L / yr

ETL

SSIS

SQL Server Reporting Services (SSRS)

MS SQLServer

* 3–5 years of hands-on experience in ETL development, including designing, developing, and optimizing ETL pipelines.

* Strong expertise in T-SQL, SSIS, and SSRS (mandatory skills).

* Strong experience with MS SQL Server for querying, data transformation, and handling large datasets.

* Good understanding of relational and non-relational databases.

* Excellent communication skills with the ability to collaborate with technical and business teams.

* 3–5 years of hands-on experience in ETL development, including designing, developing, and optimizing ETL pipelines.

* Strong expertise in T-SQL, SSIS, and SSRS (mandatory skills).

* Strong experience with MS SQL Server for querying, data transformation, and handling large datasets.

* Good understanding of relational and non-relational databases.

* Excellent communication skills with the ability to collaborate with technical and business teams.

Data Engineer

at Amura Health

3 candid answers

1 video

Posted by Sangeetha A

Chennai

4 - 7 yrs

₹1L - ₹30L / yr

Python

SQL

Amazon Web Services (AWS)

ELT

ETL

Data Engineer at Amura

Amura’s Vision

We believe that the most under-appreciated route to releasing untapped human potential is to build a healthier body, and through which a better brain. This allows us to do more of everything that is important to each one of us. Billions of healthier brains, sitting in healthier bodies, can take up more complex problems that defy solutions today, including many existential threats, and solve them in just a few decades.

Billions of healthier brains will make the world richer beyond what we can imagine today. The surplus wealth, combined with better human capabilities, will lead us to a new renaissance, giving us a richer and more beautiful culture. These healthier brains will be equipped with deeper intellect, be less acrimonious, more magnanimous, and have a kinder outlook on the world, resulting in a world that is better than any previous time.

We find this vision of the future exhilarating. Our hopes and dreams are to create this future as quickly as possible and ensure that it is widely distributed and optimized to maximize all forms of human excellence.

Role Overview

We are looking for a hands-on Data Engineer to design, build, and maintain scalable data pipelines and data platforms. You will work on ingesting, transforming, and serving data reliably for analytics, reporting, and downstream applications, collaborating closely with backend engineers, analysts, and data scientists. This role is ideal for someone who enjoys building robust data systems, working with large datasets, and writing clean, production-grade code.

Key Responsibilities

Data Pipelines & Development

Build and maintain reliable ETL/ELT pipelines for batch and near-real-time data processing.
Ingest data from multiple sources (databases, APIs, event streams, files).
Transform raw data into clean, analytics-ready datasets.
Optimize pipelines for performance, scalability, and cost.

Data Storage & Modeling

Design and manage data models in data warehouses or data lakes.
Work with SQL and NoSQL databases and modern data warehouses.
Implement partitioning, indexing, and efficient query patterns.
Maintain documentation for schemas, pipelines, and transformations.

Cloud & Tooling

Build data solutions on cloud platforms (AWS preferred).
Use services such as S3, Redshift, Athena, Glue, EMR, Lambda, Kinesis, or equivalents.
Work with orchestration tools like Airflow or similar schedulers.
Use version control, CI/CD, and Infrastructure-as-Code where applicable.

Data Quality & Reliability

Implement data validation, monitoring, and alerting for pipelines.
Troubleshoot data issues and ensure pipeline reliability.
Collaborate with stakeholders to resolve data discrepancies.

Collaboration

Partner with analytics, product, and engineering teams to understand data needs.
Support analysts and data scientists with clean, accessible datasets.
Participate in code reviews and contribute to data engineering best practices.

What We’re Looking For

Experience: 4-6+ years of experience as a Data Engineer / Data Developer.
Programming: Strong programming skills in Python.
Databases: Excellent knowledge of SQL and relational data modeling.
Pipelines: Experience building ETL/ELT pipelines in production.
Cloud: Hands-on experience with cloud-based data platforms (AWS preferred).
Concepts: Understanding of data warehousing concepts and best practices.

Nice to Have:

Experience with Spark, Kafka, dbt, or Flink.
Familiarity with orchestration tools like Airflow.
Experience with streaming or event-driven data pipelines.
Exposure to data quality or observability tools.
Experience working with large-scale or high-volume datasets.

Additional Information

Office Location: Chennai (Velachery).
Work Model: Work from Office - because great stories are built in person!.
Online Presence: https://amura.ai (@AmuraHealth on all social media).

Data Engineer at Amura

Amura’s Vision

Role Overview

Key Responsibilities

Data Pipelines & Development

Build and maintain reliable ETL/ELT pipelines for batch and near-real-time data processing.
Ingest data from multiple sources (databases, APIs, event streams, files).
Transform raw data into clean, analytics-ready datasets.
Optimize pipelines for performance, scalability, and cost.

Data Storage & Modeling

Design and manage data models in data warehouses or data lakes.
Work with SQL and NoSQL databases and modern data warehouses.
Implement partitioning, indexing, and efficient query patterns.
Maintain documentation for schemas, pipelines, and transformations.

Cloud & Tooling

Build data solutions on cloud platforms (AWS preferred).
Use services such as S3, Redshift, Athena, Glue, EMR, Lambda, Kinesis, or equivalents.
Work with orchestration tools like Airflow or similar schedulers.
Use version control, CI/CD, and Infrastructure-as-Code where applicable.

Data Quality & Reliability

Implement data validation, monitoring, and alerting for pipelines.
Troubleshoot data issues and ensure pipeline reliability.
Collaborate with stakeholders to resolve data discrepancies.

Collaboration

Partner with analytics, product, and engineering teams to understand data needs.
Support analysts and data scientists with clean, accessible datasets.
Participate in code reviews and contribute to data engineering best practices.

What We’re Looking For

Experience: 4-6+ years of experience as a Data Engineer / Data Developer.
Programming: Strong programming skills in Python.
Databases: Excellent knowledge of SQL and relational data modeling.
Pipelines: Experience building ETL/ELT pipelines in production.
Cloud: Hands-on experience with cloud-based data platforms (AWS preferred).
Concepts: Understanding of data warehousing concepts and best practices.

Nice to Have:

Experience with Spark, Kafka, dbt, or Flink.
Familiarity with orchestration tools like Airflow.
Experience with streaming or event-driven data pipelines.
Exposure to data quality or observability tools.
Experience working with large-scale or high-volume datasets.

Additional Information

Office Location: Chennai (Velachery).
Work Model: Work from Office - because great stories are built in person!.
Online Presence: https://amura.ai (@AmuraHealth on all social media).

Data Engineer

at Loyalty Juggernaut Inc

2 recruiters

Posted by Shraddha Dhavle

Hyderabad

2 - 6 yrs

₹5L - ₹15L / yr

ETL

Data Structures

Python

Amazon Web Services (AWS)

About LJI

Loyalty Juggernaut (LJI) is a leading B2B SaaS company redefining how enterprises drive customer engagement and loyalty. Our flagship platform, GRAVTY®, enables global brands to transform loyalty programs into measurable, revenue-generating growth engines.

Built as an AI-first, next-generation solution, GRAVTY® empowers organizations to deliver highly personalized, real-time experiences at scale—helping them increase customer lifetime value and deepen brand relationships.

Headquartered in Palo Alto, California, LJI partners with leading enterprises across 16 major industries including airlines, retail, hospitality, financial services and telecommunications powering some of the most innovative loyalty ecosystems worldwide.

Our Global Impact:

400+ Million members connected through our platform.
100+ Global Brands trust us to drive loyalty and brand devotion.
3-Time Winner of “Best Technology Innovation in Loyalty”.
Global recognitions for Excellence in Loyalty Management under numerous categories.
Recognised as a ‘Strong performer’ in The Forrester Wave™ Loyalty Platforms, Q4 2025.

Explore more about us at www.lji.io

What you will OWN:

Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources using SQL and AWS ‘big data’ technologies.
Create and maintain optimal data pipeline architecture.
Identify, design, and implement internal process improvements, automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Work with stakeholders, including the Technical Architects, Developers, Product Owners, and Executives, to assist with data-related technical issues and support their data infrastructure needs.
Create tools for data management and data analytics that can assist them in building and optimizing our product to become an innovative industry leader.

You would make a GREAT FIT if you have:

Have 2 to 5 years of relevant backend development experience, with solid expertise in Python.
Possess strong skills in Data Structures and Algorithms, and can write optimized, maintainable code.
Are familiar with database systems, and can comfortably work with PostgreSQL, as well as NoSQL solutions like MongoDB or DynamoDB.
Hands-on experience using Cloud Dataware houses like AWS Redshift, GBQ, etc.
Experience with AWS cloud services: EC2, EMR, RDS, Redshift, and AWS Batch would be an added advantage.
Have a solid understanding of ETL processes and tools and can build or modify ETL pipelines effectively.
Have experience managing or building data pipelines and architectures at scale.
Understand the nuances of data ingestion, transformation, storage, and analytics workflows.
Communicate clearly and work collaboratively across engineering, product.

Why Choose US?

This opportunity offers a dynamic and supportive work environment where you'll have the chance to not just collaborate with talented technocrats but also work with globally recognized brands, gain exposure, and carve your own career path.
You will get to innovate and dabble in the future of technology -Enterprise Cloud Computing, Blockchain, Machine Learning, AI, Mobile, Digital Wallets, and much more.

About LJI

Our Global Impact:

400+ Million members connected through our platform.
100+ Global Brands trust us to drive loyalty and brand devotion.
3-Time Winner of “Best Technology Innovation in Loyalty”.
Global recognitions for Excellence in Loyalty Management under numerous categories.
Recognised as a ‘Strong performer’ in The Forrester Wave™ Loyalty Platforms, Q4 2025.

Explore more about us at www.lji.io

What you will OWN:

Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources using SQL and AWS ‘big data’ technologies.
Create and maintain optimal data pipeline architecture.
Identify, design, and implement internal process improvements, automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Work with stakeholders, including the Technical Architects, Developers, Product Owners, and Executives, to assist with data-related technical issues and support their data infrastructure needs.
Create tools for data management and data analytics that can assist them in building and optimizing our product to become an innovative industry leader.

You would make a GREAT FIT if you have:

Have 2 to 5 years of relevant backend development experience, with solid expertise in Python.
Possess strong skills in Data Structures and Algorithms, and can write optimized, maintainable code.
Are familiar with database systems, and can comfortably work with PostgreSQL, as well as NoSQL solutions like MongoDB or DynamoDB.
Hands-on experience using Cloud Dataware houses like AWS Redshift, GBQ, etc.
Experience with AWS cloud services: EC2, EMR, RDS, Redshift, and AWS Batch would be an added advantage.
Have a solid understanding of ETL processes and tools and can build or modify ETL pipelines effectively.
Have experience managing or building data pipelines and architectures at scale.
Understand the nuances of data ingestion, transformation, storage, and analytics workflows.
Communicate clearly and work collaboratively across engineering, product.

Why Choose US?

This opportunity offers a dynamic and supportive work environment where you'll have the chance to not just collaborate with talented technocrats but also work with globally recognized brands, gain exposure, and carve your own career path.
You will get to innovate and dabble in the future of technology -Enterprise Cloud Computing, Blockchain, Machine Learning, AI, Mobile, Digital Wallets, and much more.

Data Engineer (Healthcare Data & SQL Expert)

at Mango Sciences

Posted by Supriya C

Remote only

5 - 7 yrs

₹10L - ₹25L / yr

Python

SQL

RDMS

ETL

Database Design

+7 more

Sr. DE / Data Engineer (Healthcare Data & SQL Expert)

Experience Level: 5–7 Years

Focus: Database Design, Advanced SQL, ETL/ELT Pipelines, and Healthcare Interoperability.

Summary

We are looking for a highly skilled Senior Data Engineer to join our healthcare data team. This role is perfect for a technical powerhouse who excels at building robust data pipelines and deeply understands database internals. You will be responsible for designing schemas, writing complex stored procedures, and optimizing SQL performance to handle clinical and claims data at scale. You will bridge the gap between raw data ingestion and high-performance analytics, ensuring all solutions meet HIPAA and FHIR standards.

What You’ll Do

1. Advanced SQL & Database Development

Schema Design: Design and implement relational schemas (MSSQL, PostgreSQL, Oracle) ensuring data integrity through constraints, triggers, and normalized structures.
Programmability: Write and maintain sophisticated Stored Procedures, Functions, and Views to handle complex business logic within the database layer.
Performance Tuning: Own query optimization. You should be the expert in reading EXPLAIN/ANALYZE plans, implementing advanced indexing strategies (Clustered, Non-Clustered, Columnstore), and managing partitioning.
Data Modeling: Build and manage dimensional models (Star/Snowflake) and implement Slowly Changing Dimensions (SCD Types 1, 2, and 4).
Getty Images

2. Data Engineering & Ingestion

Pipeline Development: Build and operate scalable ETL/ELT pipelines using Python and SQL to ingest data from EHRs, REST APIs, and flat files.
Orchestration: Use Apache Airflow to schedule jobs, manage dependencies, and implement robust retry/alerting logic.
API Integration: Develop Python-based ingestion frameworks that handle OAuth, pagination, and throttling for third-party healthcare data partners.

3. Healthcare Interoperability & Compliance

Standards: Map complex clinical data to HL7 FHIR resources and curated analytic layers.
Security: Implement "Privacy by Design" by enforcing HIPAA safeguards, including encryption at rest, access controls, and PII/PHI de-identification.

4. Operational Excellence

CI/CD: Use GitHub and automated pipelines to deploy database changes and data code.
Observability: Implement data quality tests (using tools like dbt or custom Python/SQL checks) to monitor freshness and accuracy.

What You’ll Bring

Experience: 5–7 years of professional data engineering experience, with a heavy emphasis on backend database development.
The SQL Expert Toolkit:
Expert SQL: Window functions, CTEs, recursive queries, and set-based transformations.
DB Internals: Deep knowledge of MSSQL, PostgreSQL, or Oracle. You should understand how the engine stores and retrieves data.
Optimization: Proven track record of turning "slow" queries into high-performance assets via indexing and refactoring.
The Engineering Toolkit:
Python: Intermediate to advanced (Pandas/Polars, Requests, SQLAlchemy, or PySpark).
Orchestration: Practical experience with Airflow (or Prefect/Dagster).
Legacy/Cloud mix: Proficiency in SSIS/SSMA or PowerShell is a plus for migrating legacy workloads to modern platforms.
The Domain Knowledge: Familiarity with FHIR/HL7 and an understanding of the importance of data governance in a regulated environment.

Technical "Must-Haves" for the Interview

Ability to whiteboard a complex Database Schema from scratch.
Ability to debug a long-running SQL query and explain the IO/CPU trade-offs of different index types.
Experience handling JSON/BSON data types within a relational database context.

Nice to Have

Experience with NoSQL systems like MongoDB or Elasticsearch.
Cloud experience (Azure, AWS, or GCP) specifically regarding managed SQL services.
Knowledge of dbt (data build tool) for managing transformations in the warehouse.

Sr. DE / Data Engineer (Healthcare Data & SQL Expert)

Experience Level: 5–7 Years

Focus: Database Design, Advanced SQL, ETL/ELT Pipelines, and Healthcare Interoperability.

Summary

What You’ll Do

1. Advanced SQL & Database Development

Schema Design: Design and implement relational schemas (MSSQL, PostgreSQL, Oracle) ensuring data integrity through constraints, triggers, and normalized structures.
Programmability: Write and maintain sophisticated Stored Procedures, Functions, and Views to handle complex business logic within the database layer.
Performance Tuning: Own query optimization. You should be the expert in reading EXPLAIN/ANALYZE plans, implementing advanced indexing strategies (Clustered, Non-Clustered, Columnstore), and managing partitioning.
Data Modeling: Build and manage dimensional models (Star/Snowflake) and implement Slowly Changing Dimensions (SCD Types 1, 2, and 4).
Getty Images

2. Data Engineering & Ingestion

Pipeline Development: Build and operate scalable ETL/ELT pipelines using Python and SQL to ingest data from EHRs, REST APIs, and flat files.
Orchestration: Use Apache Airflow to schedule jobs, manage dependencies, and implement robust retry/alerting logic.
API Integration: Develop Python-based ingestion frameworks that handle OAuth, pagination, and throttling for third-party healthcare data partners.

3. Healthcare Interoperability & Compliance

Standards: Map complex clinical data to HL7 FHIR resources and curated analytic layers.
Security: Implement "Privacy by Design" by enforcing HIPAA safeguards, including encryption at rest, access controls, and PII/PHI de-identification.

4. Operational Excellence

CI/CD: Use GitHub and automated pipelines to deploy database changes and data code.
Observability: Implement data quality tests (using tools like dbt or custom Python/SQL checks) to monitor freshness and accuracy.

What You’ll Bring

Experience: 5–7 years of professional data engineering experience, with a heavy emphasis on backend database development.
The SQL Expert Toolkit:
Expert SQL: Window functions, CTEs, recursive queries, and set-based transformations.
DB Internals: Deep knowledge of MSSQL, PostgreSQL, or Oracle. You should understand how the engine stores and retrieves data.
Optimization: Proven track record of turning "slow" queries into high-performance assets via indexing and refactoring.
The Engineering Toolkit:
Python: Intermediate to advanced (Pandas/Polars, Requests, SQLAlchemy, or PySpark).
Orchestration: Practical experience with Airflow (or Prefect/Dagster).
Legacy/Cloud mix: Proficiency in SSIS/SSMA or PowerShell is a plus for migrating legacy workloads to modern platforms.
The Domain Knowledge: Familiarity with FHIR/HL7 and an understanding of the importance of data governance in a regulated environment.

Technical "Must-Haves" for the Interview

Ability to whiteboard a complex Database Schema from scratch.
Ability to debug a long-running SQL query and explain the IO/CPU trade-offs of different index types.
Experience handling JSON/BSON data types within a relational database context.

Nice to Have

Experience with NoSQL systems like MongoDB or Elasticsearch.
Cloud experience (Azure, AWS, or GCP) specifically regarding managed SQL services.
Knowledge of dbt (data build tool) for managing transformations in the warehouse.

TC Migration Architect

at Ltts

Agency job

via Qntm Logic LLC by rahul batta

Bengaluru (Bangalore)

5 - 8 yrs

₹15L - ₹18L / yr

Migration

Verification and validation

SQL DB

CSV2TCXML

IPS

+7 more

· Strategy & Architecture: Collaborate with stakeholders to define end-to-end migration strategies, including data mapping, transformation, and validation rules.

· Technical Execution: Utilize tools like SQL DB, CSV2TCXML, IPS Upload, and ETL tools to migrate CAD and metadata.

· Customization: Develop custom migration solutions using BMIDE (Business Modeler IDE), ITK (Integration Toolkit), and SOA (Service Oriented Architecture).

· Project Leadership: Break down projects into manageable work packages, leading both onsite and offshore teams.

· Validation & Quality: Perform validation checks to ensure data integrity and accuracy post-migration.

· Integration Support: Manage CAD integrations (NX, Inventor, Creo) and PLM integrations (T4S, T4O, T4EA).

· Strategy & Architecture: Collaborate with stakeholders to define end-to-end migration strategies, including data mapping, transformation, and validation rules.

· Technical Execution: Utilize tools like SQL DB, CSV2TCXML, IPS Upload, and ETL tools to migrate CAD and metadata.

· Customization: Develop custom migration solutions using BMIDE (Business Modeler IDE), ITK (Integration Toolkit), and SOA (Service Oriented Architecture).

· Project Leadership: Break down projects into manageable work packages, leading both onsite and offshore teams.

· Validation & Quality: Perform validation checks to ensure data integrity and accuracy post-migration.

· Integration Support: Manage CAD integrations (NX, Inventor, Creo) and PLM integrations (T4S, T4O, T4EA).

Lead/VP - Data Engineer

A UK-centred leader in global finance

Agency job

via Cutshort Lightning by Bisman Gill

Pune

12yrs+

Best in industry

Amazon Web Services (AWS)

Python

PySpark

ETL

Purpose of the role

To build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure.

Accountabilities

Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data.
Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures.
Development of processing and analysis algorithms fit for the intended data complexity and volumes.
Collaboration with data scientist to build and deploy machine learning models.

Vice President Expectations

To contribute or set strategy, drive requirements and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/ processes; deliver continuous improvements and escalate breaches of policies/procedures..
If managing a team, they define jobs and responsibilities, planning for the department’s future needs and operations, counselling employees on performance and contributing to employee pay decisions/changes. They may also lead a number of specialists to influence the operations of a department, in alignment with strategic as well as tactical priorities, while balancing short and long term goals and ensuring that budgets and schedules meet corporate requirements..
If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others..
OR for an individual contributor, they will be a subject matter expert within own discipline and will guide technical direction. They will lead collaborative, multi-year assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialisation to complete assignments. They will train, guide and coach less experienced specialists and provide information affecting long term profits, organisational risks and strategic decisions..
Advise key stakeholders, including functional leadership teams and senior management on functional and cross functional areas of impact and alignment.
Manage and mitigate risks through assessment, in support of the control and governance agenda.
Demonstrate leadership and accountability for managing risk and strengthening controls in relation to the work your team does.
Demonstrate comprehensive understanding of the organisation functions to contribute to achieving the goals of the business.
Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategies.
Create solutions based on sophisticated analytical thought comparing and selecting complex alternatives. In-depth analysis with interpretative thinking will be required to define problems and develop innovative solutions.
Adopt and include the outcomes of extensive research in problem solving processes.
Seek out, build and maintain trusting relationships and partnerships with internal and external stakeholders in order to accomplish key business objectives, using influencing and negotiating skills to achieve outcomes.

To be a successful Senior Data Engineer, you should have experience with:

Hands on experience to work with large scale data platforms & in development of cloud solutions in AWS data platform with proven track record in driving business success.
Strong understanding of AWS and distributed computing paradigms, ability to design and develop data ingestion programs to process large data sets in Batch mode using Glue, Lambda, S3, redshift and snowflake and data bricks.
Ability to develop data ingestion programs to ingest real-time data from LIVE sources using Apache Kafka, Spark Streaming and related technologies. Hands on programming experience in python and PY-Spark.
Understanding of Dev Ops Pipelines using Jenkins, GitLab & should be strong in data modelling and Data architecture concepts & well versed with Project management tools and Agile Methodology.
Sound knowledge of data governance principles and tools (alation/glue data quality, mesh), Capable of suggesting solution architecture for diverse technology applications.

Additional relevant skills given below are highly valued:

Experience working in financial services industry & working in various Settlements and Sub ledger functions like PNS, Stock Record and Settlements, PNL.
Knowledge in BPS, IMPACT & Gloss products from Broadridge & creating ML model using python, Spark & Java.

You may be assessed on key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking and digital and technology, as well as job-specific technical skills.

Purpose of the role

To build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure.

Accountabilities

Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data.
Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures.
Development of processing and analysis algorithms fit for the intended data complexity and volumes.
Collaboration with data scientist to build and deploy machine learning models.

Vice President Expectations

To contribute or set strategy, drive requirements and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/ processes; deliver continuous improvements and escalate breaches of policies/procedures..
If managing a team, they define jobs and responsibilities, planning for the department’s future needs and operations, counselling employees on performance and contributing to employee pay decisions/changes. They may also lead a number of specialists to influence the operations of a department, in alignment with strategic as well as tactical priorities, while balancing short and long term goals and ensuring that budgets and schedules meet corporate requirements..
If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others..
OR for an individual contributor, they will be a subject matter expert within own discipline and will guide technical direction. They will lead collaborative, multi-year assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialisation to complete assignments. They will train, guide and coach less experienced specialists and provide information affecting long term profits, organisational risks and strategic decisions..
Advise key stakeholders, including functional leadership teams and senior management on functional and cross functional areas of impact and alignment.
Manage and mitigate risks through assessment, in support of the control and governance agenda.
Demonstrate leadership and accountability for managing risk and strengthening controls in relation to the work your team does.
Demonstrate comprehensive understanding of the organisation functions to contribute to achieving the goals of the business.
Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategies.
Create solutions based on sophisticated analytical thought comparing and selecting complex alternatives. In-depth analysis with interpretative thinking will be required to define problems and develop innovative solutions.
Adopt and include the outcomes of extensive research in problem solving processes.
Seek out, build and maintain trusting relationships and partnerships with internal and external stakeholders in order to accomplish key business objectives, using influencing and negotiating skills to achieve outcomes.

To be a successful Senior Data Engineer, you should have experience with:

Hands on experience to work with large scale data platforms & in development of cloud solutions in AWS data platform with proven track record in driving business success.
Strong understanding of AWS and distributed computing paradigms, ability to design and develop data ingestion programs to process large data sets in Batch mode using Glue, Lambda, S3, redshift and snowflake and data bricks.
Ability to develop data ingestion programs to ingest real-time data from LIVE sources using Apache Kafka, Spark Streaming and related technologies. Hands on programming experience in python and PY-Spark.
Understanding of Dev Ops Pipelines using Jenkins, GitLab & should be strong in data modelling and Data architecture concepts & well versed with Project management tools and Agile Methodology.
Sound knowledge of data governance principles and tools (alation/glue data quality, mesh), Capable of suggesting solution architecture for diverse technology applications.

Additional relevant skills given below are highly valued:

Experience working in financial services industry & working in various Settlements and Sub ledger functions like PNS, Stock Record and Settlements, PNL.
Knowledge in BPS, IMPACT & Gloss products from Broadridge & creating ML model using python, Spark & Java.

Lead / Sr. Data Engineer (Architect & Engineering Owner)

at Mango Sciences

Posted by Supriya C

Remote only

6 - 12 yrs

₹15L - ₹30L / yr

Python

ETL

SQL

Database migration

Cloud transformation

+22 more

Lead / Sr. Data Engineer (Architect & Engineering Owner)

The Role

We are seeking a Lead Data Engineer who operates at the intersection of high-scale engineering and enterprise architecture. In this role, you will "own" our healthcare data platform end-to-end. You aren't just building pipelines; you are designing the blueprint for how clinical, claims, and sales data flow through our ecosystem. You will bridge the gap between legacy systems (MSSQL/Oracle) and modern cloud warehouses (Snowflake/Redshift/Databricks), ensuring our data is governed, HIPAA-compliant, and optimized for advanced analytics.

What You’ll Do

1. Architecture & Strategic Leadership

Design the Blueprint: Own the enterprise data architecture (Staging, Integration, Warehouse, and Semantic layers). Define the evolution from monolithic databases to scalable cloud-hosted analytics.
Modeling Mastery: Lead the design of complex Dimensional Models (Star/Snowflake) and implement advanced Slowly Changing Dimension (SCD) strategies to track historical clinical events.
Set the Standard: Establish coding, version control (GitHub), and CI/CD standards. Conduct design reviews and mentor a team of engineers to move from "task-takers" to "system-builders."

2. Advanced Data Engineering (Hands-on)

Modern ELT/ETL: Build and orchestrate production-grade pipelines using Python, Airflow, and dbt. Manage automated ingestion via Fivetran or custom-built frameworks for APIs and EHRs.
Multi-Engine Expertise: Operate seamlessly across PostgreSQL, MSSQL, and Oracle, while optimizing petabyte-scale cloud warehouses like Snowflake or Redshift.
Performance Tuning: Own query optimization. You should be the expert at using EXPLAIN/ANALYZE, partitioning, and indexing to reduce compute costs and latency.
Quality & Reconciliation: Design robust validation frameworks to ensure data integrity—essential for healthcare compliance and clinical trust.

3. Healthcare Interoperability & Governance

Data Standards: Map diverse datasets (EHR, API, Flat Files) to HL7 FHIR resources and curated analytic layers.
Privacy by Design: Embed HIPAA Security Rule safeguards (encryption, audit trails, and access controls) directly into the code and infrastructure.
Interoperability: Handle complex semi-structured data (JSON/XML) from third-party partners and EMR systems.

What You’ll Bring

Experience: 8–12+ years in Data Engineering/Architecture. You should have a track record of leading technical projects or mentoring teams.
The "Hybrid" Stack: * Expert SQL/PL-SQL: Deep experience with performance tuning in relational environments (Oracle/MSSQL).
Modern Tools: Practical experience with Snowflake/Redshift, dbt, and Airflow.
Programming: High proficiency in Python (Pandas, PySpark) or Java/Scala for custom ETL routines.
Architectural Depth: Clear understanding of SDLC, Agile (Scrum), and Data Modeling frameworks.
Healthcare Domain: Exposure to pharmaceutical or clinical data (Life Sciences, EMR, or Claims) is highly preferred.
Soft Skills: The ability to translate "clinical business needs" into "technical runbooks" and communicate effectively with stakeholders.

Nice to Have

AI/ML Integration: Experience supporting Data Science teams with feature extraction and model deployment (SageMaker/Azure ML).
Advanced Tooling: Familiarity with NoSQL (MongoDB), search engines (Elasticsearch), or niche ETL tools (Talend/Informatica) for migration purposes.
Cloud Infrastructure: Hands-on experience with AWS Glue, Lambda, or Azure Data Factory.

Lead / Sr. Data Engineer (Architect & Engineering Owner)

The Role

What You’ll Do

1. Architecture & Strategic Leadership

Design the Blueprint: Own the enterprise data architecture (Staging, Integration, Warehouse, and Semantic layers). Define the evolution from monolithic databases to scalable cloud-hosted analytics.
Modeling Mastery: Lead the design of complex Dimensional Models (Star/Snowflake) and implement advanced Slowly Changing Dimension (SCD) strategies to track historical clinical events.
Set the Standard: Establish coding, version control (GitHub), and CI/CD standards. Conduct design reviews and mentor a team of engineers to move from "task-takers" to "system-builders."

2. Advanced Data Engineering (Hands-on)

Modern ELT/ETL: Build and orchestrate production-grade pipelines using Python, Airflow, and dbt. Manage automated ingestion via Fivetran or custom-built frameworks for APIs and EHRs.
Multi-Engine Expertise: Operate seamlessly across PostgreSQL, MSSQL, and Oracle, while optimizing petabyte-scale cloud warehouses like Snowflake or Redshift.
Performance Tuning: Own query optimization. You should be the expert at using EXPLAIN/ANALYZE, partitioning, and indexing to reduce compute costs and latency.
Quality & Reconciliation: Design robust validation frameworks to ensure data integrity—essential for healthcare compliance and clinical trust.

3. Healthcare Interoperability & Governance

Data Standards: Map diverse datasets (EHR, API, Flat Files) to HL7 FHIR resources and curated analytic layers.
Privacy by Design: Embed HIPAA Security Rule safeguards (encryption, audit trails, and access controls) directly into the code and infrastructure.
Interoperability: Handle complex semi-structured data (JSON/XML) from third-party partners and EMR systems.

What You’ll Bring

Experience: 8–12+ years in Data Engineering/Architecture. You should have a track record of leading technical projects or mentoring teams.
The "Hybrid" Stack: * Expert SQL/PL-SQL: Deep experience with performance tuning in relational environments (Oracle/MSSQL).
Modern Tools: Practical experience with Snowflake/Redshift, dbt, and Airflow.
Programming: High proficiency in Python (Pandas, PySpark) or Java/Scala for custom ETL routines.
Architectural Depth: Clear understanding of SDLC, Agile (Scrum), and Data Modeling frameworks.
Healthcare Domain: Exposure to pharmaceutical or clinical data (Life Sciences, EMR, or Claims) is highly preferred.
Soft Skills: The ability to translate "clinical business needs" into "technical runbooks" and communicate effectively with stakeholders.

Nice to Have

AI/ML Integration: Experience supporting Data Science teams with feature extraction and model deployment (SageMaker/Azure ML).
Advanced Tooling: Familiarity with NoSQL (MongoDB), search engines (Elasticsearch), or niche ETL tools (Talend/Informatica) for migration purposes.
Cloud Infrastructure: Hands-on experience with AWS Glue, Lambda, or Azure Data Factory.

Power BI Engineer

A leading data & analytics intelligence technology solutions provider

Agency job

via HyrHub by Neha Koshy

Bengaluru (Bangalore)

4 - 5 yrs

₹12L - ₹18L / yr

PowerBI

Data modeling

DAX

SQL

ETL

+4 more

Key Skills:

Technical Skills

Power BI Development: 4-5 years of hands-on experience developing Power BI reports, dashboards, and data models
DAX: Strong proficiency in DAX (Data Analysis Expressions) for creating measures, calculated columns, and complex calculations
Power Query / M Language: Expertise in data transformation and ETL processes using Power Query
Data Modeling: Solid understanding of dimensional modeling, star schema, and data warehouse concepts
SQL: Proficient in SQL for data extraction, manipulation, and querying relational databases
Power BI Service: Experience with Power BI Service administration, workspace management, scheduled refreshes, and deployment pipelines
Custom Visualizations: Experience creating and configuring custom visuals, including use of AppSource visuals and custom visual development using Power BI Visuals SDK
API Integration: Hands-on experience with Power BI REST APIs for automating deployments, managing workspaces, and embedding reports
Knowledge of data visualization best practices and UI/UX principles for dashboard design
Experience with data source connectivity (SQL Server, Azure SQL, Oracle, SAP, Excel, APIs, web services)

Additional Required Qualifications

Bachelor’s degree in computer science, Information Systems, Business Analytics, or related field
Strong analytical and problem-solving abilities
Excellent communication skills to work with both technical and non-technical stakeholders
Ability to manage multiple projects and prioritize tasks effectively
Detail-oriented with commitment to delivering high-quality work
Client-facing experience with ability to gather requirements and present solutions

Preferred Qualifications

Microsoft Power BI certification (PL-300 or equivalent)
Experience with Azure ecosystem (Azure Data Factory, Azure Synapse Analytics, Azure SQL Database)
Knowledge of other Microsoft BI tools (SSRS, SSAS, Excel Power Pivot)
Familiarity with Python or R for advanced analytics integration
Experience with Dataflows and incremental refresh strategies
Understanding of API development for custom visuals or Power BI embedded solutions
Experience working in Agile/Scrum development environments

Key Skills:

Technical Skills

Power BI Development: 4-5 years of hands-on experience developing Power BI reports, dashboards, and data models
DAX: Strong proficiency in DAX (Data Analysis Expressions) for creating measures, calculated columns, and complex calculations
Power Query / M Language: Expertise in data transformation and ETL processes using Power Query
Data Modeling: Solid understanding of dimensional modeling, star schema, and data warehouse concepts
SQL: Proficient in SQL for data extraction, manipulation, and querying relational databases
Power BI Service: Experience with Power BI Service administration, workspace management, scheduled refreshes, and deployment pipelines
Custom Visualizations: Experience creating and configuring custom visuals, including use of AppSource visuals and custom visual development using Power BI Visuals SDK
API Integration: Hands-on experience with Power BI REST APIs for automating deployments, managing workspaces, and embedding reports
Knowledge of data visualization best practices and UI/UX principles for dashboard design
Experience with data source connectivity (SQL Server, Azure SQL, Oracle, SAP, Excel, APIs, web services)

Additional Required Qualifications

Bachelor’s degree in computer science, Information Systems, Business Analytics, or related field
Strong analytical and problem-solving abilities
Excellent communication skills to work with both technical and non-technical stakeholders
Ability to manage multiple projects and prioritize tasks effectively
Detail-oriented with commitment to delivering high-quality work
Client-facing experience with ability to gather requirements and present solutions

Preferred Qualifications

Microsoft Power BI certification (PL-300 or equivalent)
Experience with Azure ecosystem (Azure Data Factory, Azure Synapse Analytics, Azure SQL Database)
Knowledge of other Microsoft BI tools (SSRS, SSAS, Excel Power Pivot)
Familiarity with Python or R for advanced analytics integration
Experience with Dataflows and incremental refresh strategies
Understanding of API development for custom visuals or Power BI embedded solutions
Experience working in Agile/Scrum development environments

Snowflake DBT

at Wissen Technology

4 recruiters

Posted by Shrutika SaileshKumar

Bengaluru (Bangalore), Mumbai

4 - 8 yrs

Best in industry

Snowflake

Data Transformation Tool (DBT)

SQL

Snow flake schema

Python

+1 more

JD -

We are looking for a strong Data Engineer having hands on experience in building pipelines using Snowflake and DBT.

Key Responsibilities:

Develop, maintain, and optimize data pipelines using DBT and SQL on Snowflake DB.
Collaborate with data analysts, QA and business teams to build scalable data models.
Implement data transformations, testing, and documentation within the DBT framework.
Work on Snowflake for data warehousing tasks, including data ingestion, query optimization, and performance tuning.
Use Python (preferred) for automation, scripting, and additional data processing as needed.

Required Skills:

6+ years of experience in building data engineering pipelines.
Strong hands-on expertise with DBT and advanced SQL.
Experience working with modern columnar/MPP data warehouses, preferably Snowflake.
Knowledge of Python for data manipulation and workflow automation (preferred).
Good understanding of data modeling concepts, ETL/ELT processes, and best practice.

JD -

We are looking for a strong Data Engineer having hands on experience in building pipelines using Snowflake and DBT.

Key Responsibilities:

Develop, maintain, and optimize data pipelines using DBT and SQL on Snowflake DB.
Collaborate with data analysts, QA and business teams to build scalable data models.
Implement data transformations, testing, and documentation within the DBT framework.
Work on Snowflake for data warehousing tasks, including data ingestion, query optimization, and performance tuning.
Use Python (preferred) for automation, scripting, and additional data processing as needed.

Required Skills:

6+ years of experience in building data engineering pipelines.
Strong hands-on expertise with DBT and advanced SQL.
Experience working with modern columnar/MPP data warehouses, preferably Snowflake.
Knowledge of Python for data manipulation and workflow automation (preferred).
Good understanding of data modeling concepts, ETL/ELT processes, and best practice.

Data Engineer

at Credilio Financial Technologies Pvt. Ltd.

3 recruiters

Posted by Yusuf Qureshi

Mumbai

3 - 6 yrs

Best in industry

Python

Data engineering

Apache Airflow

ETL

CI/CD

+2 more

Job Role: Data Engineer

Location: Mumbai, Andheri.

WFO(Monday-Friday)

Looking for Immediate, 15 days joiner.

Required Skills

• 3+ years of hands-on experience as a Data Engineer.

• Strong proficiency in Python and PySpark programming for data engineering tasks.

• In-depth knowledge of ETL processes and data pipeline architectures. • Experience with Airflow or Step Functions for orchestration and scheduling.

• Solid experience working with AWS services.

• Proficiency in building and maintaining CI/CD pipelines

. • Experience with data modelling, database design, and querying.

• Strong problem-solving skills, with the ability to troubleshoot and optimize complex data pipelines.

Bonus Skills

• Knowledge of cloud platforms (e.g., AWS, GCP).

• Experience with containerization and Kubernetes.

• Experience in data security, encryption, and compliance best practices.

• Strong communication and collaboration skills.

Responsibilities

• Design, build, and maintain scalable and efficient ETL/ELT pipelines to process large-scale datasets.

• Implement and manage workflow automation and orchestration using Apache Airflow or Step Functions.

• Build and optimize data infrastructure using AWS services.

• Integrate and manage data warehouses.

• Design and develop dynamic and interactive dashboards using PowerBI, Metabase, and Superset, to present insights.

• Develop and maintain CI/CD pipelines for seamless deployment and continuous integration of data solutions.

• Monitor and troubleshoot data pipeline issues, ensuring data quality and consistency.

• Leverage best practices for data governance, security, and compliance in cloud environments.

• Collaborate with data scientists, analysts, and other stakeholders to understand data needs and deliver reliable data solutions.

Job Role: Data Engineer

Location: Mumbai, Andheri.

WFO(Monday-Friday)

Looking for Immediate, 15 days joiner.

Required Skills

• 3+ years of hands-on experience as a Data Engineer.

• Strong proficiency in Python and PySpark programming for data engineering tasks.

• In-depth knowledge of ETL processes and data pipeline architectures. • Experience with Airflow or Step Functions for orchestration and scheduling.

• Solid experience working with AWS services.

• Proficiency in building and maintaining CI/CD pipelines

. • Experience with data modelling, database design, and querying.

• Strong problem-solving skills, with the ability to troubleshoot and optimize complex data pipelines.

Bonus Skills

• Knowledge of cloud platforms (e.g., AWS, GCP).

• Experience with containerization and Kubernetes.

• Experience in data security, encryption, and compliance best practices.

• Strong communication and collaboration skills.

Responsibilities

• Design, build, and maintain scalable and efficient ETL/ELT pipelines to process large-scale datasets.

• Implement and manage workflow automation and orchestration using Apache Airflow or Step Functions.

• Build and optimize data infrastructure using AWS services.

• Integrate and manage data warehouses.

• Design and develop dynamic and interactive dashboards using PowerBI, Metabase, and Superset, to present insights.

• Develop and maintain CI/CD pipelines for seamless deployment and continuous integration of data solutions.

• Monitor and troubleshoot data pipeline issues, ensuring data quality and consistency.

• Leverage best practices for data governance, security, and compliance in cloud environments.

• Collaborate with data scientists, analysts, and other stakeholders to understand data needs and deliver reliable data solutions.

Integration Engineer- Boomi

at Oddr

Posted by Deepika Madgunki

Remote only

2 - 6 yrs

₹1L - ₹18L / yr

ETL

API

Microsoft Windows Azure

Integration

BOOMI

+2 more

Job Title: Integration Engineer

Integration Engineers are responsible for defining, developing, delivering, maintaining and supporting end-to-end Enterprise Integration solutions. Using a designated IPaaS solution (e.g. Boomi), Integration Engineers integrate multiple cloud and on-premise applications which help customers publish and consume data between Oddr and third party systems for a variety of tasks.

Job Summary:

We are seeking a skilled and experienced Integration Engineer to join our Technology team in India. The ideal candidate will have a strong background in implementing low-code/no-code integration platforms as a service (iPaaS), with a preference for experience in Boomi. The role requires an in-depth understanding of SQL and RESTful APIs. Experience with Intapp's Integration Builder is a significant plus.

Key Responsibilities:

- Design and implement integration solutions using iPaaS tools.

- Collaborate with customers, product, engineering and business stakeholders to translate business requirements into robust and scalable integration processes.

- Develop and maintain SQL queries and scripts to facilitate data manipulation and integration.

- Utilize RESTful API design and consumption to ensure seamless data flow between various systems and applications.

- Lead the configuration, deployment, and ongoing management of integration projects.

- Troubleshoot and resolve technical issues related to integration solutions.

- Document integration processes and create user guides for internal and external users.

- Stay current with the latest developments in iPaaS technologies and best practices.

Qualifications:

- Bachelor’s degree in Computer Science, Information Technology, or a related field.

- Minimum of 2 years’ experience in an integration engineering role with hands-on experience in an iPaaS tool, preferably Boomi.

- Proficiency in SQL and experience with database management and data integration patterns.

- Strong understanding of integration patterns and solutions, API design, and cloud-based technologies.

- Good understanding of RESTful APIs and integration.

- Excellent problem-solving and analytical skills.

- Strong communication and interpersonal skills, with the ability to work effectively in a team environment.

- Experience with various integration protocols (REST, SOAP, FTP, etc.) and data formats (JSON, XML, etc.).

Preferred Skills:

- Boomi (or other iPaaS) certifications

- Experience with Intapp's Integration Builder is highly desirable but not mandatory.

- SQL Knowledge is important

- Experience in building E2E integrations and communicating with stakeholders

- Knowledge of Azure Functions, LogicApps, And other Azure Services is highly desirable

What we offer:

- Competitive salary and benefits package.

- Dynamic and innovative work environment.

- Opportunities for professional growth and advancement.