Apache Spark Jobs in Delhi, NCR and Gurgaon

17+ Apache Spark Jobs in Delhi, NCR and Gurgaon | Apache Spark Job openings in Delhi, NCR and Gurgaon

Apply to 17+ Apache Spark Jobs in Delhi, NCR and Gurgaon on CutShort.io. Explore the latest Apache Spark Job opportunities across top companies like Google, Amazon & Adobe.

Data Engineer – Databricks

at NeoGenCode Technologies Pvt Ltd

2 candid answers

Posted by Akshay Patil

Noida, Bengaluru (Bangalore), Pune, Hyderabad, Chennai

6 - 8 yrs

₹6L - ₹12L / yr

Data engineering

databricks

Snow flake schema

Python

Apache Spark

+8 more

Job Title : Data Engineer – Databricks

Experience : 6+ Years

Location : Noida / Hyderabad / Chennai / Pune / Bengaluru (Hybrid)

Shift : IST (Normal Shift)

Job Summary :

We are seeking an experienced Data Engineer with strong expertise in Databricks, Snowflake, Python, and Spark to build and optimize scalable data pipelines and support AI/ML model deployments. The ideal candidate should have experience working with cloud-based data platforms and preferably possess exposure to the Healthcare domain.

Required Skills :

Databricks (Preferred)
Snowflake
Python
Apache Spark
SQL
Azure Cloud
Kubernetes
Apache Airflow
GitHub & CI/CD Pipelines
AI/ML Model Deployment
Data Analytics

Preferred :

Experience in the Healthcare domain.
Strong understanding of scalable data engineering architectures and best practices.

Job Title : Data Engineer – Databricks

Experience : 6+ Years

Location : Noida / Hyderabad / Chennai / Pune / Bengaluru (Hybrid)

Shift : IST (Normal Shift)

Job Summary :

Required Skills :

Databricks (Preferred)
Snowflake
Python
Apache Spark
SQL
Azure Cloud
Kubernetes
Apache Airflow
GitHub & CI/CD Pipelines
AI/ML Model Deployment
Data Analytics

Preferred :

Experience in the Healthcare domain.
Strong understanding of scalable data engineering architectures and best practices.

MLOps Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

7 - 12 yrs

₹40L - ₹80L / yr

Machine Learning (ML)

Apache Spark

Apache Airflow

Python

Amazon Web Services (AWS)

+23 more

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

We are looking for a Senior MLOps Engineer with 8+ years of experience building and managing production-grade ML platforms and pipelines. The ideal candidate will have strong expertise across AWS, Airflow/MWAA, Apache Spark, Kubernetes (EKS), and automation of ML lifecycle workflows. You will work closely with data science, data engineering, and platform teams to operationalize and scale ML models in production.

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

MLOps Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

8 - 12 yrs

₹60L - ₹80L / yr

Apache Airflow

Apache Spark

AWS CloudFormation

DevOps

MLOps

+19 more

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

MLOps Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

8 - 12 yrs

₹60L - ₹80L / yr

DevOps

Apache Spark

Apache Airflow

Machine Learning (ML)

Pipeline management

+13 more

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

DevSecOps Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

8 - 12 yrs

₹50L - ₹75L / yr

Ansible

Terraform

Amazon Web Services (AWS)

Platform as a Service (PaaS)

CI/CD

+30 more

ROLE & RESPONSIBILITIES:

We are hiring a Senior DevSecOps / Security Engineer with 8+ years of experience securing AWS cloud, on-prem infrastructure, DevOps platforms, MLOps environments, CI/CD pipelines, container orchestration, and data/ML platforms. This role is responsible for creating and maintaining a unified security posture across all systems used by DevOps and MLOps teams — including AWS, Kubernetes, EMR, MWAA, Spark, Docker, GitOps, observability tools, and network infrastructure.

KEY RESPONSIBILITIES:

1. Cloud Security (AWS)-

Secure all AWS resources consumed by DevOps/MLOps/Data Science: EC2, EKS, ECS, EMR, MWAA, S3, RDS, Redshift, Lambda, CloudFront, Glue, Athena, Kinesis, Transit Gateway, VPC Peering.
Implement IAM least privilege, SCPs, KMS, Secrets Manager, SSO & identity governance.
Configure AWS-native security: WAF, Shield, GuardDuty, Inspector, Macie, CloudTrail, Config, Security Hub.
Harden VPC architecture, subnets, routing, SG/NACLs, multi-account environments.
Ensure encryption of data at rest/in transit across all cloud services.

2. DevOps Security (IaC, CI/CD, Kubernetes, Linux)-

Infrastructure as Code & Automation Security:

Secure Terraform, CloudFormation, Ansible with policy-as-code (OPA, Checkov, tfsec).
Enforce misconfiguration scanning and automated remediation.

CI/CD Security:

Secure Jenkins, GitHub, GitLab pipelines with SAST, DAST, SCA, secrets scanning, image scanning.
Implement secure build, artifact signing, and deployment workflows.

Containers & Kubernetes:

Harden Docker images, private registries, runtime policies.
Enforce EKS security: RBAC, IRSA, PSP/PSS, network policies, runtime monitoring.
Apply CIS Benchmarks for Kubernetes and Linux.

Monitoring & Reliability:

Secure observability stack: Grafana, CloudWatch, logging, alerting, anomaly detection.
Ensure audit logging across cloud/platform layers.

3. MLOps Security (Airflow, EMR, Spark, Data Platforms, ML Pipelines)-

Pipeline & Workflow Security:

Secure Airflow/MWAA connections, secrets, DAGs, execution environments.
Harden EMR, Spark jobs, Glue jobs, IAM roles, S3 buckets, encryption, and access policies.

ML Platform Security:

Secure Jupyter/JupyterHub environments, containerized ML workspaces, and experiment tracking systems.
Control model access, artifact protection, model registry security, and ML metadata integrity.

Data Security:

Secure ETL/ML data flows across S3, Redshift, RDS, Glue, Kinesis.
Enforce data versioning security, lineage tracking, PII protection, and access governance.

ML Observability:

Implement drift detection (data drift/model drift), feature monitoring, audit logging.
Integrate ML monitoring with Grafana/Prometheus/CloudWatch.

4. Network & Endpoint Security-

Manage firewall policies, VPN, IDS/IPS, endpoint protection, secure LAN/WAN, Zero Trust principles.
Conduct vulnerability assessments, penetration test coordination, and network segmentation.
Secure remote workforce connectivity and internal office networks.

5. Threat Detection, Incident Response & Compliance-

Centralize log management (CloudWatch, OpenSearch/ELK, SIEM).
Build security alerts, automated threat detection, and incident workflows.
Lead incident containment, forensics, RCA, and remediation.
Ensure compliance with ISO 27001, SOC 2, GDPR, HIPAA (as applicable).
Maintain security policies, procedures, RRPs (Runbooks), and audits.

IDEAL CANDIDATE:

8+ years in DevSecOps, Cloud Security, Platform Security, or equivalent.
Proven ability securing AWS cloud ecosystems (IAM, EKS, EMR, MWAA, VPC, WAF, GuardDuty, KMS, Inspector, Macie).
Strong hands-on experience with Docker, Kubernetes (EKS), CI/CD tools, and Infrastructure-as-Code.
Experience securing ML platforms, data pipelines, and MLOps systems (Airflow/MWAA, Spark/EMR).
Strong Linux security (CIS hardening, auditing, intrusion detection).
Proficiency in Python, Bash, and automation/scripting.
Excellent knowledge of SIEM, observability, threat detection, monitoring systems.
Understanding of microservices, API security, serverless security.
Strong understanding of vulnerability management, penetration testing practices, and remediation plans.

EDUCATION:

Master’s degree in Cybersecurity, Computer Science, Information Technology, or related field.
Relevant certifications (AWS Security Specialty, CISSP, CEH, CKA/CKS) are a plus.

PERKS, BENEFITS AND WORK CULTURE:

Competitive Salary Package
Generous Leave Policy
Flexible Working Hours
Performance-Based Bonuses
Health Care Benefits

ROLE & RESPONSIBILITIES:

KEY RESPONSIBILITIES:

1. Cloud Security (AWS)-

Secure all AWS resources consumed by DevOps/MLOps/Data Science: EC2, EKS, ECS, EMR, MWAA, S3, RDS, Redshift, Lambda, CloudFront, Glue, Athena, Kinesis, Transit Gateway, VPC Peering.
Implement IAM least privilege, SCPs, KMS, Secrets Manager, SSO & identity governance.
Configure AWS-native security: WAF, Shield, GuardDuty, Inspector, Macie, CloudTrail, Config, Security Hub.
Harden VPC architecture, subnets, routing, SG/NACLs, multi-account environments.
Ensure encryption of data at rest/in transit across all cloud services.

2. DevOps Security (IaC, CI/CD, Kubernetes, Linux)-

Infrastructure as Code & Automation Security:

Secure Terraform, CloudFormation, Ansible with policy-as-code (OPA, Checkov, tfsec).
Enforce misconfiguration scanning and automated remediation.

CI/CD Security:

Secure Jenkins, GitHub, GitLab pipelines with SAST, DAST, SCA, secrets scanning, image scanning.
Implement secure build, artifact signing, and deployment workflows.

Containers & Kubernetes:

Harden Docker images, private registries, runtime policies.
Enforce EKS security: RBAC, IRSA, PSP/PSS, network policies, runtime monitoring.
Apply CIS Benchmarks for Kubernetes and Linux.

Monitoring & Reliability:

Secure observability stack: Grafana, CloudWatch, logging, alerting, anomaly detection.
Ensure audit logging across cloud/platform layers.

3. MLOps Security (Airflow, EMR, Spark, Data Platforms, ML Pipelines)-

Pipeline & Workflow Security:

Secure Airflow/MWAA connections, secrets, DAGs, execution environments.
Harden EMR, Spark jobs, Glue jobs, IAM roles, S3 buckets, encryption, and access policies.

ML Platform Security:

Secure Jupyter/JupyterHub environments, containerized ML workspaces, and experiment tracking systems.
Control model access, artifact protection, model registry security, and ML metadata integrity.

Data Security:

Secure ETL/ML data flows across S3, Redshift, RDS, Glue, Kinesis.
Enforce data versioning security, lineage tracking, PII protection, and access governance.

ML Observability:

Implement drift detection (data drift/model drift), feature monitoring, audit logging.
Integrate ML monitoring with Grafana/Prometheus/CloudWatch.

4. Network & Endpoint Security-

Manage firewall policies, VPN, IDS/IPS, endpoint protection, secure LAN/WAN, Zero Trust principles.
Conduct vulnerability assessments, penetration test coordination, and network segmentation.
Secure remote workforce connectivity and internal office networks.

5. Threat Detection, Incident Response & Compliance-

Centralize log management (CloudWatch, OpenSearch/ELK, SIEM).
Build security alerts, automated threat detection, and incident workflows.
Lead incident containment, forensics, RCA, and remediation.
Ensure compliance with ISO 27001, SOC 2, GDPR, HIPAA (as applicable).
Maintain security policies, procedures, RRPs (Runbooks), and audits.

IDEAL CANDIDATE:

8+ years in DevSecOps, Cloud Security, Platform Security, or equivalent.
Proven ability securing AWS cloud ecosystems (IAM, EKS, EMR, MWAA, VPC, WAF, GuardDuty, KMS, Inspector, Macie).
Strong hands-on experience with Docker, Kubernetes (EKS), CI/CD tools, and Infrastructure-as-Code.
Experience securing ML platforms, data pipelines, and MLOps systems (Airflow/MWAA, Spark/EMR).
Strong Linux security (CIS hardening, auditing, intrusion detection).
Proficiency in Python, Bash, and automation/scripting.
Excellent knowledge of SIEM, observability, threat detection, monitoring systems.
Understanding of microservices, API security, serverless security.
Strong understanding of vulnerability management, penetration testing practices, and remediation plans.

EDUCATION:

Master’s degree in Cybersecurity, Computer Science, Information Technology, or related field.
Relevant certifications (AWS Security Specialty, CISSP, CEH, CKA/CKS) are a plus.

PERKS, BENEFITS AND WORK CULTURE:

Competitive Salary Package
Generous Leave Policy
Flexible Working Hours
Performance-Based Bonuses
Health Care Benefits

MLOps Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

8 - 12 yrs

₹60L - ₹80L / yr

Apache Airflow

Apache Spark

AWS CloudFormation

MLOps

DevOps

+23 more

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

Review Criteria:

Strong MLOps profile
8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments
4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production
4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation
Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch
Must have hands-on Python for pipeline & automation development
4+ years of experience in AWS cloud, with recent companies
(Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Preferred:

Hands-on in Docker deployments for ML workflows on EKS / ECS
Experience with ML observability (data drift / model drift / performance monitoring / alerting) using CloudWatch / Grafana / Prometheus / OpenSearch.
Experience with CI / CD / CT using GitHub Actions / Jenkins.
Experience with JupyterHub/Notebooks, Linux, scripting, and metadata tracking for ML lifecycle.
Understanding of ML frameworks (TensorFlow / PyTorch) for deployment scenarios.

Job Specific Criteria:

CV Attachment is mandatory
Please provide CTC Breakup (Fixed + Variable)?
Are you okay for F2F round?
Have candidate filled the google form?

Role & Responsibilities:

Key Responsibilities:

Design and manage cloud-native ML platforms supporting training, inference, and model lifecycle automation.
Build ML/ETL pipelines using Apache Airflow / AWS MWAA and distributed data workflows using Apache Spark (EMR/Glue).
Containerize and deploy ML workloads using Docker, EKS, ECS/Fargate, and Lambda.
Develop CI/CT/CD pipelines integrating model validation, automated training, testing, and deployment.
Implement ML observability: model drift, data drift, performance monitoring, and alerting using CloudWatch, Grafana, Prometheus.
Ensure data governance, versioning, metadata tracking, reproducibility, and secure data pipelines.
Collaborate with data scientists to productionize notebooks, experiments, and model deployments.

Ideal Candidate:

8+ years in MLOps/DevOps with strong ML pipeline experience.
Strong hands-on experience with AWS:
Compute/Orchestration: EKS, ECS, EC2, Lambda
Data: EMR, Glue, S3, Redshift, RDS, Athena, Kinesis
Workflow: MWAA/Airflow, Step Functions
Monitoring: CloudWatch, OpenSearch, Grafana
Strong Python skills and familiarity with ML frameworks (TensorFlow/PyTorch/Scikit-learn).
Expertise with Docker, Kubernetes, Git, CI/CD tools (GitHub Actions/Jenkins).
Strong Linux, scripting, and troubleshooting skills.
Experience enabling reproducible ML environments using Jupyter Hub and containerized development workflows.

Education:

Master’s degree in computer science, Machine Learning, Data Engineering, or related field.

MLOps Engineer top b2c product company only

at Talent Pro

Posted by Mayank choudhary

Noida

8 - 12 yrs

₹60L - ₹85L / yr

MLOps

Apache Spark

Apache Airflow

Tier 1 college only

Mandatory (Experience 1) - Must have 8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments

Mandatory (Experience 2) - Must have 4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production

Mandatory (Experience 3) - Must have 4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation

Mandatory (Experience 4) - Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch

Mandatory (Experience 5) - Must have hands-on Python for pipeline & automation development

Mandatory (Experience 6) - Must have 4+ years of experience in AWS cloud, with recent companies

Mandatory (Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Mandatory (Experience 1) - Must have 8+ years of DevOps experience and 4+ years in MLOps / ML pipeline automation and production deployments

Mandatory (Experience 2) - Must have 4+ years hands-on experience in Apache Airflow / MWAA managing workflow orchestration in production

Mandatory (Experience 3) - Must have 4+ years hands-on experience in Apache Spark (EMR / Glue / managed or self-hosted) for distributed computation

Mandatory (Experience 4) - Must have strong hands-on experience across key AWS services including EKS/ECS/Fargate, Lambda, Kinesis, Athena/Redshift, S3, and CloudWatch

Mandatory (Experience 5) - Must have hands-on Python for pipeline & automation development

Mandatory (Experience 6) - Must have 4+ years of experience in AWS cloud, with recent companies

Mandatory (Company) - Product companies preferred; Exception for service company candidates with strong MLOps + AWS depth

Sr. Devops Engineer

AdTech Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

8 - 12 yrs

₹30L - ₹40L / yr

DevOps

Docker

CI/CD

Amazon Web Services (AWS)

AWS CloudFormation

+43 more

REVIEW CRITERIA:

MANDATORY:

Strong Senior/Lead DevOps Engineer Profile
Must have 8+ years of hands-on experience in DevOps engineering, with a strong focus on AWS cloud infrastructure and services (EC2, VPC, EKS, RDS, Lambda, CloudFront, etc.).
Must have strong system administration expertise (installation, tuning, troubleshooting, security hardening)
Must have solid experience in CI/CD pipeline setup and automation using tools such as Jenkins, GitHub Actions, or similar
Must have hands-on experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible
Must have strong database expertise across MongoDB and Snowflake (administration, performance optimization, integrations)
Must have experience with monitoring and observability tools such as Prometheus, Grafana, ELK, CloudWatch, or Datadog
Must have good exposure to containerization and orchestration using Docker and Kubernetes (EKS)
Must be currently working in an AWS-based environment (AWS experience must be in the current organization)
Its an IC role

PREFERRED:

Must be proficient in scripting languages (Bash, Python) for automation and operational tasks.
Must have strong understanding of security best practices, IAM, WAF, and GuardDuty configurations.
Exposure to DevSecOps and end-to-end automation of deployments, provisioning, and monitoring.
Bachelor’s or Master’s degree in Computer Science, Information Technology, or related field.
Candidates from NCR region only (No outstation candidates).

ROLES AND RESPONSIBILITIES:

We are seeking a highly skilled Senior DevOps Engineer with 8+ years of hands-on experience in designing, automating, and optimizing cloud-native solutions on AWS. AWS and Linux expertise are mandatory. The ideal candidate will have strong experience across databases, automation, CI/CD, containers, and observability, with the ability to build and scale secure, reliable cloud environments.

KEY RESPONSIBILITIES:

Cloud & Infrastructure as Code (IaC)-

Architect and manage AWS environments ensuring scalability, security, and high availability.
Implement infrastructure automation using Terraform, CloudFormation, and Ansible.
Configure VPC Peering, Transit Gateway, and PrivateLink/Connect for advanced networking.

CI/CD & Automation:

Build and maintain CI/CD pipelines (Jenkins, GitHub, SonarQube, automated testing).
Automate deployments, provisioning, and monitoring across environments.

Containers & Orchestration:

Deploy and operate workloads on Docker and Kubernetes (EKS).
Implement IAM Roles for Service Accounts (IRSA) for secure pod-level access.
Optimize performance of containerized and microservices applications.

Monitoring & Reliability:

Implement observability with Prometheus, Grafana, ELK, CloudWatch, M/Monit, and Datadog.
Establish logging, alerting, and proactive monitoring for high availability.

Security & Compliance:

Apply AWS security best practices including IAM, IRSA, SSO, and role-based access control.
Manage WAF, Guard Duty, Inspector, and other AWS-native security tools.
Configure VPNs, firewalls, and secure access policies and AWS organizations.

Databases & Analytics:

Must have expertise in MongoDB, Snowflake, Aerospike, RDS, PostgreSQL, MySQL/MariaDB, and other RDBMS.
Manage data reliability, performance tuning, and cloud-native integrations.
Experience with Apache Airflow and Spark.

IDEAL CANDIDATE:

8+ years in DevOps engineering, with strong AWS Cloud expertise (EC2, VPC, TG, RDS, S3, IAM, EKS, EMR, SCP, MWAA, Lambda, CloudFront, SNS, SES etc.).
Linux expertise is mandatory (system administration, tuning, troubleshooting, CIS hardening etc).
Strong knowledge of databases: MongoDB, Snowflake, Aerospike, RDS, PostgreSQL, MySQL/MariaDB, and other RDBMS.
Hands-on with Docker, Kubernetes (EKS), Terraform, CloudFormation, Ansible.
Proven ability with CI/CD pipeline automation and DevSecOps practices.
Practical experience with VPC Peering, Transit Gateway, WAF, Guard Duty, Inspector and advanced AWS networking and security tools.
Expertise in observability tools: Prometheus, Grafana, ELK, CloudWatch, M/Monit, and Datadog.
Strong scripting skills (Shell/bash, Python, or similar) for automation.
Bachelor / Master’s degree
Effective communication skills

PERKS, BENEFITS AND WORK CULTURE:

Competitive Salary Package
Generous Leave Policy
Flexible Working Hours
Performance-Based Bonuses
Health Care Benefits

REVIEW CRITERIA:

MANDATORY:

Strong Senior/Lead DevOps Engineer Profile
Must have 8+ years of hands-on experience in DevOps engineering, with a strong focus on AWS cloud infrastructure and services (EC2, VPC, EKS, RDS, Lambda, CloudFront, etc.).
Must have strong system administration expertise (installation, tuning, troubleshooting, security hardening)
Must have solid experience in CI/CD pipeline setup and automation using tools such as Jenkins, GitHub Actions, or similar
Must have hands-on experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible
Must have strong database expertise across MongoDB and Snowflake (administration, performance optimization, integrations)
Must have experience with monitoring and observability tools such as Prometheus, Grafana, ELK, CloudWatch, or Datadog
Must have good exposure to containerization and orchestration using Docker and Kubernetes (EKS)
Must be currently working in an AWS-based environment (AWS experience must be in the current organization)
Its an IC role

PREFERRED:

Must be proficient in scripting languages (Bash, Python) for automation and operational tasks.
Must have strong understanding of security best practices, IAM, WAF, and GuardDuty configurations.
Exposure to DevSecOps and end-to-end automation of deployments, provisioning, and monitoring.
Bachelor’s or Master’s degree in Computer Science, Information Technology, or related field.
Candidates from NCR region only (No outstation candidates).

ROLES AND RESPONSIBILITIES:

KEY RESPONSIBILITIES:

Cloud & Infrastructure as Code (IaC)-

Architect and manage AWS environments ensuring scalability, security, and high availability.
Implement infrastructure automation using Terraform, CloudFormation, and Ansible.
Configure VPC Peering, Transit Gateway, and PrivateLink/Connect for advanced networking.

CI/CD & Automation:

Build and maintain CI/CD pipelines (Jenkins, GitHub, SonarQube, automated testing).
Automate deployments, provisioning, and monitoring across environments.

Containers & Orchestration:

Deploy and operate workloads on Docker and Kubernetes (EKS).
Implement IAM Roles for Service Accounts (IRSA) for secure pod-level access.
Optimize performance of containerized and microservices applications.

Monitoring & Reliability:

Implement observability with Prometheus, Grafana, ELK, CloudWatch, M/Monit, and Datadog.
Establish logging, alerting, and proactive monitoring for high availability.

Security & Compliance:

Apply AWS security best practices including IAM, IRSA, SSO, and role-based access control.
Manage WAF, Guard Duty, Inspector, and other AWS-native security tools.
Configure VPNs, firewalls, and secure access policies and AWS organizations.

Databases & Analytics:

Must have expertise in MongoDB, Snowflake, Aerospike, RDS, PostgreSQL, MySQL/MariaDB, and other RDBMS.
Manage data reliability, performance tuning, and cloud-native integrations.
Experience with Apache Airflow and Spark.

IDEAL CANDIDATE:

8+ years in DevOps engineering, with strong AWS Cloud expertise (EC2, VPC, TG, RDS, S3, IAM, EKS, EMR, SCP, MWAA, Lambda, CloudFront, SNS, SES etc.).
Linux expertise is mandatory (system administration, tuning, troubleshooting, CIS hardening etc).
Strong knowledge of databases: MongoDB, Snowflake, Aerospike, RDS, PostgreSQL, MySQL/MariaDB, and other RDBMS.
Hands-on with Docker, Kubernetes (EKS), Terraform, CloudFormation, Ansible.
Proven ability with CI/CD pipeline automation and DevSecOps practices.
Practical experience with VPC Peering, Transit Gateway, WAF, Guard Duty, Inspector and advanced AWS networking and security tools.
Expertise in observability tools: Prometheus, Grafana, ELK, CloudWatch, M/Monit, and Datadog.
Strong scripting skills (Shell/bash, Python, or similar) for automation.
Bachelor / Master’s degree
Effective communication skills

PERKS, BENEFITS AND WORK CULTURE:

Competitive Salary Package
Generous Leave Policy
Flexible Working Hours
Performance-Based Bonuses
Health Care Benefits

VP - Data Architect (B2B SaaS)

Technology Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Delhi

10 - 15 yrs

₹105L - ₹140L / yr

Data engineering

Apache Spark

Apache

Apache Kafka

Java

+25 more

MANDATORY:

Super Quality Data Architect, Data Engineering Manager / Director Profile
Must have 12+ YOE in Data Engineering roles, with at least 2+ years in a Leadership role
Must have 7+ YOE in hands-on Tech development with Java (Highly preferred) or Python, Node.JS, GoLang
Must have strong experience in large data technologies, tools like HDFS, YARN, Map-Reduce, Hive, Kafka, Spark, Airflow, Presto etc.
Strong expertise in HLD and LLD, to design scalable, maintainable data architectures.
Must have managed a team of at least 5+ Data Engineers (Read Leadership role in CV)
Product Companies (Prefers high-scale, data-heavy companies)

PREFERRED:

Must be from Tier - 1 Colleges, preferred IIT
Candidates must have spent a minimum 3 yrs in each company.
Must have recent 4+ YOE with high-growth Product startups, and should have implemented Data Engineering systems from an early stage in the Company

ROLES & RESPONSIBILITIES:

Lead and mentor a team of data engineers, ensuring high performance and career growth.
Architect and optimize scalable data infrastructure, ensuring high availability and reliability.
Drive the development and implementation of data governance frameworks and best practices.
Work closely with cross-functional teams to define and execute a data roadmap.
Optimize data processing workflows for performance and cost efficiency.
Ensure data security, compliance, and quality across all data platforms.
Foster a culture of innovation and technical excellence within the data team.

IDEAL CANDIDATE:

10+ years of experience in software/data engineering, with at least 3+ years in a leadership role.
Expertise in backend development with programming languages such as Java, PHP, Python, Node.JS, GoLang, JavaScript, HTML, and CSS.
Proficiency in SQL, Python, and Scala for data processing and analytics.
Strong understanding of cloud platforms (AWS, GCP, or Azure) and their data services.
Strong foundation and expertise in HLD and LLD, as well as design patterns, preferably using Spring Boot or Google Guice
Experience in big data technologies such as Spark, Hadoop, Kafka, and distributed computing frameworks.
Hands-on experience with data warehousing solutions such as Snowflake, Redshift, or BigQuery
Deep knowledge of data governance, security, and compliance (GDPR, SOC2, etc.).
Experience in NoSQL databases like Redis, Cassandra, MongoDB, and TiDB.
Familiarity with automation and DevOps tools like Jenkins, Ansible, Docker, Kubernetes, Chef, Grafana, and ELK.
Proven ability to drive technical strategy and align it with business objectives.
Strong leadership, communication, and stakeholder management skills.

PREFERRED QUALIFICATIONS:

Experience in machine learning infrastructure or MLOps is a plus.
Exposure to real-time data processing and analytics.
Interest in data structures, algorithm analysis and design, multicore programming, and scalable architecture.
Prior experience in a SaaS or high-growth tech company.

MANDATORY:

Super Quality Data Architect, Data Engineering Manager / Director Profile
Must have 12+ YOE in Data Engineering roles, with at least 2+ years in a Leadership role
Must have 7+ YOE in hands-on Tech development with Java (Highly preferred) or Python, Node.JS, GoLang
Must have strong experience in large data technologies, tools like HDFS, YARN, Map-Reduce, Hive, Kafka, Spark, Airflow, Presto etc.
Strong expertise in HLD and LLD, to design scalable, maintainable data architectures.
Must have managed a team of at least 5+ Data Engineers (Read Leadership role in CV)
Product Companies (Prefers high-scale, data-heavy companies)

PREFERRED:

Must be from Tier - 1 Colleges, preferred IIT
Candidates must have spent a minimum 3 yrs in each company.
Must have recent 4+ YOE with high-growth Product startups, and should have implemented Data Engineering systems from an early stage in the Company

ROLES & RESPONSIBILITIES:

Lead and mentor a team of data engineers, ensuring high performance and career growth.
Architect and optimize scalable data infrastructure, ensuring high availability and reliability.
Drive the development and implementation of data governance frameworks and best practices.
Work closely with cross-functional teams to define and execute a data roadmap.
Optimize data processing workflows for performance and cost efficiency.
Ensure data security, compliance, and quality across all data platforms.
Foster a culture of innovation and technical excellence within the data team.

IDEAL CANDIDATE:

10+ years of experience in software/data engineering, with at least 3+ years in a leadership role.
Expertise in backend development with programming languages such as Java, PHP, Python, Node.JS, GoLang, JavaScript, HTML, and CSS.
Proficiency in SQL, Python, and Scala for data processing and analytics.
Strong understanding of cloud platforms (AWS, GCP, or Azure) and their data services.
Strong foundation and expertise in HLD and LLD, as well as design patterns, preferably using Spring Boot or Google Guice
Experience in big data technologies such as Spark, Hadoop, Kafka, and distributed computing frameworks.
Hands-on experience with data warehousing solutions such as Snowflake, Redshift, or BigQuery
Deep knowledge of data governance, security, and compliance (GDPR, SOC2, etc.).
Experience in NoSQL databases like Redis, Cassandra, MongoDB, and TiDB.
Familiarity with automation and DevOps tools like Jenkins, Ansible, Docker, Kubernetes, Chef, Grafana, and ELK.
Proven ability to drive technical strategy and align it with business objectives.
Strong leadership, communication, and stakeholder management skills.

PREFERRED QUALIFICATIONS:

Experience in machine learning infrastructure or MLOps is a plus.
Exposure to real-time data processing and analytics.
Interest in data structures, algorithm analysis and design, multicore programming, and scalable architecture.
Prior experience in a SaaS or high-growth tech company.

Senior Data Engineer (L2)

at Publicis Sapient

10 recruiters

Posted by Mohit Singh

Bengaluru (Bangalore), Pune, Hyderabad, Gurugram, Noida

5 - 11 yrs

₹20L - ₹36L / yr

PySpark

Data engineering

Big Data

Hadoop

Spark

+7 more

Publicis Sapient Overview:

The Senior Associate People Senior Associate L1 in Data Engineering, you will translate client requirements into technical design, and implement components for data engineering solution. Utilize deep understanding of data integration and big data design principles in creating custom solutions or implementing package solutions. You will independently drive design discussions to insure the necessary health of the overall solution

Job Summary:

As Senior Associate L2 in Data Engineering, you will translate client requirements into technical design, and implement components for data engineering solution. Utilize deep understanding of data integration and big data design principles in creating custom solutions or implementing package solutions. You will independently drive design discussions to insure the necessary health of the overall solution

The role requires a hands-on technologist who has strong programming background like Java / Scala / Python, should have experience in Data Ingestion, Integration and data Wrangling, Computation, Analytics pipelines and exposure to Hadoop ecosystem components. You are also required to have hands-on knowledge on at least one of AWS, GCP, Azure cloud platforms.

Role & Responsibilities:

Your role is focused on Design, Development and delivery of solutions involving:

• Data Integration, Processing & Governance

• Data Storage and Computation Frameworks, Performance Optimizations

• Analytics & Visualizations

• Infrastructure & Cloud Computing

• Data Management Platforms

• Implement scalable architectural models for data processing and storage

• Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time mode

• Build functionality for data analytics, search and aggregation

Experience Guidelines:

Mandatory Experience and Competencies:

# Competency

1.Overall 5+ years of IT experience with 3+ years in Data related technologies

2.Minimum 2.5 years of experience in Big Data technologies and working exposure in at least one cloud platform on related data services (AWS / Azure / GCP)

3.Hands-on experience with the Hadoop stack – HDFS, sqoop, kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, hive, oozie, airflow and other components required in building end to end data pipeline.

4.Strong experience in at least of the programming language Java, Scala, Python. Java preferable

5.Hands-on working knowledge of NoSQL and MPP data platforms like Hbase, MongoDb, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc

6.Well-versed and working knowledge with data platform related services on at least 1 cloud platform, IAM and data security

Preferred Experience and Knowledge (Good to Have):

# Competency

1.Good knowledge of traditional ETL tools (Informatica, Talend, etc) and database technologies (Oracle, MySQL, SQL Server, Postgres) with hands on experience

2.Knowledge on data governance processes (security, lineage, catalog) and tools like Collibra, Alation etc

3.Knowledge on distributed messaging frameworks like ActiveMQ / RabbiMQ / Solace, search & indexing and Micro services architectures

4.Performance tuning and optimization of data pipelines

5.CI/CD – Infra provisioning on cloud, auto build & deployment pipelines, code quality

6.Cloud data specialty and other related Big data technology certifications

Personal Attributes:

• Strong written and verbal communication skills

• Articulation skills

• Good team player

• Self-starter who requires minimal oversight

• Ability to prioritize and manage multiple tasks

• Process orientation and the ability to define and set up processes

Publicis Sapient Overview:

Job Summary:

Role & Responsibilities:

Your role is focused on Design, Development and delivery of solutions involving:

• Data Integration, Processing & Governance

• Data Storage and Computation Frameworks, Performance Optimizations

• Analytics & Visualizations

• Infrastructure & Cloud Computing

• Data Management Platforms

• Implement scalable architectural models for data processing and storage

• Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time mode

• Build functionality for data analytics, search and aggregation

Experience Guidelines:

Mandatory Experience and Competencies:

# Competency

1.Overall 5+ years of IT experience with 3+ years in Data related technologies

2.Minimum 2.5 years of experience in Big Data technologies and working exposure in at least one cloud platform on related data services (AWS / Azure / GCP)

4.Strong experience in at least of the programming language Java, Scala, Python. Java preferable

5.Hands-on working knowledge of NoSQL and MPP data platforms like Hbase, MongoDb, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc

6.Well-versed and working knowledge with data platform related services on at least 1 cloud platform, IAM and data security

Preferred Experience and Knowledge (Good to Have):

# Competency

1.Good knowledge of traditional ETL tools (Informatica, Talend, etc) and database technologies (Oracle, MySQL, SQL Server, Postgres) with hands on experience

2.Knowledge on data governance processes (security, lineage, catalog) and tools like Collibra, Alation etc

3.Knowledge on distributed messaging frameworks like ActiveMQ / RabbiMQ / Solace, search & indexing and Micro services architectures

4.Performance tuning and optimization of data pipelines

5.CI/CD – Infra provisioning on cloud, auto build & deployment pipelines, code quality

6.Cloud data specialty and other related Big data technology certifications

Personal Attributes:

• Strong written and verbal communication skills

• Articulation skills

• Good team player

• Self-starter who requires minimal oversight

• Ability to prioritize and manage multiple tasks

• Process orientation and the ability to define and set up processes

Senior Data Engineering Role - Google Cloud Platform with Spark

A LEADING US BASED MNC

Agency job

via Zeal Consultants by Zeal Consultants

Bengaluru (Bangalore), Hyderabad, Delhi, Gurugram

5 - 10 yrs

₹14L - ₹15L / yr

Google Cloud Platform (GCP)

Spark

PySpark

Apache Spark

"DATA STREAMING"

Data Engineering : Senior Engineer / Manager

As Senior Engineer/ Manager in Data Engineering, you will translate client requirements into technical design, and implement components for a data engineering solutions. Utilize a deep understanding of data integration and big data design principles in creating custom solutions or implementing package solutions. You will independently drive design discussions to insure the necessary health of the overall solution.

Must Have skills :

1. GCP

2. Spark streaming : Live data streaming experience is desired.

3. Any 1 coding language: Java/Pyhton /Scala

Skills & Experience :

- Overall experience of MINIMUM 5+ years with Minimum 4 years of relevant experience in Big Data technologies

- Hands-on experience with the Hadoop stack - HDFS, sqoop, kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, hive, oozie, airflow and other components required in building end to end data pipeline. Working knowledge on real-time data pipelines is added advantage.

- Strong experience in at least of the programming language Java, Scala, Python. Java preferable

- Hands-on working knowledge of NoSQL and MPP data platforms like Hbase, MongoDb, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc.

- Well-versed and working knowledge with data platform related services on GCP

- Bachelor's degree and year of work experience of 6 to 12 years or any combination of education, training and/or experience that demonstrates the ability to perform the duties of the position

Your Impact :

- Data Ingestion, Integration and Transformation

- Data Storage and Computation Frameworks, Performance Optimizations

- Analytics & Visualizations

- Infrastructure & Cloud Computing

- Data Management Platforms

- Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time

- Build functionality for data analytics, search and aggregation

Data Engineering : Senior Engineer / Manager

Must Have skills :

1. GCP

2. Spark streaming : Live data streaming experience is desired.

3. Any 1 coding language: Java/Pyhton /Scala

Skills & Experience :

- Overall experience of MINIMUM 5+ years with Minimum 4 years of relevant experience in Big Data technologies

- Strong experience in at least of the programming language Java, Scala, Python. Java preferable

- Hands-on working knowledge of NoSQL and MPP data platforms like Hbase, MongoDb, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc.

- Well-versed and working knowledge with data platform related services on GCP

- Bachelor's degree and year of work experience of 6 to 12 years or any combination of education, training and/or experience that demonstrates the ability to perform the duties of the position

Your Impact :

- Data Ingestion, Integration and Transformation

- Data Storage and Computation Frameworks, Performance Optimizations

- Analytics & Visualizations

- Infrastructure & Cloud Computing

- Data Management Platforms

- Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time

- Build functionality for data analytics, search and aggregation

Data Engineer

at Career Forge

2 candid answers

Posted by Mohammad Faiz

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

5 - 7 yrs

₹12L - ₹15L / yr

Python

Apache Spark

PySpark

Data engineering

ETL

+10 more

🚀 Exciting Opportunity: Data Engineer Position in Gurugram 🌐

Hello

We are actively seeking a talented and experienced Data Engineer to join our dynamic team at Reality Motivational Venture in Gurugram (Gurgaon). If you're passionate about data, thrive in a collaborative environment, and possess the skills we're looking for, we want to hear from you!

Position: Data Engineer

Location: Gurugram (Gurgaon)

Experience: 5+ years

Key Skills:

- Python

- Spark, Pyspark

- Data Governance

- Cloud (AWS/Azure/GCP)

Main Responsibilities:

- Define and set up analytics environments for "Big Data" applications in collaboration with domain experts.

- Implement ETL processes for telemetry-based and stationary test data.

- Support in defining data governance, including data lifecycle management.

- Develop large-scale data processing engines and real-time search and analytics based on time series data.

- Ensure technical, methodological, and quality aspects.

- Support CI/CD processes.

- Foster know-how development and transfer, continuous improvement of leading technologies within Data Engineering.

- Collaborate with solution architects on the development of complex on-premise, hybrid, and cloud solution architectures.

Qualification Requirements:

- BSc, MSc, MEng, or PhD in Computer Science, Informatics/Telematics, Mathematics/Statistics, or a comparable engineering degree.

- Proficiency in Python and the PyData stack (Pandas/Numpy).

- Experience in high-level programming languages (C#/C++/Java).

- Familiarity with scalable processing environments like Dask (or Spark).

- Proficient in Linux and scripting languages (Bash Scripts).

- Experience in containerization and orchestration of containerized services (Kubernetes).

- Education in database technologies (SQL/OLAP and Non-SQL).

- Interest in Big Data storage technologies (Elastic, ClickHouse).

- Familiarity with Cloud technologies (Azure, AWS, GCP).

- Fluent English communication skills (speaking and writing).

- Ability to work constructively with a global team.

- Willingness to travel for business trips during development projects.

Preferable:

- Working knowledge of vehicle architectures, communication, and components.

- Experience in additional programming languages (C#/C++/Java, R, Scala, MATLAB).

- Experience in time-series processing.

How to Apply:

Interested candidates, please share your updated CV/resume with me.

Thank you for considering this exciting opportunity.