PySpark Jobs in Bangalore (Bengaluru)

50+ PySpark Jobs in Bangalore (Bengaluru) | PySpark Job openings in Bangalore (Bengaluru)

Apply to 50+ PySpark Jobs in Bangalore (Bengaluru) on CutShort.io. Explore the latest PySpark Job opportunities across top companies like Google, Amazon & Adobe.

Databricks Admin

One of the reputed Client in India

Agency job

via Evalutech Prospect Services Private Limited by HR Evalutech

Bengaluru (Bangalore), Mumbai, Delhi, Gurugram, Noida, Hyderabad, Pune

6 - 8 yrs

₹12L - ₹13L / yr

Amazon Web Services (AWS)

Python

PySpark

Our Client is looking to hire Databricks Amin immediatly.

This is PAN-INDIA Bulk hiring

Minimum of 6-8+ years with Databricks, Pyspark/Python and AWS.

Must have AWS

Notice 15-30 days is preferred.

Share profiles at hr at etpspl dot com

Please refer/share our email to your friends/colleagues who are looking for job.

Our Client is looking to hire Databricks Amin immediatly.

This is PAN-INDIA Bulk hiring

Minimum of 6-8+ years with Databricks, Pyspark/Python and AWS.

Must have AWS

Notice 15-30 days is preferred.

Share profiles at hr at etpspl dot com

Please refer/share our email to your friends/colleagues who are looking for job.

Python Developer

at Wissen Technology

4 recruiters

Posted by Nishita Bangera

Bengaluru (Bangalore)

4 - 8 yrs

Best in industry

Python

SQL

PySpark

Django

Key Responsibilities

Develop and maintain Python-based applications.
Design and optimize SQL queries and databases.
Collaborate with cross-functional teams to define, design, and ship new features.
Write clean, maintainable, and efficient code.
Troubleshoot and debug applications.
Participate in code reviews and contribute to team knowledge sharing.

Qualifications and Required Skills

Strong proficiency in Python programming.
Experience with SQL and database management.
Experience with web frameworks such as Django or Flask.
Knowledge of front-end technologies like HTML, CSS, and JavaScript.
Familiarity with version control systems like Git.
Strong problem-solving skills and attention to detail.
Excellent communication and teamwork skills.

Good to Have Skills

Experience with cloud platforms like AWS or Azure.
Knowledge of containerization technologies like Docker.
Familiarity with continuous integration and continuous deployment (CI/CD) pipelines

Key Responsibilities

Develop and maintain Python-based applications.
Design and optimize SQL queries and databases.
Collaborate with cross-functional teams to define, design, and ship new features.
Write clean, maintainable, and efficient code.
Troubleshoot and debug applications.
Participate in code reviews and contribute to team knowledge sharing.

Qualifications and Required Skills

Strong proficiency in Python programming.
Experience with SQL and database management.
Experience with web frameworks such as Django or Flask.
Knowledge of front-end technologies like HTML, CSS, and JavaScript.
Familiarity with version control systems like Git.
Strong problem-solving skills and attention to detail.
Excellent communication and teamwork skills.

Good to Have Skills

Experience with cloud platforms like AWS or Azure.
Knowledge of containerization technologies like Docker.
Familiarity with continuous integration and continuous deployment (CI/CD) pipelines

Data Engineer

at Wissen Technology

4 recruiters

Posted by Gagandeep Kaur

Bengaluru (Bangalore), Mumbai, Pune

4 - 7 yrs

Best in industry

Python

PySpark

pandas

Airflow

Data engineering

Wissen Technology is hiring for Data Engineer

About Wissen Technology: At Wissen Technology, we deliver niche, custom-built products that solve complex business challenges across industries worldwide. Founded in 2015, our core philosophy is built around a strong product engineering mindset—ensuring every solution is architected and delivered right the first time. Today, Wissen Technology has a global footprint with 2000+ employees across offices in the US, UK, UAE, India, and Australia. Our commitment to excellence translates into delivering 2X impact compared to traditional service providers. How do we achieve this? Through a combination of deep domain knowledge, cutting-edge technology expertise, and a relentless focus on quality. We don’t just meet expectations—we exceed them by ensuring faster time-to-market, reduced rework, and greater alignment with client objectives. We have a proven track record of building mission-critical systems across industries, including financial services, healthcare, retail, manufacturing, and more. Wissen stands apart through its unique delivery models. Our outcome-based projects ensure predictable costs and timelines, while our agile pods provide clients the flexibility to adapt to their evolving business needs. Wissen leverages its thought leadership and technology prowess to drive superior business outcomes. Our success is powered by top-tier talent. Our mission is clear: to be the partner of choice for building world-class custom products that deliver exceptional impact—the first time, every time.

Job Summary: Wissen Technology is hiring a Data Engineer with expertise in Python, Pandas, Airflow, and Azure Cloud Services. The ideal candidate will have strong communication skills and experience with Kubernetes.

Experience: 4-7 years

Notice Period: Immediate- 15 days

Location: Pune, Mumbai, Bangalore

Mode of Work: Hybrid

Key Responsibilities:

Develop and maintain data pipelines using Python and Pandas.
Implement and manage workflows using Airflow.
Utilize Azure Cloud Services for data storage and processing.
Collaborate with cross-functional teams to understand data requirements and deliver solutions.
Ensure data quality and integrity throughout the data lifecycle.
Optimize and scale data infrastructure to meet business needs.

Qualifications and Required Skills:

Proficiency in Python (Must Have).
Strong experience with Pandas (Must Have).
Expertise in Airflow (Must Have).
Experience with Azure Cloud Services.
Good communication skills.

Good to Have Skills:

Experience with Pyspark.
Knowledge of Kubernetes.

Wissen Sites:

Website: http://www.wissen.com
LinkedIn: https://www.linkedin.com/company/wissen-technology
Wissen Leadership: https://www.wissen.com/company/leadership-team/
Wissen Live: https://www.linkedin.com/company/wissen-technology/posts/feedView=All
Wissen Thought Leadership: https://www.wissen.com/articles/

Wissen Technology is hiring for Data Engineer

Experience: 4-7 years

Notice Period: Immediate- 15 days

Location: Pune, Mumbai, Bangalore

Mode of Work: Hybrid

Key Responsibilities:

Develop and maintain data pipelines using Python and Pandas.
Implement and manage workflows using Airflow.
Utilize Azure Cloud Services for data storage and processing.
Collaborate with cross-functional teams to understand data requirements and deliver solutions.
Ensure data quality and integrity throughout the data lifecycle.
Optimize and scale data infrastructure to meet business needs.

Qualifications and Required Skills:

Proficiency in Python (Must Have).
Strong experience with Pandas (Must Have).
Expertise in Airflow (Must Have).
Experience with Azure Cloud Services.
Good communication skills.

Good to Have Skills:

Experience with Pyspark.
Knowledge of Kubernetes.

Wissen Sites:

Website: http://www.wissen.com
LinkedIn: https://www.linkedin.com/company/wissen-technology
Wissen Leadership: https://www.wissen.com/company/leadership-team/
Wissen Live: https://www.linkedin.com/company/wissen-technology/posts/feedView=All
Wissen Thought Leadership: https://www.wissen.com/articles/

Hiring _Azure Data Bricks

at Wissen Technology

4 recruiters

Posted by Bipasha Rath

Mumbai, Bengaluru (Bangalore), Pune

3 - 7 yrs

Best in industry

Python

pandas

PySpark

Experience: 3–7 Years

Locations: Pune / Bangalore / Mumbai

Notice Period :Immediate joiner only

Employment Type: Full-time

🛠️ Key Skills (Mandatory):

Python: Strong coding skills for data manipulation and automation.
PySpark: Experience with distributed data processing using Spark.
SQL: Proficient in writing complex queries for data extraction and transformation.
Azure Databricks: Hands-on experience with notebooks, Delta Lake, and MLflow

Interested candidates please share resume with details below.

Total Experience -

Relevant Experience in Python,Pyspark,AQL,Azure Data bricks-

Current CTC -

Expected CTC -

Notice period -

Current Location -

Desired Location -

Experience: 3–7 Years

Locations: Pune / Bangalore / Mumbai

Notice Period :Immediate joiner only

Employment Type: Full-time

🛠️ Key Skills (Mandatory):

Python: Strong coding skills for data manipulation and automation.
PySpark: Experience with distributed data processing using Spark.
SQL: Proficient in writing complex queries for data extraction and transformation.
Azure Databricks: Hands-on experience with notebooks, Delta Lake, and MLflow

Interested candidates please share resume with details below.

Total Experience -

Relevant Experience in Python,Pyspark,AQL,Azure Data bricks-

Current CTC -

Expected CTC -

Notice period -

Current Location -

Desired Location -

DATA ENGINEER

at Wissen Technology

4 recruiters

Posted by Janane Mohanasankaran

Bengaluru (Bangalore), Pune, Mumbai

7 - 12 yrs

Best in industry

Python

pandas

PySpark

SQL

Data engineering

Wissen Technology is hiring for Data Engineer

About Wissen Technology:At Wissen Technology, we deliver niche, custom-built products that solve complex business challenges across industries worldwide. Founded in 2015, our core philosophy is built around a strong product engineering mindset—ensuring every solution is architected and delivered right the first time. Today, Wissen Technology has a global footprint with 2000+ employees across offices in the US, UK, UAE, India, and Australia. Our commitment to excellence translates into delivering 2X impact compared to traditional service providers. How do we achieve this? Through a combination of deep domain knowledge, cutting-edge technology expertise, and a relentless focus on quality. We don’t just meet expectations—we exceed them by ensuring faster time-to-market, reduced rework, and greater alignment with client objectives. We have a proven track record of building mission-critical systems across industries, including financial services, healthcare, retail, manufacturing, and more. Wissen stands apart through its unique delivery models. Our outcome-based projects ensure predictable costs and timelines, while our agile pods provide clients the flexibility to adapt to their evolving business needs. Wissen leverages its thought leadership and technology prowess to drive superior business outcomes. Our success is powered by top-tier talent. Our mission is clear: to be the partner of choice for building world-class custom products that deliver exceptional impact—the first time, every time.

Job Summary:Wissen Technology is hiring a Data Engineer with a strong background in Python, data engineering, and workflow optimization. The ideal candidate will have experience with Delta Tables, Parquet, and be proficient in Pandas and PySpark.

Experience:7+ years

Location:Pune, Mumbai, Bangalore

Mode of Work:Hybrid

Key Responsibilities:

Develop and maintain data pipelines using Python (Pandas, PySpark).
Optimize data workflows and ensure efficient data processing.
Work with Delta Tables and Parquet for data storage and management.
Collaborate with cross-functional teams to understand data requirements and deliver solutions.
Ensure data quality and integrity throughout the data lifecycle.
Implement best practices for data engineering and workflow optimization.

Qualifications and Required Skills:

Proficiency in Python, specifically with Pandas and PySpark.
Strong experience in data engineering and workflow optimization.
Knowledge of Delta Tables and Parquet.
Excellent problem-solving skills and attention to detail.
Ability to work collaboratively in a team environment.
Strong communication skills.

Good to Have Skills:

Experience with Databricks.
Knowledge of Apache Spark, DBT, and Airflow.
Advanced Pandas optimizations.
Familiarity with PyTest/DBT testing frameworks.

Wissen Sites:

Website: http://www.wissen.com
LinkedIn: https://www.linkedin.com/company/wissen-technology
Wissen Leadership: https://www.wissen.com/company/leadership-team/
Wissen Live: https://www.linkedin.com/company/wissen-technology/posts/feedView=All
Wissen Thought Leadership: https://www.wissen.com/articles/

Wissen | Driving Digital Transformation

A technology consultancy that drives digital innovation by connecting strategy and execution, helping global clients to strengthen their core technology.

Wissen Technology is hiring for Data Engineer

About Wissen Technology:At Wissen Technology, we deliver niche, custom-built products that solve complex business challenges across industries worldwide. Founded in 2015, our core philosophy is built around a strong product engineering mindset—ensuring every solution is architected and delivered right the first time. Today, Wissen Technology has a global footprint with 2000+ employees across offices in the US, UK, UAE, India, and Australia. Our commitment to excellence translates into delivering 2X impact compared to traditional service providers. How do we achieve this? Through a combination of deep domain knowledge, cutting-edge technology expertise, and a relentless focus on quality. We don’t just meet expectations—we exceed them by ensuring faster time-to-market, reduced rework, and greater alignment with client objectives. We have a proven track record of building mission-critical systems across industries, including financial services, healthcare, retail, manufacturing, and more. Wissen stands apart through its unique delivery models. Our outcome-based projects ensure predictable costs and timelines, while our agile pods provide clients the flexibility to adapt to their evolving business needs. Wissen leverages its thought leadership and technology prowess to drive superior business outcomes. Our success is powered by top-tier talent. Our mission is clear: to be the partner of choice for building world-class custom products that deliver exceptional impact—the first time, every time.

Experience:7+ years

Location:Pune, Mumbai, Bangalore

Mode of Work:Hybrid

Key Responsibilities:

Develop and maintain data pipelines using Python (Pandas, PySpark).
Optimize data workflows and ensure efficient data processing.
Work with Delta Tables and Parquet for data storage and management.
Collaborate with cross-functional teams to understand data requirements and deliver solutions.
Ensure data quality and integrity throughout the data lifecycle.
Implement best practices for data engineering and workflow optimization.

Qualifications and Required Skills:

Proficiency in Python, specifically with Pandas and PySpark.
Strong experience in data engineering and workflow optimization.
Knowledge of Delta Tables and Parquet.
Excellent problem-solving skills and attention to detail.
Ability to work collaboratively in a team environment.
Strong communication skills.

Good to Have Skills:

Experience with Databricks.
Knowledge of Apache Spark, DBT, and Airflow.
Advanced Pandas optimizations.
Familiarity with PyTest/DBT testing frameworks.

Wissen Sites:

Website: http://www.wissen.com
LinkedIn: https://www.linkedin.com/company/wissen-technology
Wissen Leadership: https://www.wissen.com/company/leadership-team/
Wissen Live: https://www.linkedin.com/company/wissen-technology/posts/feedView=All
Wissen Thought Leadership: https://www.wissen.com/articles/

Wissen | Driving Digital Transformation

A technology consultancy that drives digital innovation by connecting strategy and execution, helping global clients to strengthen their core technology.

PySpark/Scala Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by Jhansi Padiy

Chennai, Hyderabad, Kolkata, Delhi, Pune, Bengaluru (Bangalore)

4 - 10 yrs

₹6L - ₹30L / yr

Scala

PySpark

Spark

Amazon Web Services (AWS)

Job Title: PySpark/Scala Developer

Functional Skills: Experience in Credit Risk/Regulatory risk domain

Technical Skills: Spark ,PySpark, Python, Hive, Scala, MapReduce, Unix shell scripting

Good to Have Skills: Exposure to Machine Learning Techniques

Job Description:

5+ Years of experience with Developing/Fine tuning and implementing programs/applications

Using Python/PySpark/Scala on Big Data/Hadoop Platform.

Roles and Responsibilities:

a) Work with a Leading Bank’s Risk Management team on specific projects/requirements pertaining to risk Models in

consumer and wholesale banking

b) Enhance Machine Learning Models using PySpark or Scala

c) Work with Data Scientists to Build ML Models based on Business Requirements and Follow ML Cycle to Deploy them all

the way to Production Environment

d) Participate Feature Engineering, Training Models, Scoring and retraining

e) Architect Data Pipeline and Automate Data Ingestion and Model Jobs

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

Job Title: PySpark/Scala Developer

Functional Skills: Experience in Credit Risk/Regulatory risk domain

Technical Skills: Spark ,PySpark, Python, Hive, Scala, MapReduce, Unix shell scripting

Good to Have Skills: Exposure to Machine Learning Techniques

Job Description:

5+ Years of experience with Developing/Fine tuning and implementing programs/applications

Using Python/PySpark/Scala on Big Data/Hadoop Platform.

Roles and Responsibilities:

a) Work with a Leading Bank’s Risk Management team on specific projects/requirements pertaining to risk Models in

consumer and wholesale banking

b) Enhance Machine Learning Models using PySpark or Scala

c) Work with Data Scientists to Build ML Models based on Business Requirements and Follow ML Cycle to Deploy them all

the way to Production Environment

d) Participate Feature Engineering, Training Models, Scoring and retraining

e) Architect Data Pipeline and Automate Data Ingestion and Model Jobs

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

PySpark/Scala Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Bengaluru (Bangalore), Hyderabad, Pune, Delhi, Kolkata, Chennai

5 - 8 yrs

₹7L - ₹30L / yr

Scala

Python

PySpark

Apache Hive

Spark

+3 more

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

AWS Data Engineer

at Deqode

1 recruiter

Posted by Shraddha Katare

Pune, Bengaluru (Bangalore)

5 - 8 yrs

₹5L - ₹13L / yr

Amazon Web Services (AWS)

databricks

PySpark

SQL

Profile: AWS Data Engineer

Mandate skills :AWS + Databricks + Pyspark + SQL role

Location: Bangalore/Pune/Hyderabad/Chennai/Gurgaon:

Notice Period: Immediate

Key Requirements :

Design, build, and maintain scalable data pipelines to collect, process, and store from multiple datasets.
Optimize data storage solutions for better performance, scalability, and cost-efficiency.
Develop and manage ETL/ELT processes to transform data as per schema definitions, apply slicing and dicing, and make it available for downstream jobs and other teams.
Collaborate closely with cross-functional teams to understand system and product functionalities, pace up feature development, and capture evolving data requirements.
Engage with stakeholders to gather requirements and create curated datasets for downstream consumption and end-user reporting.
Automate deployment and CI/CD processes using GitHub workflows, identifying areas to reduce manual, repetitive work.
Ensure compliance with data governance policies, privacy regulations, and security protocols.
Utilize cloud platforms like AWS and work on Databricks for data processing with S3 Storage.
Work with distributed systems and big data technologies such as Spark, SQL, and Delta Lake.
Integrate with SFTP to push data securely from Databricks to remote locations.
Analyze and interpret spark query execution plans to fine-tune queries for faster and more efficient processing.
Strong problem-solving and troubleshooting skills in large-scale distributed systems.

Profile: AWS Data Engineer

Mandate skills :AWS + Databricks + Pyspark + SQL role

Location: Bangalore/Pune/Hyderabad/Chennai/Gurgaon:

Notice Period: Immediate

Key Requirements :

Design, build, and maintain scalable data pipelines to collect, process, and store from multiple datasets.
Optimize data storage solutions for better performance, scalability, and cost-efficiency.
Develop and manage ETL/ELT processes to transform data as per schema definitions, apply slicing and dicing, and make it available for downstream jobs and other teams.
Collaborate closely with cross-functional teams to understand system and product functionalities, pace up feature development, and capture evolving data requirements.
Engage with stakeholders to gather requirements and create curated datasets for downstream consumption and end-user reporting.
Automate deployment and CI/CD processes using GitHub workflows, identifying areas to reduce manual, repetitive work.
Ensure compliance with data governance policies, privacy regulations, and security protocols.
Utilize cloud platforms like AWS and work on Databricks for data processing with S3 Storage.
Work with distributed systems and big data technologies such as Spark, SQL, and Delta Lake.
Integrate with SFTP to push data securely from Databricks to remote locations.
Analyze and interpret spark query execution plans to fine-tune queries for faster and more efficient processing.
Strong problem-solving and troubleshooting skills in large-scale distributed systems.

Solution/Technical Architect (Databricks)

at Quintica

Posted by Nitin D

Remote, Bengaluru (Bangalore), Pune, Chennai, Nagpur

5 - 15 yrs

₹20L - ₹30L / yr

databricks

PySpark

Apache Spark

CI/CD

Data engineering

Technical Architect (Databricks)

10+ Years Data Engineering Experience with expertise in Databricks
3+ years of consulting experience
Completed Data Engineering Professional certification & required classes
Minimum 2-3 projects delivered with hands-on experience in Databricks
Completed Apache Spark Programming with Databricks, Data Engineering with Databricks, Optimizing Apache Spark™ on Databricks
Experience in Spark and/or Hadoop, Flink, Presto, other popular big data engines
Familiarity with Databricks multi-hop pipeline architecture

Sr. Data Engineer (Databricks)

5+ Years Data Engineering Experience with expertise in Databricks
Completed Data Engineering Associate certification & required classes
Minimum 1 project delivered with hands-on experience in development on Databricks
Completed Apache Spark Programming with Databricks, Data Engineering with Databricks, Optimizing Apache Spark™ on Databricks
SQL delivery experience, and familiarity with Bigquery, Synapse or Redshift
Proficient in Python, knowledge of additional databricks programming languages (Scala)

Technical Architect (Databricks)

10+ Years Data Engineering Experience with expertise in Databricks
3+ years of consulting experience
Completed Data Engineering Professional certification & required classes
Minimum 2-3 projects delivered with hands-on experience in Databricks
Completed Apache Spark Programming with Databricks, Data Engineering with Databricks, Optimizing Apache Spark™ on Databricks
Experience in Spark and/or Hadoop, Flink, Presto, other popular big data engines
Familiarity with Databricks multi-hop pipeline architecture

Sr. Data Engineer (Databricks)

5+ Years Data Engineering Experience with expertise in Databricks
Completed Data Engineering Associate certification & required classes
Minimum 1 project delivered with hands-on experience in development on Databricks
Completed Apache Spark Programming with Databricks, Data Engineering with Databricks, Optimizing Apache Spark™ on Databricks
SQL delivery experience, and familiarity with Bigquery, Synapse or Redshift
Proficient in Python, knowledge of additional databricks programming languages (Scala)

Data Engineer

at Wissen Technology

4 recruiters

Posted by Annie Varghese

Pune, Mumbai, Bengaluru (Bangalore)

3 - 8 yrs

Best in industry

snowflake

Apache Airflow

ETL

Python

PySpark

+1 more

Job Summary:

We are looking for a highly skilled and experienced Data Engineer with deep expertise in Airflow, dbt, Python, and Snowflake. The ideal candidate will be responsible for designing, building, and managing scalable data pipelines and transformation frameworks to enable robust data workflows across the organization.

Key Responsibilities:

Design and implement scalable ETL/ELT pipelines using Apache Airflow for orchestration.
Develop modular and maintainable data transformation models using dbt.
Write high-performance data processing scripts and automation using Python.
Build and maintain data models and pipelines on Snowflake.
Collaborate with data analysts, data scientists, and business teams to deliver clean, reliable, and timely data.
Monitor and optimize pipeline performance and troubleshoot issues proactively.
Follow best practices in version control, testing, and CI/CD for data projects.

Must-Have Skills:

Strong hands-on experience with Apache Airflow for scheduling and orchestrating data workflows.
Proficiency in dbt (data build tool) for building scalable and testable data models.
Expert-level skills in Python for data processing and automation.
Solid experience with Snowflake, including SQL performance tuning, data modeling, and warehouse management.
Strong understanding of data engineering best practices including modularity, testing, and deployment.

Good to Have:

Experience working with cloud platforms (AWS/GCP/Azure).
Familiarity with CI/CD pipelines for data (e.g., GitHub Actions, GitLab CI).
Exposure to modern data stack tools (e.g., Fivetran, Stitch, Looker).
Knowledge of data security and governance best practices.

Note : One face-to-face (F2F) round is mandatory, and as per the process, you will need to visit the office for this.

Job Summary:

Key Responsibilities:

Design and implement scalable ETL/ELT pipelines using Apache Airflow for orchestration.
Develop modular and maintainable data transformation models using dbt.
Write high-performance data processing scripts and automation using Python.
Build and maintain data models and pipelines on Snowflake.
Collaborate with data analysts, data scientists, and business teams to deliver clean, reliable, and timely data.
Monitor and optimize pipeline performance and troubleshoot issues proactively.
Follow best practices in version control, testing, and CI/CD for data projects.

Must-Have Skills:

Strong hands-on experience with Apache Airflow for scheduling and orchestrating data workflows.
Proficiency in dbt (data build tool) for building scalable and testable data models.
Expert-level skills in Python for data processing and automation.
Solid experience with Snowflake, including SQL performance tuning, data modeling, and warehouse management.
Strong understanding of data engineering best practices including modularity, testing, and deployment.

Good to Have:

Experience working with cloud platforms (AWS/GCP/Azure).
Familiarity with CI/CD pipelines for data (e.g., GitHub Actions, GitLab CI).
Exposure to modern data stack tools (e.g., Fivetran, Stitch, Looker).
Knowledge of data security and governance best practices.

Note : One face-to-face (F2F) round is mandatory, and as per the process, you will need to visit the office for this.

AWS Data Engineer

at VyTCDC

Posted by Gobinath Sundaram

Chennai, Bengaluru (Bangalore), Hyderabad, Mumbai, Pune, Noida

4 - 6 yrs

₹3L - ₹21L / yr

AWS Data Engineer

Amazon Web Services (AWS)

Python

PySpark

databricks

+1 more

Key Responsibilities

Design and implement ETL/ELT pipelines using Databricks, PySpark, and AWS Glue
Develop and maintain scalable data architectures on AWS (S3, EMR, Lambda, Redshift, RDS)
Perform data wrangling, cleansing, and transformation using Python and SQL
Collaborate with data scientists to integrate Generative AI models into analytics workflows
Build dashboards and reports to visualize insights using tools like Power BI or Tableau
Ensure data quality, governance, and security across all data assets
Optimize performance of data pipelines and troubleshoot bottlenecks
Work closely with stakeholders to understand data requirements and deliver actionable insights

🧪 Required Skills

Skill AreaTools & TechnologiesCloud PlatformsAWS (S3, Lambda, Glue, EMR, Redshift)Big DataDatabricks, Apache Spark, PySparkProgrammingPython, SQLData EngineeringETL/ELT, Data Lakes, Data WarehousingAnalyticsData Modeling, Visualization, BI ReportingGen AI IntegrationOpenAI, Hugging Face, LangChain (preferred)DevOps (Bonus)Git, Jenkins, Terraform, Docker

📚 Qualifications

Bachelor's or Master’s degree in Computer Science, Data Science, or related field
3+ years of experience in data engineering or data analytics
Hands-on experience with Databricks, PySpark, and AWS
Familiarity with Generative AI tools and frameworks is a strong plus
Strong problem-solving and communication skills

🌟 Preferred Traits

Analytical mindset with attention to detail
Passion for data and emerging technologies
Ability to work independently and in cross-functional teams
Eagerness to learn and adapt in a fast-paced environment

Key Responsibilities

Design and implement ETL/ELT pipelines using Databricks, PySpark, and AWS Glue
Develop and maintain scalable data architectures on AWS (S3, EMR, Lambda, Redshift, RDS)
Perform data wrangling, cleansing, and transformation using Python and SQL
Collaborate with data scientists to integrate Generative AI models into analytics workflows
Build dashboards and reports to visualize insights using tools like Power BI or Tableau
Ensure data quality, governance, and security across all data assets
Optimize performance of data pipelines and troubleshoot bottlenecks
Work closely with stakeholders to understand data requirements and deliver actionable insights

🧪 Required Skills

📚 Qualifications

Bachelor's or Master’s degree in Computer Science, Data Science, or related field
3+ years of experience in data engineering or data analytics
Hands-on experience with Databricks, PySpark, and AWS
Familiarity with Generative AI tools and frameworks is a strong plus
Strong problem-solving and communication skills

🌟 Preferred Traits

Analytical mindset with attention to detail
Passion for data and emerging technologies
Ability to work independently and in cross-functional teams
Eagerness to learn and adapt in a fast-paced environment

Sr Python Developer

at Risosu Consulting LLP

Posted by Vandana Saxena

Bengaluru (Bangalore)

5 - 7 yrs

₹12L - ₹18L / yr

Python

PySpark

SQL

Job Title: Python Developer

Location: Bangalore

Experience: 5–7 Years

Employment Type: Full-Time

Job Description:

We are seeking an experienced Python Developer with strong proficiency in data analysis tools and PySpark, along with a solid understanding of SQL syntax. The ideal candidate will work on large-scale data processing and analysis tasks within a fast-paced environment.

Key Requirements:

Python: Hands-on experience with Python, specifically in data analysis using libraries such as pandas, numpy, etc.

PySpark: Proficiency in writing efficient PySpark code for distributed data processing.

SQL: Strong knowledge of SQL syntax and experience in writing optimized queries.

Ability to work independently and collaborate effectively with cross-functional teams.

Job Title: Python Developer

Location: Bangalore

Experience: 5–7 Years

Employment Type: Full-Time

Job Description:

Key Requirements:

Python: Hands-on experience with Python, specifically in data analysis using libraries such as pandas, numpy, etc.

PySpark: Proficiency in writing efficient PySpark code for distributed data processing.

SQL: Strong knowledge of SQL syntax and experience in writing optimized queries.

Ability to work independently and collaborate effectively with cross-functional teams.

AWS data engineer

at Tekit Software solution Pvt Ltd

Posted by himanshi Tripathi

Hyderabad, Bengaluru (Bangalore)

8 - 10 yrs

₹15L - ₹27L / yr

Amazon Web Services (AWS)

Python

PySpark

SQL

🔍 Job Description:

We are looking for an experienced and highly skilled Technical Lead to guide the development and enhancement of a large-scale Data Observability solution built on AWS. This platform is pivotal in delivering monitoring, reporting, and actionable insights across the client's data landscape.

The Technical Lead will drive end-to-end feature delivery, mentor junior engineers, and uphold engineering best practices. The position reports to the Programme Technical Lead / Architect and involves close collaboration to align on platform vision, technical priorities, and success KPIs.

🎯 Key Responsibilities:

Lead the design, development, and delivery of features for the data observability solution.
Mentor and guide junior engineers, promoting technical growth and engineering excellence.
Collaborate with the architect to align on platform roadmap, vision, and success metrics.
Ensure high quality, scalability, and performance in data engineering solutions.
Contribute to code reviews, architecture discussions, and operational readiness.

🔧 Primary Must-Have Skills (Non-Negotiable):

5+ years in Data Engineering or Software Engineering roles.
3+ years in a technical team or squad leadership capacity.
Deep expertise in AWS Data Services: Glue, EMR, Kinesis, Lambda, Athena, S3.
Advanced programming experience with PySpark, Python, and SQL.
Proven experience in building scalable, production-grade data pipelines on cloud platforms.

🔍 Job Description:

🎯 Key Responsibilities:

Lead the design, development, and delivery of features for the data observability solution.
Mentor and guide junior engineers, promoting technical growth and engineering excellence.
Collaborate with the architect to align on platform roadmap, vision, and success metrics.
Ensure high quality, scalability, and performance in data engineering solutions.
Contribute to code reviews, architecture discussions, and operational readiness.

🔧 Primary Must-Have Skills (Non-Negotiable):

5+ years in Data Engineering or Software Engineering roles.
3+ years in a technical team or squad leadership capacity.
Deep expertise in AWS Data Services: Glue, EMR, Kinesis, Lambda, Athena, S3.
Advanced programming experience with PySpark, Python, and SQL.
Proven experience in building scalable, production-grade data pipelines on cloud platforms.

Data Engineer – GCP + Spark + DBT

at NeoGenCode Technologies Pvt Ltd

2 candid answers

Posted by Akshay Patil

Bengaluru (Bangalore)

8 - 12 yrs

₹15L - ₹22L / yr

Data engineering

Google Cloud Platform (GCP)

Data Transformation Tool (DBT)

Google Dataform

BigQuery

+6 more

Job Title : Data Engineer – GCP + Spark + DBT

Location : Bengaluru (On-site at Client Location | 3 Days WFO)

Experience : 8 to 12 Years

Level : Associate Architect

Type : Full-time

Job Overview :

We are looking for a seasoned Data Engineer to join the Data Platform Engineering team supporting a Unified Data Platform (UDP). This role requires hands-on expertise in DBT, GCP, BigQuery, and PySpark, with a solid foundation in CI/CD, data pipeline optimization, and agile delivery.

Mandatory Skills : GCP, DBT, Google Dataform, BigQuery, PySpark/Spark SQL, Advanced SQL, CI/CD, Git, Agile Methodologies.

Key Responsibilities :

Design, build, and optimize scalable data pipelines using BigQuery, DBT, and PySpark.
Leverage GCP-native services like Cloud Storage, Pub/Sub, Dataproc, Cloud Functions, and Composer for ETL/ELT workflows.
Implement and maintain CI/CD for data engineering projects with Git-based version control.
Collaborate with cross-functional teams including Infra, Security, and DataOps for reliable, secure, and high-quality data delivery.
Lead code reviews, mentor junior engineers, and enforce best practices in data engineering.
Participate in Agile sprints, backlog grooming, and Jira-based project tracking.

Must-Have Skills :

Strong experience with DBT, Google Dataform, and BigQuery
Hands-on expertise with PySpark/Spark SQL
Proficient in GCP for data engineering workflows
Solid knowledge of SQL optimization, Git, and CI/CD pipelines
Agile team experience and strong problem-solving abilities

Nice-to-Have Skills :

Familiarity with Databricks, Delta Lake, or Kafka
Exposure to data observability and quality frameworks (e.g., Great Expectations, Soda)
Knowledge of MDM patterns, Terraform, or IaC is a plus

Job Title : Data Engineer – GCP + Spark + DBT

Location : Bengaluru (On-site at Client Location | 3 Days WFO)

Experience : 8 to 12 Years

Level : Associate Architect

Type : Full-time

Job Overview :

Mandatory Skills : GCP, DBT, Google Dataform, BigQuery, PySpark/Spark SQL, Advanced SQL, CI/CD, Git, Agile Methodologies.

Key Responsibilities :

Design, build, and optimize scalable data pipelines using BigQuery, DBT, and PySpark.
Leverage GCP-native services like Cloud Storage, Pub/Sub, Dataproc, Cloud Functions, and Composer for ETL/ELT workflows.
Implement and maintain CI/CD for data engineering projects with Git-based version control.
Collaborate with cross-functional teams including Infra, Security, and DataOps for reliable, secure, and high-quality data delivery.
Lead code reviews, mentor junior engineers, and enforce best practices in data engineering.
Participate in Agile sprints, backlog grooming, and Jira-based project tracking.

Must-Have Skills :

Strong experience with DBT, Google Dataform, and BigQuery
Hands-on expertise with PySpark/Spark SQL
Proficient in GCP for data engineering workflows
Solid knowledge of SQL optimization, Git, and CI/CD pipelines
Agile team experience and strong problem-solving abilities

Nice-to-Have Skills :

Familiarity with Databricks, Delta Lake, or Kafka
Exposure to data observability and quality frameworks (e.g., Great Expectations, Soda)
Knowledge of MDM patterns, Terraform, or IaC is a plus

Python developer

at Wissen Technology

4 recruiters

Posted by Praffull Shinde

Pune, Mumbai, Bengaluru (Bangalore)

4 - 8 yrs

₹14L - ₹26L / yr

Python

PySpark

Django

Flask

RESTful APIs

+3 more

Job title - Python developer

Exp – 4 to 6 years

Location – Pune/Mum/B’lore

PFB JD

Requirements:

Proven experience as a Python Developer
Strong knowledge of core Python and Pyspark concepts
Experience with web frameworks such as Django or Flask
Good exposure to any cloud platform (GCP Preferred)
CI/CD exposure required
Solid understanding of RESTful APIs and how to build them
Experience working with databases like Oracle DB and MySQL
Ability to write efficient SQL queries and optimize database performance
Strong problem-solving skills and attention to detail
Strong SQL programing (stored procedure, functions)
Excellent communication and interpersonal skill

Roles and Responsibilities

Design, develop, and maintain data pipelines and ETL processes using pyspark
Work closely with data scientists and analysts to provide them with clean, structured data.
Optimize data storage and retrieval for performance and scalability.
Collaborate with cross-functional teams to gather data requirements.
Ensure data quality and integrity through data validation and cleansing processes.
Monitor and troubleshoot data-related issues to ensure data pipeline reliability.
Stay up to date with industry best practices and emerging technologies in data engineering.

Job title - Python developer

Exp – 4 to 6 years

Location – Pune/Mum/B’lore

PFB JD

Requirements:

Proven experience as a Python Developer
Strong knowledge of core Python and Pyspark concepts
Experience with web frameworks such as Django or Flask
Good exposure to any cloud platform (GCP Preferred)
CI/CD exposure required
Solid understanding of RESTful APIs and how to build them
Experience working with databases like Oracle DB and MySQL
Ability to write efficient SQL queries and optimize database performance
Strong problem-solving skills and attention to detail
Strong SQL programing (stored procedure, functions)
Excellent communication and interpersonal skill

Roles and Responsibilities

Design, develop, and maintain data pipelines and ETL processes using pyspark
Work closely with data scientists and analysts to provide them with clean, structured data.
Optimize data storage and retrieval for performance and scalability.
Collaborate with cross-functional teams to gather data requirements.
Ensure data quality and integrity through data validation and cleansing processes.
Monitor and troubleshoot data-related issues to ensure data pipeline reliability.
Stay up to date with industry best practices and emerging technologies in data engineering.

AWS Data Engineer

at Deqode

1 recruiter

Posted by Alisha Das

Bengaluru (Bangalore), Mumbai, Pune, Chennai, Gurugram

5.6 - 7 yrs

₹10L - ₹28L / yr

Amazon Web Services (AWS)

Python

PySpark

SQL

Job Summary:

As an AWS Data Engineer, you will be responsible for designing, developing, and maintaining scalable, high-performance data pipelines using AWS services. With 6+ years of experience, you’ll collaborate closely with data architects, analysts, and business stakeholders to build reliable, secure, and cost-efficient data infrastructure across the organization.

Key Responsibilities:

Design, develop, and manage scalable data pipelines using AWS Glue, Lambda, and other serverless technologies
Implement ETL workflows and transformation logic using PySpark and Python on AWS Glue
Leverage AWS Redshift for warehousing, performance tuning, and large-scale data queries
Work with AWS DMS and RDS for database integration and migration
Optimize data flows and system performance for speed and cost-effectiveness
Deploy and manage infrastructure using AWS CloudFormation templates
Collaborate with cross-functional teams to gather requirements and build robust data solutions
Ensure data integrity, quality, and security across all systems and processes

Required Skills & Experience:

6+ years of experience in Data Engineering with strong AWS expertise
Proficient in Python and PySpark for data processing and ETL development
Hands-on experience with AWS Glue, Lambda, DMS, RDS, and Redshift
Strong SQL skills for building complex queries and performing data analysis
Familiarity with AWS CloudFormation and infrastructure as code principles
Good understanding of serverless architecture and cost-optimized design
Ability to write clean, modular, and maintainable code
Strong analytical thinking and problem-solving skills

Job Summary:

Key Responsibilities:

Design, develop, and manage scalable data pipelines using AWS Glue, Lambda, and other serverless technologies
Implement ETL workflows and transformation logic using PySpark and Python on AWS Glue
Leverage AWS Redshift for warehousing, performance tuning, and large-scale data queries
Work with AWS DMS and RDS for database integration and migration
Optimize data flows and system performance for speed and cost-effectiveness
Deploy and manage infrastructure using AWS CloudFormation templates
Collaborate with cross-functional teams to gather requirements and build robust data solutions
Ensure data integrity, quality, and security across all systems and processes

Required Skills & Experience:

6+ years of experience in Data Engineering with strong AWS expertise
Proficient in Python and PySpark for data processing and ETL development
Hands-on experience with AWS Glue, Lambda, DMS, RDS, and Redshift
Strong SQL skills for building complex queries and performing data analysis
Familiarity with AWS CloudFormation and infrastructure as code principles
Good understanding of serverless architecture and cost-optimized design
Ability to write clean, modular, and maintainable code
Strong analytical thinking and problem-solving skills

ETL Automation Tester

at E2E Infoware Management Services

Posted by Monika S

Bengaluru (Bangalore), Pune, Chennai

5 - 12 yrs

₹5L - ₹25L / yr

PySpark

Automation

SQL

Skill Name: ETL Automation Testing

Location: Bangalore, Chennai and Pune

Experience: 5+ Years

Required:

Experience in ETL Automation Testing

Strong experience in Pyspark.

Skill Name: ETL Automation Testing

Location: Bangalore, Chennai and Pune

Experience: 5+ Years

Required:

Experience in ETL Automation Testing

Strong experience in Pyspark.

Senior Data Engineer

at Wissen Technology

4 recruiters

Posted by Vishakha Walunj

Bengaluru (Bangalore), Pune, Mumbai

7 - 12 yrs

Best in industry

PySpark

databricks

SQL

Python

Required Skills:

Hands-on experience with Databricks, PySpark
Proficiency in SQL, Python, and Spark.
Understanding of data warehousing concepts and data modeling.
Experience with CI/CD pipelines and version control (e.g., Git).
Fundamental knowledge of any cloud services, preferably Azure or GCP.

Good to Have:

Bigquery
Experience with performance tuning and data governance.

Required Skills:

Hands-on experience with Databricks, PySpark
Proficiency in SQL, Python, and Spark.
Understanding of data warehousing concepts and data modeling.
Experience with CI/CD pipelines and version control (e.g., Git).
Fundamental knowledge of any cloud services, preferably Azure or GCP.

Good to Have:

Bigquery
Experience with performance tuning and data governance.

AWS Data Engineer

at Deqode

1 recruiter

Posted by Roshni Maji

Pune, Bengaluru (Bangalore), Gurugram, Chennai, Mumbai

5 - 7 yrs

₹6L - ₹20L / yr

Amazon Web Services (AWS)

Amazon Redshift

AWS Glue

Python

PySpark

Position: AWS Data Engineer

Experience: 5 to 7 Years

Location: Bengaluru, Pune, Chennai, Mumbai, Gurugram

Work Mode: Hybrid (3 days work from office per week)

Employment Type: Full-time

About the Role:

We are seeking a highly skilled and motivated AWS Data Engineer with 5–7 years of experience in building and optimizing data pipelines, architectures, and data sets. The ideal candidate will have strong experience with AWS services including Glue, Athena, Redshift, Lambda, DMS, RDS, and CloudFormation. You will be responsible for managing the full data lifecycle from ingestion to transformation and storage, ensuring efficiency and performance.

Key Responsibilities:

Design, develop, and optimize scalable ETL pipelines using AWS Glue, Python/PySpark, and SQL.
Work extensively with AWS services such as Glue, Athena, Lambda, DMS, RDS, Redshift, CloudFormation, and other serverless technologies.
Implement and manage data lake and warehouse solutions using AWS Redshift and S3.
Optimize data models and storage for cost-efficiency and performance.
Write advanced SQL queries to support complex data analysis and reporting requirements.
Collaborate with stakeholders to understand data requirements and translate them into scalable solutions.
Ensure high data quality and integrity across platforms and processes.
Implement CI/CD pipelines and best practices for infrastructure as code using CloudFormation or similar tools.

Required Skills & Experience:

Strong hands-on experience with Python or PySpark for data processing.
Deep knowledge of AWS Glue, Athena, Lambda, Redshift, RDS, DMS, and CloudFormation.
Proficiency in writing complex SQL queries and optimizing them for performance.
Familiarity with serverless architectures and AWS best practices.
Experience in designing and maintaining robust data architectures and data lakes.
Ability to troubleshoot and resolve data pipeline issues efficiently.
Strong communication and stakeholder management skills.

Position: AWS Data Engineer

Experience: 5 to 7 Years

Location: Bengaluru, Pune, Chennai, Mumbai, Gurugram

Work Mode: Hybrid (3 days work from office per week)

Employment Type: Full-time

About the Role:

Key Responsibilities:

Design, develop, and optimize scalable ETL pipelines using AWS Glue, Python/PySpark, and SQL.
Work extensively with AWS services such as Glue, Athena, Lambda, DMS, RDS, Redshift, CloudFormation, and other serverless technologies.
Implement and manage data lake and warehouse solutions using AWS Redshift and S3.
Optimize data models and storage for cost-efficiency and performance.
Write advanced SQL queries to support complex data analysis and reporting requirements.
Collaborate with stakeholders to understand data requirements and translate them into scalable solutions.
Ensure high data quality and integrity across platforms and processes.
Implement CI/CD pipelines and best practices for infrastructure as code using CloudFormation or similar tools.

Required Skills & Experience:

Strong hands-on experience with Python or PySpark for data processing.
Deep knowledge of AWS Glue, Athena, Lambda, Redshift, RDS, DMS, and CloudFormation.
Proficiency in writing complex SQL queries and optimizing them for performance.
Familiarity with serverless architectures and AWS best practices.
Experience in designing and maintaining robust data architectures and data lakes.
Ability to troubleshoot and resolve data pipeline issues efficiently.
Strong communication and stakeholder management skills.

AWS Data Engineer

at Deqode

1 recruiter

Posted by Roshni Maji

Bengaluru (Bangalore), Pune, Mumbai, Chennai, Gurugram

5 - 7 yrs

₹5L - ₹19L / yr

Python

PySpark

Amazon Web Services (AWS)

aws

Amazon Redshift

+1 more

Position: AWS Data Engineer

Experience: 5 to 7 Years

Location: Bengaluru, Pune, Chennai, Mumbai, Gurugram

Work Mode: Hybrid (3 days work from office per week)

Employment Type: Full-time

About the Role:

Key Responsibilities:

Design, develop, and optimize scalable ETL pipelines using AWS Glue, Python/PySpark, and SQL.
Work extensively with AWS services such as Glue, Athena, Lambda, DMS, RDS, Redshift, CloudFormation, and other serverless technologies.
Implement and manage data lake and warehouse solutions using AWS Redshift and S3.
Optimize data models and storage for cost-efficiency and performance.
Write advanced SQL queries to support complex data analysis and reporting requirements.
Collaborate with stakeholders to understand data requirements and translate them into scalable solutions.
Ensure high data quality and integrity across platforms and processes.
Implement CI/CD pipelines and best practices for infrastructure as code using CloudFormation or similar tools.

Required Skills & Experience:

Strong hands-on experience with Python or PySpark for data processing.
Deep knowledge of AWS Glue, Athena, Lambda, Redshift, RDS, DMS, and CloudFormation.
Proficiency in writing complex SQL queries and optimizing them for performance.
Familiarity with serverless architectures and AWS best practices.
Experience in designing and maintaining robust data architectures and data lakes.
Ability to troubleshoot and resolve data pipeline issues efficiently.
Strong communication and stakeholder management skills.

Position: AWS Data Engineer

Experience: 5 to 7 Years

Location: Bengaluru, Pune, Chennai, Mumbai, Gurugram

Work Mode: Hybrid (3 days work from office per week)

Employment Type: Full-time

About the Role:

Key Responsibilities:

Design, develop, and optimize scalable ETL pipelines using AWS Glue, Python/PySpark, and SQL.
Work extensively with AWS services such as Glue, Athena, Lambda, DMS, RDS, Redshift, CloudFormation, and other serverless technologies.
Implement and manage data lake and warehouse solutions using AWS Redshift and S3.
Optimize data models and storage for cost-efficiency and performance.
Write advanced SQL queries to support complex data analysis and reporting requirements.
Collaborate with stakeholders to understand data requirements and translate them into scalable solutions.
Ensure high data quality and integrity across platforms and processes.
Implement CI/CD pipelines and best practices for infrastructure as code using CloudFormation or similar tools.

Required Skills & Experience:

Strong hands-on experience with Python or PySpark for data processing.
Deep knowledge of AWS Glue, Athena, Lambda, Redshift, RDS, DMS, and CloudFormation.
Proficiency in writing complex SQL queries and optimizing them for performance.
Familiarity with serverless architectures and AWS best practices.
Experience in designing and maintaining robust data architectures and data lakes.
Ability to troubleshoot and resolve data pipeline issues efficiently.
Strong communication and stakeholder management skills.

ETL Developer

at Deqode

1 recruiter

Posted by Mokshada Solanki

Bengaluru (Bangalore), Mumbai, Pune, Gurugram

4 - 5 yrs

₹4L - ₹20L / yr

SQL

Amazon Web Services (AWS)

Migration

PySpark

ETL

Job Summary:

Seeking a seasoned SQL + ETL Developer with 4+ years of experience in managing large-scale datasets and cloud-based data pipelines. The ideal candidate is hands-on with MySQL, PySpark, AWS Glue, and ETL workflows, with proven expertise in AWS migration and performance optimization.

Key Responsibilities:

Develop and optimize complex SQL queries and stored procedures to handle large datasets (100+ million records).
Build and maintain scalable ETL pipelines using AWS Glue and PySpark.
Work on data migration tasks in AWS environments.
Monitor and improve database performance; automate key performance indicators and reports.
Collaborate with cross-functional teams to support data integration and delivery requirements.
Write shell scripts for automation and manage ETL jobs efficiently.

Required Skills:

Strong experience with MySQL, complex SQL queries, and stored procedures.
Hands-on experience with AWS Glue, PySpark, and ETL processes.
Good understanding of AWS ecosystem and migration strategies.
Proficiency in shell scripting.
Strong communication and collaboration skills.

Nice to Have:

Working knowledge of Python.
Experience with AWS RDS.

Job Summary:

Key Responsibilities:

Develop and optimize complex SQL queries and stored procedures to handle large datasets (100+ million records).
Build and maintain scalable ETL pipelines using AWS Glue and PySpark.
Work on data migration tasks in AWS environments.
Monitor and improve database performance; automate key performance indicators and reports.
Collaborate with cross-functional teams to support data integration and delivery requirements.
Write shell scripts for automation and manage ETL jobs efficiently.

Required Skills:

Strong experience with MySQL, complex SQL queries, and stored procedures.
Hands-on experience with AWS Glue, PySpark, and ETL processes.
Good understanding of AWS ecosystem and migration strategies.
Proficiency in shell scripting.
Strong communication and collaboration skills.

Nice to Have:

Working knowledge of Python.
Experience with AWS RDS.

Data Engineer - AWS

at Deqode

1 recruiter

Posted by Shraddha Katare

Bengaluru (Bangalore), Pune, Chennai, Mumbai, Gurugram

5 - 7 yrs

₹5L - ₹19L / yr

Amazon Web Services (AWS)

Python

PySpark

SQL

redshift

Profile: AWS Data Engineer

Mode- Hybrid

Experience- 5+7 years

Locations - Bengaluru, Pune, Chennai, Mumbai, Gurugram

Roles and Responsibilities

Design and maintain ETL pipelines using AWS Glue and Python/PySpark
Optimize SQL queries for Redshift and Athena
Develop Lambda functions for serverless data processing
Configure AWS DMS for database migration and replication
Implement infrastructure as code with CloudFormation
Build optimized data models for performance
Manage RDS databases and AWS service integrations
Troubleshoot and improve data processing efficiency
Gather requirements from business stakeholders
Implement data quality checks and validation
Document data pipelines and architecture
Monitor workflows and implement alerting
Keep current with AWS services and best practices

Required Technical Expertise:

Python/PySpark for data processing
AWS Glue for ETL operations
Redshift and Athena for data querying
AWS Lambda and serverless architecture
AWS DMS and RDS management
CloudFormation for infrastructure
SQL optimization and performance tuning

Profile: AWS Data Engineer

Mode- Hybrid

Experience- 5+7 years

Locations - Bengaluru, Pune, Chennai, Mumbai, Gurugram

Roles and Responsibilities

Design and maintain ETL pipelines using AWS Glue and Python/PySpark
Optimize SQL queries for Redshift and Athena
Develop Lambda functions for serverless data processing
Configure AWS DMS for database migration and replication
Implement infrastructure as code with CloudFormation
Build optimized data models for performance
Manage RDS databases and AWS service integrations
Troubleshoot and improve data processing efficiency
Gather requirements from business stakeholders
Implement data quality checks and validation
Document data pipelines and architecture
Monitor workflows and implement alerting
Keep current with AWS services and best practices

Required Technical Expertise:

Python/PySpark for data processing
AWS Glue for ETL operations
Redshift and Athena for data querying
AWS Lambda and serverless architecture
AWS DMS and RDS management
CloudFormation for infrastructure
SQL optimization and performance tuning

AWS Data Engineer

at Deqode

1 recruiter

Posted by Alisha Das

Pune, Mumbai, Bengaluru (Bangalore), Chennai

4 - 7 yrs

₹5L - ₹15L / yr

Amazon Web Services (AWS)

Python

PySpark

Glue semantics

Amazon Redshift

+1 more

Job Overview:

We are seeking an experienced AWS Data Engineer to join our growing data team. The ideal candidate will have hands-on experience with AWS Glue, Redshift, PySpark, and other AWS services to build robust, scalable data pipelines. This role is perfect for someone passionate about data engineering, automation, and cloud-native development.

Key Responsibilities:

Design, build, and maintain scalable and efficient ETL pipelines using AWS Glue, PySpark, and related tools.
Integrate data from diverse sources and ensure its quality, consistency, and reliability.
Work with large datasets in structured and semi-structured formats across cloud-based data lakes and warehouses.
Optimize and maintain data infrastructure, including Amazon Redshift, for high performance.
Collaborate with data analysts, data scientists, and product teams to understand data requirements and deliver solutions.
Automate data validation, transformation, and loading processes to support real-time and batch data processing.
Monitor and troubleshoot data pipeline issues and ensure smooth operations in production environments.

Required Skills:

5 to 7 years of hands-on experience in data engineering roles.
Strong proficiency in Python and PySpark for data transformation and scripting.
Deep understanding and practical experience with AWS Glue, AWS Redshift, S3, and other AWS data services.
Solid understanding of SQL and database optimization techniques.
Experience working with large-scale data pipelines and high-volume data environments.
Good knowledge of data modeling, warehousing, and performance tuning.

Preferred/Good to Have:

Experience with workflow orchestration tools like Airflow or Step Functions.
Familiarity with CI/CD for data pipelines.
Knowledge of data governance and security best practices on AWS.

Job Overview:

Key Responsibilities:

Design, build, and maintain scalable and efficient ETL pipelines using AWS Glue, PySpark, and related tools.
Integrate data from diverse sources and ensure its quality, consistency, and reliability.
Work with large datasets in structured and semi-structured formats across cloud-based data lakes and warehouses.
Optimize and maintain data infrastructure, including Amazon Redshift, for high performance.
Collaborate with data analysts, data scientists, and product teams to understand data requirements and deliver solutions.
Automate data validation, transformation, and loading processes to support real-time and batch data processing.
Monitor and troubleshoot data pipeline issues and ensure smooth operations in production environments.

Required Skills:

5 to 7 years of hands-on experience in data engineering roles.
Strong proficiency in Python and PySpark for data transformation and scripting.
Deep understanding and practical experience with AWS Glue, AWS Redshift, S3, and other AWS data services.
Solid understanding of SQL and database optimization techniques.
Experience working with large-scale data pipelines and high-volume data environments.
Good knowledge of data modeling, warehousing, and performance tuning.

Preferred/Good to Have:

Experience with workflow orchestration tools like Airflow or Step Functions.
Familiarity with CI/CD for data pipelines.
Knowledge of data governance and security best practices on AWS.

ETL Developer

at Deqode

1 recruiter

Posted by Shraddha Katare

Pune, Mumbai, Bengaluru (Bangalore), Gurugram

4 - 6 yrs

₹5L - ₹10L / yr

ETL

SQL

Amazon Web Services (AWS)

PySpark

KPI

Role - ETL Developer

Work Mode - Hybrid

Experience- 4+ years

Location - Pune, Gurgaon, Bengaluru, Mumbai

Required Skills - AWS, AWS Glue, Pyspark, ETL, SQL

Required Skills:

4+ years of hands-on experience in MySQL, including SQL queries and procedure development
Experience in Pyspark, AWS, AWS Glue
Experience in AWS ,Migration
Experience with automated scripting and tracking KPIs/metrics for database performance
Proficiency in shell scripting and ETL.
Strong communication skills and a collaborative team player
Knowledge of Python and AWS RDS is a plus

Role - ETL Developer

Work Mode - Hybrid

Experience- 4+ years

Location - Pune, Gurgaon, Bengaluru, Mumbai

Required Skills - AWS, AWS Glue, Pyspark, ETL, SQL

Required Skills:

4+ years of hands-on experience in MySQL, including SQL queries and procedure development
Experience in Pyspark, AWS, AWS Glue
Experience in AWS ,Migration
Experience with automated scripting and tracking KPIs/metrics for database performance
Proficiency in shell scripting and ETL.
Strong communication skills and a collaborative team player
Knowledge of Python and AWS RDS is a plus

Data Engineer

at Wissen Technology

4 recruiters

Posted by Hanisha Pralayakaveri

Bengaluru (Bangalore), Mumbai

5 - 9 yrs

Best in industry

Python

Amazon Web Services (AWS)

PySpark

Data engineering

Job Description: Data Engineer

Position Overview:

Role Overview

We are seeking a skilled Python Data Engineer with expertise in designing and implementing data solutions using the AWS cloud platform. The ideal candidate will be responsible for building and maintaining scalable, efficient, and secure data pipelines while leveraging Python and AWS services to enable robust data analytics and decision-making processes.

Key Responsibilities

· Design, develop, and optimize data pipelines using Python and AWS services such as Glue, Lambda, S3, EMR, Redshift, Athena, and Kinesis.

· Implement ETL/ELT processes to extract, transform, and load data from various sources into centralized repositories (e.g., data lakes or data warehouses).

· Collaborate with cross-functional teams to understand business requirements and translate them into scalable data solutions.

· Monitor, troubleshoot, and enhance data workflows for performance and cost optimization.

· Ensure data quality and consistency by implementing validation and governance practices.

· Work on data security best practices in compliance with organizational policies and regulations.

· Automate repetitive data engineering tasks using Python scripts and frameworks.

· Leverage CI/CD pipelines for deployment of data workflows on AWS.