Big data Jobs in Pune

50+ Big data Jobs in Pune | Big data Job openings in Pune

Apply to 50+ Big data Jobs in Pune on CutShort.io. Explore the latest Big data Job opportunities across top companies like Google, Amazon & Adobe.

PySpark/Scala Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Bengaluru (Bangalore), Hyderabad, Pune, Delhi, Kolkata, Chennai

5 - 8 yrs

₹7L - ₹30L / yr

Scala

Python

PySpark

Apache Hive

Spark

+3 more

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

IBM MDM Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Chennai, Hyderabad, Kolkata, Delhi, Pune, Bengaluru (Bangalore)

5 - 8 yrs

₹7L - ₹30L / yr

Informatica MDM

MDM

ETL

Big Data

• Technical expertise in the area of development of Master Data Management, data extraction, transformation, and load (ETL) applications, big data using existing and emerging technology platforms and cloud architecture

• Functions as lead developer• Support System Analysis, Technical/Data design, development, unit testing, and oversee end-to-end data solution.

• Technical SME in Master Data Management application, ETL, big data and cloud technologies

• Collaborate with IT teams to ensure technical designs and implementations account for requirements, standards, and best practices

• Performance tuning of end-to-end MDM, database, ETL, Big data processes or in the source/target database endpoints as needed.

• Mentor and advise junior members of team to provide guidance.

• Perform a technical lead and solution lead role for a team of onshore and offshore developers

• Functions as lead developer• Support System Analysis, Technical/Data design, development, unit testing, and oversee end-to-end data solution.

• Technical SME in Master Data Management application, ETL, big data and cloud technologies

• Collaborate with IT teams to ensure technical designs and implementations account for requirements, standards, and best practices

• Performance tuning of end-to-end MDM, database, ETL, Big data processes or in the source/target database endpoints as needed.

• Mentor and advise junior members of team to provide guidance.

• Perform a technical lead and solution lead role for a team of onshore and offshore developers

Big Data Engineer-Release

at Rigel Networks Pvt Ltd

Posted by Minakshi Soni

Pune

5 - 9 yrs

₹8L - ₹15L / yr

Big data Engineer

Software deployment

Release Management

Software release life cycle

Release engineering

+6 more

Dear Candidate,

We are urgently looking for a Release- Big data Engineer For Pune Location.

Experience : 5-8 yrs

Location : Pune

Skills: Big data Engineer , Release Engineer ,DevOps, Aws/Azure/GCP Cloud exp. ,

JD:

Oversee the end-to-end release lifecycle, from planning to post-production monitoring. Coordinate with cross-functional teams (DBA, BizOps, DevOps, DNS).
Partner with development teams to resolve technical challenges in deployment and automation test runs
Work with shared services DBA teams for schema-based multi-tenancy designs and smooth migrations.
Drive automation for batch deployments and DR exercises. YAML based micro service deployment using shell/Python/Go
Provide oversight for Big Data toolsets for deployment (e.g., Spark, Hive, HBase) in private cloud and public cloud CDP environments
Ensure high-quality releases with a focus on stability and long-term performance.
Able to run the automation batch scripts and debug the deployment and functional aspects/ work with dev leads to resolve the release cycle issues.

Regards,

Minakshi Soni

Executive- Talent Acquisition

Rigel Networks

Dear Candidate,

We are urgently looking for a Release- Big data Engineer For Pune Location.

Experience : 5-8 yrs

Location : Pune

Skills: Big data Engineer , Release Engineer ,DevOps, Aws/Azure/GCP Cloud exp. ,

JD:

Oversee the end-to-end release lifecycle, from planning to post-production monitoring. Coordinate with cross-functional teams (DBA, BizOps, DevOps, DNS).
Partner with development teams to resolve technical challenges in deployment and automation test runs
Work with shared services DBA teams for schema-based multi-tenancy designs and smooth migrations.
Drive automation for batch deployments and DR exercises. YAML based micro service deployment using shell/Python/Go
Provide oversight for Big Data toolsets for deployment (e.g., Spark, Hive, HBase) in private cloud and public cloud CDP environments
Ensure high-quality releases with a focus on stability and long-term performance.
Able to run the automation batch scripts and debug the deployment and functional aspects/ work with dev leads to resolve the release cycle issues.

Regards,

Minakshi Soni

Executive- Talent Acquisition

Rigel Networks

QA Engineer

at Wissen Technology

4 recruiters

Posted by Vijayalakshmi Selvaraj

Bengaluru (Bangalore), Pune

5 - 10 yrs

Best in industry

PythonAnywhere

Amazon Web Services (AWS)

Big Data

At least 5 years of experience in testing and developing automation tests.

A minimum of 3 years of experience writing tests in Python, with a preference for experience in designing automation frameworks.

Experience in developing automation for big data testing, including data ingestion, data processing, and data migration, is highly desirable.

Familiarity with Playwright or other browser application testing frameworks is a significant advantage.

Proficiency in object-oriented programming and principles is required.

Extensive knowledge of AWS services is essential.

Strong expertise in REST API testing and SQL is required.

A solid understanding of testing and development life cycle methodologies is necessary.

Knowledge of the financial industry and trading systems is a plus

At least 5 years of experience in testing and developing automation tests.

A minimum of 3 years of experience writing tests in Python, with a preference for experience in designing automation frameworks.

Experience in developing automation for big data testing, including data ingestion, data processing, and data migration, is highly desirable.

Familiarity with Playwright or other browser application testing frameworks is a significant advantage.

Proficiency in object-oriented programming and principles is required.

Extensive knowledge of AWS services is essential.

Strong expertise in REST API testing and SQL is required.

A solid understanding of testing and development life cycle methodologies is necessary.

Knowledge of the financial industry and trading systems is a plus

Senior Data Engineer

at Wissen Technology

4 recruiters

Posted by Vijayalakshmi Selvaraj

Pune

5 - 10 yrs

Best in industry

Object Oriented Programming (OOPs)

Amazon Redshift

DSA

Big Data

Hadoop

+3 more

Job Summary:

We are seeking a skilled Senior Data Engineer with expertise in application programming, big data technologies, and cloud services. This role involves solving complex problems, designing scalable systems, and working with advanced technologies to deliver innovative solutions.

Key Responsibilities:

Develop and maintain scalable applications using OOP principles, data structures, and problem-solving skills.
Build robust solutions using Java, Python, or Scala.
Work with big data technologies like Apache Spark for large-scale data processing.
Utilize AWS services, especially Amazon Redshift, for cloud-based solutions.
Manage databases including SQL, NoSQL (e.g., MongoDB, Cassandra), with Snowflake as a plus.

Qualifications:

5+ years of experience in software development.
Strong skills in OOPS, data structures, and problem-solving.
Proficiency in Java, Python, or Scala.
Experience with Spark, AWS (Redshift mandatory), and databases (SQL/NoSQL).
Snowflake experience is good to have.

Job Summary:

Key Responsibilities:

Develop and maintain scalable applications using OOP principles, data structures, and problem-solving skills.
Build robust solutions using Java, Python, or Scala.
Work with big data technologies like Apache Spark for large-scale data processing.
Utilize AWS services, especially Amazon Redshift, for cloud-based solutions.
Manage databases including SQL, NoSQL (e.g., MongoDB, Cassandra), with Snowflake as a plus.

Qualifications:

5+ years of experience in software development.
Strong skills in OOPS, data structures, and problem-solving.
Proficiency in Java, Python, or Scala.
Experience with Spark, AWS (Redshift mandatory), and databases (SQL/NoSQL).
Snowflake experience is good to have.

Customer Success / Analytics Team

at DocNexus

Posted by Mahek Chhatrapati

Remote, Hyderabad, Pune

2 - 10 yrs

₹10L - ₹25L / yr

SQL Query Analyzer

Big Data

Customer Support

At DocNexus, we’re revolutionizing how life sciences companies search and generate insights. Our search platform unlocks powerful insights, and we're seeking a Customer Success Team Member with strong technical skills to help our customers harness its full potential.

What you’ll do:

Customer Support: Troubleshoot and resolve customer queries, particularly around referral reports, data anomalies, and data generation using our platform.
Data Queries (BigQuery/ClickHouse): Respond to customer requests for custom data queries, working with large datasets in BigQuery and ClickHouse to deliver precise insights.
Onboarding & Training: Lead onboarding for new customers, guide teams on platform usage, and manage access requests.
Listen & Improve: Collect and act on customer feedback to continuously improve the platform, collaborating with the product team to enhance functionality.
Technical Documentation: Assist with technical resources and help create training materials for both internal and customer use.

What you bring:

Strong Technical Skills: Proficient in querying with BigQuery and ClickHouse. Comfortable working with complex data, writing custom queries, and resolving technical issues.
Customer-Focused: Excellent communication skills, able to translate technical data insights to non-technical users and provide solutions clearly and effectively.
Problem-Solver: Strong analytical skills and a proactive mindset to address customer needs and overcome challenges in a fast-paced environment.
Team Player: Work collaboratively with both internal teams and customers to ensure success.

If you're passionate about data, thrive in a technical environment, and are excited to support life sciences teams in their data-driven decision-making, we'd love to hear from you!

What you’ll do:

Customer Support: Troubleshoot and resolve customer queries, particularly around referral reports, data anomalies, and data generation using our platform.
Data Queries (BigQuery/ClickHouse): Respond to customer requests for custom data queries, working with large datasets in BigQuery and ClickHouse to deliver precise insights.
Onboarding & Training: Lead onboarding for new customers, guide teams on platform usage, and manage access requests.
Listen & Improve: Collect and act on customer feedback to continuously improve the platform, collaborating with the product team to enhance functionality.
Technical Documentation: Assist with technical resources and help create training materials for both internal and customer use.

What you bring:

Strong Technical Skills: Proficient in querying with BigQuery and ClickHouse. Comfortable working with complex data, writing custom queries, and resolving technical issues.
Customer-Focused: Excellent communication skills, able to translate technical data insights to non-technical users and provide solutions clearly and effectively.
Problem-Solver: Strong analytical skills and a proactive mindset to address customer needs and overcome challenges in a fast-paced environment.
Team Player: Work collaboratively with both internal teams and customers to ensure success.

If you're passionate about data, thrive in a technical environment, and are excited to support life sciences teams in their data-driven decision-making, we'd love to hear from you!

Cassandra DBA

at Cornertree

1 recruiter

Posted by Deepesh Shrimal

Bengaluru (Bangalore), Pune, Hyderabad, Gurugram, Noida

5 - 10 yrs

₹15L - ₹30L / yr

Cassandra

PySpark

Data engineering

Big Data

Hadoop

+3 more

Skills:

Experience with Cassandra, including installing configuring and monitoring a Cassandra cluster.

Experience with Cassandra data modeling and CQL scripting. Experience with DataStax Enterprise Graph

Experience with both Windows and Linux Operating Systems. Knowledge of Microsoft .NET Framework (C#, NETCore).

Ability to perform effectively in a team-oriented environment

Skills:

Experience with Cassandra, including installing configuring and monitoring a Cassandra cluster.

Experience with Cassandra data modeling and CQL scripting. Experience with DataStax Enterprise Graph

Experience with both Windows and Linux Operating Systems. Knowledge of Microsoft .NET Framework (C#, NETCore).

Ability to perform effectively in a team-oriented environment

Azure Data Engineer

at TVARIT GmbH

2 candid answers

Posted by Shivani Kawade

Remote, Pune

2 - 4 yrs

₹8L - ₹20L / yr

Python

PySpark

ETL

databricks

Azure

+6 more

TVARIT GmbH develops and delivers solutions in the field of artificial intelligence (AI) for the Manufacturing, automotive, and process industries. With its software products, TVARIT makes it possible for its customers to make intelligent and well-founded decisions, e.g., in forward-looking Maintenance, increasing the OEE and predictive quality. We have renowned reference customers, competent technology, a good research team from renowned Universities, and the award of a renowned AI prize (e.g., EU Horizon 2020) which makes Tvarit one of the most innovative AI companies in Germany and Europe.

We are looking for a self-motivated person with a positive "can-do" attitude and excellent oral and written communication skills in English.

We are seeking a skilled and motivated Data Engineer from the manufacturing Industry with over two years of experience to join our team. As a data engineer, you will be responsible for designing, building, and maintaining the infrastructure required for the collection, storage, processing, and analysis of large and complex data sets. The ideal candidate will have a strong foundation in ETL pipelines and Python, with additional experience in Azure and Terraform being a plus. This role requires a proactive individual who can contribute to our data infrastructure and support our analytics and data science initiatives.

Skills Required

Experience in the manufacturing industry (metal industry is a plus)
2+ years of experience as a Data Engineer
Experience in data cleaning & structuring and data manipulation
ETL Pipelines: Proven experience in designing, building, and maintaining ETL pipelines.
Python: Strong proficiency in Python programming for data manipulation, transformation, and automation.
Experience in SQL and data structures
Knowledge in big data technologies such as Spark, Flink, Hadoop, Apache and NoSQL databases.
Knowledge of cloud technologies (at least one) such as AWS, Azure, and Google Cloud Platform.
Proficient in data management and data governance
Strong analytical and problem-solving skills.
Excellent communication and teamwork abilities.

Nice To Have

Azure: Experience with Azure data services (e.g., Azure Data Factory, Azure Databricks, Azure SQL Database).
Terraform: Knowledge of Terraform for infrastructure as code (IaC) to manage cloud.

We are looking for a self-motivated person with a positive "can-do" attitude and excellent oral and written communication skills in English.

Skills Required

Experience in the manufacturing industry (metal industry is a plus)
2+ years of experience as a Data Engineer
Experience in data cleaning & structuring and data manipulation
ETL Pipelines: Proven experience in designing, building, and maintaining ETL pipelines.
Python: Strong proficiency in Python programming for data manipulation, transformation, and automation.
Experience in SQL and data structures
Knowledge in big data technologies such as Spark, Flink, Hadoop, Apache and NoSQL databases.
Knowledge of cloud technologies (at least one) such as AWS, Azure, and Google Cloud Platform.
Proficient in data management and data governance
Strong analytical and problem-solving skills.
Excellent communication and teamwork abilities.

Nice To Have

Azure: Experience with Azure data services (e.g., Azure Data Factory, Azure Databricks, Azure SQL Database).
Terraform: Knowledge of Terraform for infrastructure as code (IaC) to manage cloud.

Senior Data Engineer

at TVARIT GmbH

2 candid answers

Posted by Shivani Kawade

Remote, Pune

2 - 6 yrs

₹8L - ₹25L / yr

SQL Azure

databricks

Python

SQL

ETL

+9 more

TVARIT GmbH develops and delivers solutions in the field of artificial intelligence (AI) for the Manufacturing, automotive, and process industries. With its software products, TVARIT makes it possible for its customers to make intelligent and well-founded decisions, e.g., in forward-looking Maintenance, increasing the OEE and predictive quality. We have renowned reference customers, competent technology, a good research team from renowned Universities, and the award of a renowned AI prize (e.g., EU Horizon 2020) which makes TVARIT one of the most innovative AI companies in Germany and Europe.

We are looking for a self-motivated person with a positive "can-do" attitude and excellent oral and written communication skills in English.

We are seeking a skilled and motivated senior Data Engineer from the manufacturing Industry with over four years of experience to join our team. The Senior Data Engineer will oversee the department’s data infrastructure, including developing a data model, integrating large amounts of data from different systems, building & enhancing a data lake-house & subsequent analytics environment, and writing scripts to facilitate data analysis. The ideal candidate will have a strong foundation in ETL pipelines and Python, with additional experience in Azure and Terraform being a plus. This role requires a proactive individual who can contribute to our data infrastructure and support our analytics and data science initiatives.

Skills Required:

Experience in the manufacturing industry (metal industry is a plus)
4+ years of experience as a Data Engineer
Experience in data cleaning & structuring and data manipulation
Architect and optimize complex data pipelines, leading the design and implementation of scalable data infrastructure, and ensuring data quality and reliability at scale
ETL Pipelines: Proven experience in designing, building, and maintaining ETL pipelines.
Python: Strong proficiency in Python programming for data manipulation, transformation, and automation.
Experience in SQL and data structures
Knowledge in big data technologies such as Spark, Flink, Hadoop, Apache, and NoSQL databases.
Knowledge of cloud technologies (at least one) such as AWS, Azure, and Google Cloud Platform.
Proficient in data management and data governance
Strong analytical experience & skills that can extract actionable insights from raw data to help improve the business.
Strong analytical and problem-solving skills.
Excellent communication and teamwork abilities.

Nice To Have:

Azure: Experience with Azure data services (e.g., Azure Data Factory, Azure Databricks, Azure SQL Database).
Terraform: Knowledge of Terraform for infrastructure as code (IaC) to manage cloud.
Bachelor’s degree in computer science, Information Technology, Engineering, or a related field from top-tier Indian Institutes of Information Technology (IIITs).
Benefits And Perks
A culture that fosters innovation, creativity, continuous learning, and resilience
Progressive leave policy promoting work-life balance
Mentorship opportunities with highly qualified internal resources and industry-driven programs
Multicultural peer groups and supportive workplace policies
Annual workcation program allowing you to work from various scenic locations
Experience the unique environment of a dynamic start-up

Why should you join TVARIT ?

Working at TVARIT, a deep-tech German IT startup, offers a unique blend of innovation, collaboration, and growth opportunities. We seek individuals eager to adapt and thrive in a rapidly evolving environment.

If this opportunity excites you and aligns with your career aspirations, we encourage you to apply today!

TVARIT GmbH develops and delivers solutions in the field of artificial intelligence (AI) for the Manufacturing, automotive, and process industries. With its software products, TVARIT makes it possible for its customers to make intelligent and well-founded decisions, e.g., in forward-looking Maintenance, increasing the OEE and predictive quality. We have renowned reference customers, competent technology, a good research team from renowned Universities, and the award of a renowned AI prize (e.g., EU Horizon 2020) which makes TVARIT one of the most innovative AI companies in Germany and Europe.

We are looking for a self-motivated person with a positive "can-do" attitude and excellent oral and written communication skills in English.

Skills Required:

Experience in the manufacturing industry (metal industry is a plus)
4+ years of experience as a Data Engineer
Experience in data cleaning & structuring and data manipulation
Architect and optimize complex data pipelines, leading the design and implementation of scalable data infrastructure, and ensuring data quality and reliability at scale
ETL Pipelines: Proven experience in designing, building, and maintaining ETL pipelines.
Python: Strong proficiency in Python programming for data manipulation, transformation, and automation.
Experience in SQL and data structures
Knowledge in big data technologies such as Spark, Flink, Hadoop, Apache, and NoSQL databases.
Knowledge of cloud technologies (at least one) such as AWS, Azure, and Google Cloud Platform.
Proficient in data management and data governance
Strong analytical experience & skills that can extract actionable insights from raw data to help improve the business.
Strong analytical and problem-solving skills.
Excellent communication and teamwork abilities.

Nice To Have:

Azure: Experience with Azure data services (e.g., Azure Data Factory, Azure Databricks, Azure SQL Database).
Terraform: Knowledge of Terraform for infrastructure as code (IaC) to manage cloud.
Bachelor’s degree in computer science, Information Technology, Engineering, or a related field from top-tier Indian Institutes of Information Technology (IIITs).
Benefits And Perks
A culture that fosters innovation, creativity, continuous learning, and resilience
Progressive leave policy promoting work-life balance
Mentorship opportunities with highly qualified internal resources and industry-driven programs
Multicultural peer groups and supportive workplace policies
Annual workcation program allowing you to work from various scenic locations
Experience the unique environment of a dynamic start-up

Why should you join TVARIT ?

If this opportunity excites you and aligns with your career aspirations, we encourage you to apply today!

Kafka Developer

at iLink Systems

1 video

1 recruiter

Posted by Ganesh Sooriyamoorthu

Chennai, Pune, Noida, Bengaluru (Bangalore)

5 - 15 yrs

₹10L - ₹15L / yr

Apache Kafka

Big Data

Java

Spark

Hadoop

+1 more

KSQL
Data Engineering spectrum (Java/Spark)
Spark Scala / Kafka Streaming
Confluent Kafka components
Basic understanding of Hadoop

KSQL
Data Engineering spectrum (Java/Spark)
Spark Scala / Kafka Streaming
Confluent Kafka components
Basic understanding of Hadoop

SDE-1 (Backend Developer)

at Vcriate Internet Services Private Limited

Posted by Shivashish Mishra

Pune

0 - 1 yrs

₹10L - ₹15L / yr

Java

J2EE

Spring Boot

Hibernate (Java)

SQL

+6 more

1. Work closely with senior engineers to design, implement and deploy applications that impact the business with an emphasis on mobile, payments, and product website development
2. Design software and make technology choices across the stack (from data storage to application to front-end)
3. Understand a range of tier-1 systems/services that power our product to make scalable changes to critical path code
4. Own the design and delivery of an integral piece of a tier-1 system or application
5. Work closely with product managers, UX designers, and end users and integrate software components into a fully functional system
6. Work on the management and execution of project plans and delivery commitments
7. Take ownership of product/feature end-to-end for all phases from the development to the production
8. Ensure the developed features are scalable and highly available with no quality concerns
9. Work closely with senior engineers for refining and implementation
10. Manage and execute project plans and delivery commitments
11. Create and execute appropriate quality plans, project plans, test strategies, and processes for development activities in concert with business and project management efforts

Test Automation Engineer

at Concentric AI

7 candid answers

1 product

Posted by Gopal Agarwal

Pune

2 - 10 yrs

₹2L - ₹50L / yr

Software Testing (QA)

Test Automation (QA)

Python

Jenkins

Automation

+9 more

•3-10 years of experience in test automation for distributed scalable software

• Good QA engineering background with proven automation skills

• Able to understand, design and define approach for automation (Backend/UI/service)

• Design and develop automation scripts for QA testing and tools for quality measurements

• Good to have knowledge of Microservices, API, Web services testing

• Strong in Cloud Engineering skillsets (performance, response time, horizontal scale testing)

• Expertise using automation tools/frameworks (Pytest, Jenkins, Robot, etc)

• Expert at one of the scripting languages – Python, shell, etc

• High level system admin skills to configure and manage test environments

• Basics of Kubernetes and databases like Cassandra, Elasticsearch, MongoDB, etc

• Must have worked in agile environment with CI/CD knowledge

• Having security testing background is a plus