Bigdata jobs

50+ Big data Jobs in India

Apply to 50+ Big data Jobs on CutShort.io. Find your next job, effortlessly. Browse Big data Jobs and apply today!

Data Engineer

at venanalytics

2 candid answers

Posted by Rincy jain

Remote only

2 - 3 yrs

₹7L - ₹12L / yr

SQL

PowerBI

Python

Big Data

About the Role:

We are looking for a highly skilled Data Engineer with a strong foundation in Power BI, SQL, Python, and Big Data ecosystems to help design, build, and optimize end-to-end data solutions. The ideal candidate is passionate about solving complex data problems, transforming raw data into actionable insights, and contributing to data-driven decision-making across the organization.

Key Responsibilities:

Data Modelling & Visualization
Build scalable and high-quality data models in Power BI using best practices.
Define relationships, hierarchies, and measures to support effective storytelling.
Ensure dashboards meet standards in accuracy, visualization principles, and timelines.
Data Transformation & ETL
Perform advanced data transformation using Power Query (M Language) beyond UI-based steps.
Design and optimize ETL pipelines using SQL, Python, and Big Data tools.
Manage and process large-scale datasets from various sources and formats.
Business Problem Translation
Collaborate with cross-functional teams to translate complex business problems into scalable, data-centric solutions.
Decompose business questions into testable hypotheses and identify relevant datasets for validation.
Performance & Troubleshooting
Continuously optimize performance of dashboards and pipelines for latency, reliability, and scalability.
Troubleshoot and resolve issues related to data access, quality, security, and latency, adhering to SLAs.
Analytical Storytelling
Apply analytical thinking to design insightful dashboards—prioritizing clarity and usability over aesthetics.
Develop data narratives that drive business impact.
Solution Design
Deliver wireframes, POCs, and final solutions aligned with business requirements and technical feasibility.

Required Skills & Experience:

3+ years of experience as a Data Engineer or in a similar data-focused role.
Strong expertise in Power BI: data modeling, DAX, Power Query (M Language), and visualization best practices.
Hands-on with Python and SQL for data analysis, automation, and backend data transformation.
Deep understanding of data storytelling, visual best practices, and dashboard performance tuning.
Familiarity with DAX Studio and Tabular Editor.
Experience in handling high-volume data in production environments.

Preferred (Good to Have):

Exposure to Big Data technologies such as:
PySpark
Hadoop
Hive / HDFS
Spark Streaming (optional but preferred)

Why Join Us?

Work with a team that's passionate about data innovation.
Exposure to modern data stack and tools.
Flat structure and collaborative culture.
Opportunity to influence data strategy and architecture decisions.

GCP Data Engineer - BigQuery

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Bengaluru (Bangalore), Hyderabad, Kolkata

6 - 10 yrs

₹7L - ₹30L / yr

Google Cloud Platform (GCP)

Big Data

BigQuery

PySpark

Work with the team in the capacity of a GCP Data Engineer on day-to-day activities.
Solve problems at hand with utmost clarity and speed.
Work with data analysts and architects to help them solve any specific issues with tooling/processes.
Design, build, and operationalize large-scale enterprise data solutions and applications using one or more GCP data and analytics services (BigQuery, Dataflow, Cloud SQL, Cloud Functions, Data Lake) in combination with third-party technologies (Python/Java/React.js) and Airflow ETL skills.
Design and build production data pipelines from ingestion to consumption within a big data architecture.
GCP BigQuery modeling and performance tuning techniques.
RDBMS and NoSQL database experience.
Knowledge of orchestrating workloads on the cloud.

DevOps Engineer- GCP Expert

at venanalytics

2 candid answers

Posted by Rincy jain

Remote only

3 - 5 yrs

₹9L - ₹15L / yr

Google Cloud Platform (GCP)

Docker

Kubernetes

Django

API

+1 more

About the Role

We are looking for a highly motivated DevOps Engineer with a strong background in cloud technologies, big data ecosystems, and software development lifecycles to lead cross-functional teams in delivering high-impact projects. The ideal candidate will combine excellent project management skills with technical acumen in GCP, DevOps, and Python-based applications.

Key Responsibilities

Lead end-to-end project planning, execution, and delivery, ensuring alignment with business goals and timelines.
Create and maintain project documentation including detailed timelines, sprint boards, risk logs, and weekly status reports.
Facilitate Agile ceremonies: daily stand-ups, sprint planning, retrospectives, and backlog grooming.
Actively manage risks, scope changes, resource allocation, and project dependencies to ensure delivery without disruptions.
Ensure compliance with QA processes and security/compliance standards throughout the SDLC.
Collaborate with stakeholders and senior leadership to communicate progress, blockers, and key milestones.
Provide mentorship and support to cross-functional team members to drive continuous improvement and team performance.
Coordinate with clients and act as a key point of contact for requirement gathering, updates, and escalations.

Required Skills & Experience

Cloud & DevOps

Proficient in Google Cloud Platform (GCP) services: Compute, Storage, Networking, IAM.
Hands-on experience with cloud deployments and infrastructure as code.
Strong working knowledge of CI/CD pipelines, Docker, Kubernetes, and Terraform (or similar tools).

Big Data & Data Engineering

Experience with large-scale data processing using tools like PySpark, Hadoop, Hive, HDFS, and Spark Streaming (preferred).
Proven experience in managing and optimizing big data pipelines and ensuring high performance.

Programming & Frameworks

Strong proficiency in Python with experience in Django (REST APIs, ORM, deployment workflows).
Familiarity with Git and version control best practices.
Basic knowledge of Linux administration and shell scripting.

Nice to Have

Knowledge or prior experience in the Media & Advertising domain.
Experience in client-facing roles and handling stakeholder communications.
Proven ability to manage technical teams (5–6 members).

Why Join Us?

Work on cutting-edge cloud and data engineering projects
Collaborate with a talented, fast-paced team
Flexible work setup and culture of ownership

DevOps Engineer – Big Data Systems

at venanalytics

2 candid answers

Posted by Rincy jain

Mumbai

3 - 5 yrs

₹10L - ₹15L / yr

Google Cloud Platform (GCP)

DevOps

Python

Big Data

CI/CD

+3 more

About the Role

We are looking for a highly motivated Project Manager with a strong background in cloud technologies, big data ecosystems, and software development lifecycles to lead cross-functional teams in delivering high-impact projects. The ideal candidate will combine excellent project management skills with technical acumen in GCP, DevOps, and Python-based applications.

Key Responsibilities

Lead end-to-end project planning, execution, and delivery, ensuring alignment with business goals and timelines.
Create and maintain project documentation including detailed timelines, sprint boards, risk logs, and weekly status reports.
Facilitate Agile ceremonies: daily stand-ups, sprint planning, retrospectives, and backlog grooming.
Actively manage risks, scope changes, resource allocation, and project dependencies to ensure delivery without disruptions.
Ensure compliance with QA processes and security/compliance standards throughout the SDLC.
Collaborate with stakeholders and senior leadership to communicate progress, blockers, and key milestones.
Provide mentorship and support to cross-functional team members to drive continuous improvement and team performance.
Coordinate with clients and act as a key point of contact for requirement gathering, updates, and escalations.

Required Skills & Experience

Cloud & DevOps

Proficient in Google Cloud Platform (GCP) services: Compute, Storage, Networking, IAM.
Hands-on experience with cloud deployments and infrastructure as code.
Strong working knowledge of CI/CD pipelines, Docker, Kubernetes, and Terraform (or similar tools).

Big Data & Data Engineering

Experience with large-scale data processing using tools like PySpark, Hadoop, Hive, HDFS, and Spark Streaming (preferred).
Proven experience in managing and optimizing big data pipelines and ensuring high performance.

Programming & Frameworks

Strong proficiency in Python with experience in Django (REST APIs, ORM, deployment workflows).
Familiarity with Git and version control best practices.
Basic knowledge of Linux administration and shell scripting.

Nice to Have

Knowledge or prior experience in the Media & Advertising domain.
Experience in client-facing roles and handling stakeholder communications.
Proven ability to manage technical teams (5–6 members).

Why Join Us?

Work on cutting-edge cloud and data engineering projects
Collaborate with a talented, fast-paced team
Flexible work setup and culture of ownership
Continuous learning and upskilling environment
Inclusive health benefits

Lead Software Engineer

Auditoria.ai

Agency job

via Options Executive Search Pvt Ltd by Achyut Menon

Remote, Hyderabad

4 - 7 yrs

₹30L - ₹40L / yr

NodeJS (Node.js)

Amazon Web Services (AWS)

CI/CD

Cassandra

AWS RDS

+3 more

About Company:

Auditoria is an AI-driven SaaS automation provider for corporate finance that automates back-office business processes involving tasks, analytics, and responses in Vendor Management, Accounts Payable, Accounts Receivable, and Planning. By leveraging natural language processing, artificial intelligence, and machine learning, Auditoria removes friction and repetition from mundane tasks while automating complex functions and providing real-time visibility into cash performance. Corporate finance and accounting teams use Auditoria to accelerate business value while minimizing heavy IT involvement, improving business resilience, lowering attrition, and accelerating business insights.

Founded in 2019 and backed by Venrock, Workday Ventures, Neotribe Ventures, Engineering Capital, and Firebolt Ventures, we build intelligent automation by combining fine-grained analytical orchestration of a company's typical financial and audit workflows with conversational AI, delivering rapid value to the finance/audit back office.

In 2021, Auditoria earned industry recognition by being named to the Intelligent Apps Top 40 List, SSON's Shared Services & Outsourcing Impact Awards, the Constellation Research ShortList for AI-Driven Cognitive Applications, HFS Research Hot Vendors, 2021 CRN Emerging Vendors List, TiE50 Award, and the winner of the inaugural Pitch Event by Constellation Research.

The opportunity for you:

We are building an AI/ML-enabled SaaS solution to help manage the cash performance of enterprises. You would be working on solving complex problems in the FinTech space.

Responsibilities:

Own the design and development of the core areas of Auditoria’s product, leveraging the latest tech stack hosted on AWS cloud.
Collaborate across multiple teams and time zones to help deliver quality solutions as per the roadmap.
Partner with business teams to deliver incremental value to clients.
Champion Auditoria’s values and tech culture.

Requirements:

Worked with many of the following: multi-tenant SaaS, CI/CD environments, monitoring tools, Kubernetes and containers, Istio, workflow engines, data stores (RDBMS, Cassandra, Neo4j), AWS services, integrations with enterprise systems (SSO, email, etc.).
Experience with system design and data modeling; familiarity with RDBMS and Big Data.

Job Description:

Hands-on experience building applications leveraging AWS services like Step Functions, RDS, Cassandra, Kinesis, and ELK is a big plus.
Fluent coding skills in Node.js.
10+ years of professional, hands-on experience developing and shipping complex software systems.
Embraces startup setup, can work through unknowns, resource constraints & multiple priorities with creativity & resourcefulness.
Experience with the Agile development process and zeal for engineering best practices.
BS or MS in Computer Science or a related engineering degree, preferably from IIT/NIT/BITS.

Assistant Professor for CSE under CS (M.E., M.Tech)

at jk

Posted by mithul m

Coimbatore, Bengaluru (Bangalore), Mumbai

1 - 4 yrs

₹3.4L - ₹5L / yr

Python

Javascript

Java

HTML/CSS

Big Data

+2 more

The Assistant Professor in CSE will teach undergraduate and graduate courses, conduct independent and collaborative research, mentor students, and contribute to departmental and institutional service.

Data Engineer

at Tecblic Private Limited

Posted by HR HR

Ahmedabad

4 - 5 yrs

₹8L - ₹12L / yr

Microsoft Windows Azure

SQL

Python

PySpark

ETL

+2 more

🚀 We Are Hiring: Data Engineer | 4+ Years Experience 🚀

Job description

🔍 Job Title: Data Engineer

📍 Location: Ahmedabad

🚀 Work Mode: On-Site Opportunity

📅 Experience: 4+ Years

🕒 Employment Type: Full-Time

⏱️ Availability : Immediate Joiner Preferred

Join Our Team as a Data Engineer

We are seeking a passionate and experienced Data Engineer to be a part of our dynamic and forward-thinking team in Ahmedabad. This is an exciting opportunity for someone who thrives on transforming raw data into powerful insights and building scalable, high-performance data infrastructure.

As a Data Engineer, you will work closely with data scientists, analysts, and cross-functional teams to design robust data pipelines, optimize data systems, and enable data-driven decision-making across the organization.

Your Key Responsibilities

Architect, build, and maintain scalable and reliable data pipelines from diverse data sources.

Design effective data storage, retrieval mechanisms, and data models to support analytics and business needs.

Implement data validation, transformation, and quality monitoring processes.

Collaborate with cross-functional teams to deliver impactful, data-driven solutions.

Proactively identify bottlenecks and optimize existing workflows and processes.

Provide guidance and mentorship to junior engineers in the team.

Skills & Expertise We’re Looking For

3+ years of hands-on experience in Data Engineering or related roles.

Strong expertise in Python and data pipeline design.

Experience working with Big Data tools like Hadoop, Spark, Hive.

Proficiency with SQL, NoSQL databases, and data warehousing solutions.

Solid experience with cloud platforms (Azure).

Familiarity with distributed computing, data modeling, and performance tuning.

Understanding of DevOps, Power Automate, and Microsoft Fabric is a plus.

Strong analytical thinking, collaboration skills, excellent communication skills, and the ability to work independently or as part of a team.

Qualifications

Bachelor’s degree in Computer Science, Data Science, or a related field.

Assistant Professor for M.E., M.Tech under CSE

at GK

Posted by ashok s

Coimbatore

0 - 5 yrs

₹2.5L - ₹7L / yr

Python

C++

HTML/CSS

Javascript

Big Data

+2 more

A Computer Scientist/Engineer designs, develops, tests, and integrates computer software and hardware systems. This pivotal role blends deep knowledge of computer architecture with advanced software engineering, driving innovation in platforms ranging from embedded systems and networks to AI and cybersecurity.

Staff Data Engineer

at Hypersonix Inc

2 candid answers

1 product

Posted by Reshika Mendiratta

Remote only

7yrs+

Up to ₹40L / yr (Varies)

SQL

Python

ETL

Data engineering

Big Data

+2 more

About the Company

Hypersonix.ai is disrupting the e-commerce space with AI, ML, and advanced decision capabilities to drive real-time business insights. Hypersonix.ai has been built from the ground up with new-age technology to simplify the consumption of data for our customers in various industry verticals. Hypersonix.ai is seeking a well-rounded, hands-on product leader to help lead product management of key capabilities and features.

About the Role

We are looking for talented and driven Data Engineers at various levels to work with customers to build the data warehouse, analytical dashboards and ML capabilities as per customer needs.

Roles and Responsibilities

Create and maintain optimal data pipeline architecture
Assemble large, complex data sets that meet functional / non-functional business requirements; should write complex queries in an optimized way
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
Run ad-hoc analysis utilizing the data pipeline to provide actionable insights
Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
Keep our data separated and secure across national boundaries through multiple data centers and AWS regions
Work with analytics and data scientist team members and assist them in building and optimizing our product into an innovative industry leader

Requirements

Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases
Experience building and optimizing ‘big data’ data pipelines, architectures and data sets
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
Strong analytic skills related to working with unstructured datasets
Build processes supporting data transformation, data structures, metadata, dependency and workload management
A successful history of manipulating, processing and extracting value from large disconnected datasets
Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
Experience supporting and working with cross-functional teams in a dynamic environment
We are looking for a candidate with 7+ years of experience in a Data Engineer role, who has attained a Graduate degree in Computer Science, Information Technology or completed MCA.

Big Data, PySpark

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Pune

7 - 10 yrs

₹7L - ₹20L / yr

Big Data

PySpark

Google Cloud Platform (GCP)

True hands-on developer in programming languages like Java or Scala. Expertise in Apache Spark. Database modelling and experience working with any SQL or NoSQL database is a must. Working knowledge of scripting languages like shell/Python. Experience working with Cloudera is preferred. Orchestration tools like Airflow or Oozie would be a value addition. Knowledge of table formats like Delta or Iceberg is a plus. Working experience with version control systems like Git and build tools like Maven is recommended. Software development experience is good to have along with data engineering experience.

Cassandra DB admin (Quick joiners 30-45 days)

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by Jhansi Padiy

Bengaluru (Bangalore)

4 - 10 yrs

₹5L - ₹25L / yr

opscenter

Cassandra

Solr

Spark

Linux/Unix

1. Will need to manage Cassandra/Solr/Spark/Graph clusters.

2. Strong experience in UNIX required, including command line editors and scripting (shell, Python, Perl, etc.).

3. Expert understanding of Unix/Linux operating systems.

4. Monitor Cassandra clusters using OpsCenter.

5. Experience in upgrades of Cassandra/Solr/Spark required

6. Manage nodes of Cassandra/Solr/Spark cluster

7. Cassandra cluster connectivity and security

8. Must have supported a Cassandra cluster in production.

9. Using monitoring and management tools (e.g. OpsCenter)

10. Troubleshoot Cassandra/Solr/Spark Cluster

11. Database backup and recovery

12. Disk space management, Automate manual tasks

13. Certification as Datastax Administrator or Developer will be preferred.

14. Additional responsibility to monitor and maintain Tomcat cluster.

15. Should have experience in performance and capacity planning.

16. Should have experience in Datastax Cassandra & Apache Cassandra.

17. Provide 24x7 support for critical production systems.

18. Decommissioning and commissioning nodes on a running (existing) cluster.

19. Understanding of Cassandra cluster management using DataStax OpsCenter, DevCenter, nodetool utility commands, and other utilities.

20. Clear understanding of Relational DB / NoSQL Concepts and Architecture.

21. Ability to multi-task and context-switch effectively between different activities and teams

Senior Machine Learning Engineer

at NeoGenCode Technologies Pvt Ltd

2 candid answers

Posted by Akshay Patil

Chennai

8 - 12 yrs

₹10L - ₹26L / yr

Python

Machine Learning (ML)

Scikit-Learn

TensorFlow

PyTorch

+10 more

Job Title : Senior Machine Learning Engineer

Experience : 8+ Years

Location : Chennai

Notice Period : Immediate Joiners Only

Work Mode : Hybrid

Job Summary :

We are seeking an experienced Machine Learning Engineer with a strong background in Python, ML algorithms, and data-driven development.

The ideal candidate should have hands-on experience with popular ML frameworks and tools, solid understanding of clustering and classification techniques, and be comfortable working in Unix-based environments with Agile teams.

Mandatory Skills :

Programming Languages : Python
Machine Learning : Strong experience with ML algorithms, models, and libraries such as Scikit-learn, TensorFlow, and PyTorch
ML Concepts : Proficiency in supervised and unsupervised learning, including techniques such as K-Means, DBSCAN, and Fuzzy Clustering
Operating Systems : RHEL or any Unix-based OS
Databases : Oracle or any relational database
Version Control : Git
Development Methodologies : Agile

Desired Skills :

Experience with issue tracking tools such as Azure DevOps or JIRA.
Understanding of data science concepts.
Familiarity with Big Data algorithms, models, and libraries.

Data Engineer

Top tier global IT consulting company

Agency job

via AccioJob by AccioJobHiring Board

Hyderabad, Pune, Noida

0 - 0 yrs

₹11L - ₹11L / yr

Python

MySQL

Big Data

AccioJob is conducting a Walk-In Hiring Drive with a reputed global IT consulting company at AccioJob Skill Centres for the position of Data Engineer, specifically for female candidates.

To apply, register and select your slot here: https://go.acciojob.com/8p9ZXN

We will not consider your application if you do not register and select a slot via the above link.

Required Skills: Python, Database (MySQL), Big Data (Spark, Kafka)

Eligibility:

Degree: B.Tech/BE
Branch: CSE – AI & DS / AI & ML
Graduation Year: 2024 & 2025

Note: Only Female Candidates can apply for this job opportunity

Work Details:

Work Mode: Work From Office
Work Location: Bangalore & Coimbatore
CTC: 11.1 LPA

Evaluation Process:

Round 1: Offline Assessment at AccioJob Skill Centre in Noida, Pune, Hyderabad.

Further Rounds (for Shortlisted Candidates only)

HackerRank Online Assessment
Coding Pairing Interview
Technical Interview
Cultural Alignment Interview

Important Note: Please bring your laptop and earphones for the test.

Register here: https://go.acciojob.com/8p9ZXN

Data Engineer

Hunarstreet Technologies pvt ltd

Agency job

via Hunarstreet Technologies pvt ltd by Sakshi Patankar

Remote only

10 - 20 yrs

₹15L - ₹30L / yr

Data engineering

databricks

Python

Scala

Spark

+14 more

What You’ll Be Doing:

● Design and build parts of our data pipeline architecture for extraction, transformation, and loading of data from a wide variety of data sources using the latest Big Data technologies.

● Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

● Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.

● Work with machine learning, data, and analytics experts to drive innovation, accuracy and greater functionality in our data system.

Qualifications:

● Bachelor's degree in Engineering, Computer Science, or relevant field.

● 10+ years of relevant and recent experience in a Data Engineer role.

● 5+ years recent experience with Apache Spark and solid understanding of the fundamentals.

● Deep understanding of Big Data concepts and distributed systems.

● Strong coding skills with Scala, Python, Java and/or other languages and the ability to quickly switch between them with ease.

● Advanced working SQL knowledge and experience working with a variety of relational databases such as Postgres and/or MySQL.

● Cloud experience with Databricks.

● Experience working with data stored in many formats including Delta Tables, Parquet, CSV and JSON.

● Comfortable working in a Linux shell environment and writing scripts as needed.

● Comfortable working in an Agile environment

● Machine Learning knowledge is a plus.

● Must be capable of working independently and delivering stable, efficient and reliable software.

● Excellent written and verbal communication skills in English.

● Experience supporting and working with cross-functional teams in a dynamic environment

EMPLOYMENT TYPE: Full-Time, Permanent

LOCATION: Remote (Pan India)

SHIFT TIMINGS: 2:00 pm - 11:00 pm IST