Responsibilities:
• Build the real-time and batch analytics platform for analytics and machine learning.
• Design, propose and develop solutions with the growing scale and business requirements in mind.
• As an integral part of the Data Engineering team, be involved in the entire development lifecycle, from conceptualisation to architecture to coding to unit testing.
• Help design the data model for our data warehouse and other data engineering solutions.
Requirements:
• Deep understanding of real-time as well as batch big data processing (Spark, Storm, Kafka, KSQL, Flink, MapReduce, YARN, Hive, HDFS, Pig, etc.); a minimal streaming sketch follows this posting.
• Extensive experience developing applications that work with NoSQL stores (e.g. Elasticsearch, HBase, Cassandra, MongoDB).
• Strong understanding of data and fair data modelling experience.
• Proven programming experience in Java or Scala.
• Experience gathering and processing raw data at scale, including writing scripts, web scraping, calling APIs and writing SQL queries.
• Experience with cloud-based data stores such as Redshift and BigQuery is an advantage.
• Previous experience in a high-growth tech startup is an advantage.
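For the real-time half of the stack listed above, here is a minimal sketch, assuming Spark Structured Streaming in Scala reading from Kafka: it counts click events per one-minute window. The broker address, topic name (clicks) and checkpoint path are placeholders rather than anything from the posting, and the job assumes the spark-sql-kafka connector is on the classpath.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

// Sketch only: counts Kafka click events per one-minute window.
object ClickstreamCounts {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("clickstream-counts")
      .master("local[*]")          // local mode for the sketch; a cluster in practice
      .getOrCreate()

    // Read the raw event stream from a hypothetical "clicks" topic.
    val events = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9092")
      .option("subscribe", "clicks")
      .load()
      .selectExpr("CAST(value AS STRING) AS click", "timestamp")

    // Windowed count with a watermark so late events are eventually dropped.
    val counts = events
      .withWatermark("timestamp", "5 minutes")
      .groupBy(window(col("timestamp"), "1 minute"))
      .count()

    counts.writeStream
      .outputMode("update")
      .format("console")
      .option("checkpointLocation", "/tmp/clickstream-checkpoint")
      .start()
      .awaitTermination()
  }
}
```

The watermark bounds how late events may arrive before a window is finalised, which is the usual latency-versus-completeness trade-off in this kind of streaming aggregation.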
1. KEY OBJECTIVE OF THE JOB
Work closely with users, the product management team and the tech team to design, develop and strategise the data architecture and multidimensional databases.
2. MAJOR DELIVERABLES:
• Design the end-to-end BI and analytics platform and present it to tech and business stakeholders.
• Evaluate multiple tools and conduct proofs of concept based on requirements and budgets.
• Perform dimensional modelling of multiple data marts and an enterprise data warehouse from scratch.
• Understand complex OLTP (Online Transaction Processing) systems such as order booking, CRM, finance and web, and map their schemas and data dictionaries.
• Understand the business rules around data entities and document them.
• Map the business rules and OLTP entities to a dimensional model spread across multiple data marts and warehouses.
• Design a robust and failsafe ETL (Extract, Transform & Load) process without relying on any tool.
• Operationalise the ETL using shell and SQL scripts, without the need for any tool (a sketch of this approach follows this posting).
• Operationalise the dimensional model and the warehousing architecture on simple standalone databases such as MySQL and Postgres on Linux, or on cloud-based systems such as Redshift.
• Model data lakes for lightly structured but highly voluminous clickstream data using Hadoop and similar technologies.
• Be an extremely hands-on person who loves to create a blueprint as well as write scripts, make presentations and even set up end-to-end PoCs (proofs of concept) on his/her own.
• Coordinate among data scientists, technology partners, business users, analysts etc., and make sure they are able to use the OLAP (Online Analytical Processing) platform in the intended way.
• Understand the pain points of the above stakeholders and continuously iterate on the existing platform, with a completely open mind, to meet their needs.
• Track and continuously tune the data infrastructure for performance and scale.
3. RIGHT PERSON:
Essential Attributes
• Dimensional modelling and schema design for OLAP/BI
• Command over multiple ETL, DW/data mart and BI tools
• Experience with HANA or Talend is an added advantage
• Solution design and documentation
• Big data architecture design (Hadoop and related ecosystem)
• Propensity towards a hands-on/start-up working environment
Desirable Attributes
• Big data and machine learning
• Data science and statistics
• E-commerce or retail domain experience
Profile
An engineer and tech enthusiast with at least 10 years of total experience, including 5 to 6 years in data warehouse architecture, the ability to think logically, the ability to address data migration issues, an understanding of the importance of data dictionaries and a strong desire to establish best practices will fit the bill.
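One possible reading of the "no ETL tool" deliverable above is plain SQL executed against Postgres from a thin script; the sketch below wraps that SQL in a small Scala program purely for illustration (the posting itself talks about shell and SQL scripts). The JDBC URL, credentials, and the staging, dimension and fact table names are all hypothetical, the upsert assumes a UNIQUE constraint on customer_nk, and the Postgres JDBC driver is assumed to be on the classpath.

```scala
import java.sql.DriverManager

// Sketch of a hand-rolled dimensional load: staging tables -> dim_customer -> fact_orders.
object DimensionalLoad {
  def main(args: Array[String]): Unit = {
    val conn = DriverManager.getConnection(
      "jdbc:postgresql://localhost:5432/warehouse", "etl_user", "etl_password")
    try {
      val stmt = conn.createStatement()

      // Upsert the customer dimension (SCD type 1 for simplicity);
      // assumes a UNIQUE constraint on customer_nk.
      stmt.executeUpdate(
        """INSERT INTO dim_customer (customer_nk, name, city)
          |SELECT id, name, city FROM staging_customers
          |ON CONFLICT (customer_nk) DO UPDATE SET name = EXCLUDED.name, city = EXCLUDED.city
          |""".stripMargin)

      // Load the order fact, resolving the dimension surrogate key via a join.
      stmt.executeUpdate(
        """INSERT INTO fact_orders (customer_sk, order_date_key, amount)
          |SELECT d.customer_sk, to_char(s.order_date, 'YYYYMMDD')::int, s.amount
          |FROM staging_orders s
          |JOIN dim_customer d ON d.customer_nk = s.customer_id
          |""".stripMargin)
    } finally conn.close()
  }
}
```

Resolving surrogate keys through a join against the dimension, rather than storing natural keys in the fact table, is the standard dimensional-modelling practice the deliverables above describe.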
Job Requirements
• Installation, configuration and administration of big data components (including Hadoop/Spark) for batch and real-time analytics and data hubs
• Capable of processing large sets of structured, semi-structured and unstructured data (a minimal ETL sketch follows this posting)
• Able to assess business rules, collaborate with stakeholders and perform source-to-target data mapping, design and review
• Familiar with data architecture: data ingestion pipeline design, Hadoop information architecture, data modelling, data mining, machine learning and advanced data processing
• Optional: visual communicator, able to convert and present data as easily comprehensible visualisations using tools like D3.js and Tableau
• Enjoys being challenged and solving complex problems on a daily basis
• Proficient in executing efficient and robust ETL workflows
• Able to work in teams and collaborate with others to clarify requirements
• Able to tune Hadoop solutions to improve performance and the end-user experience
• Strong coordination and project management skills to handle complex projects
• Engineering background
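As an illustration of the semi-structured processing and ETL items above, a minimal Spark batch sketch in Scala: it reads raw JSON events, applies a simple data-quality rule, and writes date-partitioned Parquet. The input/output paths and column names (user_id, event_ts) are assumptions made for the example, not details from the posting.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

// Sketch of a small batch ETL step: semi-structured JSON in, partitioned Parquet out.
object JsonToParquet {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("json-to-parquet")
      .master("local[*]")
      .getOrCreate()

    // Schema is inferred from the JSON; hypothetical input path.
    val raw = spark.read.json("/data/raw/events/*.json")

    val cleaned = raw
      .filter(col("user_id").isNotNull)                   // basic data-quality rule
      .withColumn("event_date", to_date(col("event_ts"))) // derive a partition column

    cleaned
      .repartition(col("event_date"))                     // avoid many small files per partition
      .write
      .mode("overwrite")
      .partitionBy("event_date")
      .parquet("/data/curated/events")

    spark.stop()
  }
}
```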
Job Title: Software Developer – Big Data
Responsibilities
We are looking for a Big Data Developer who can drive innovation, take ownership and deliver results.
• Understand business requirements from stakeholders
• Build and own Mintifi's big data applications
• Be heavily involved in every step of the product development process, from ideation to implementation to release
• Design and build systems with automated instrumentation and monitoring
• Write unit and integration tests (a testable Spark sketch follows this posting)
• Collaborate with cross-functional teams to validate and get feedback on the efficacy of results produced by the big data applications, and use the feedback to improve the business logic
• Take a proactive approach to turning ambiguous problem spaces into clear design solutions
Qualifications
• Hands-on programming skills in Apache Spark using Java or Scala
• Good understanding of data structures and algorithms
• Good understanding of relational and non-relational database concepts (MySQL, Hadoop, MongoDB)
• Experience with Hadoop ecosystem components such as YARN and ZooKeeper is a strong plus
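A minimal sketch of hands-on Spark in Scala written with the unit-testing point above in mind: the aggregation is a pure DataFrame-to-DataFrame function, so it can be exercised against a small in-memory DataFrame without any external systems. The loan/borrower column names are invented for the example and are not taken from the posting.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions._

// Sketch only: business logic kept as a pure function so it is easy to unit-test.
object LoanAggregates {
  // Total disbursed amount per borrower (hypothetical columns).
  def totalsPerBorrower(loans: DataFrame): DataFrame =
    loans.groupBy("borrower_id").agg(sum("amount").as("total_amount"))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("loan-aggregates")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Tiny in-memory sample standing in for a real table.
    val sample = Seq(("b1", 100.0), ("b1", 250.0), ("b2", 75.0)).toDF("borrower_id", "amount")
    totalsPerBorrower(sample).show()   // in a real test, assert on collect() instead

    spark.stop()
  }
}
```

Keeping the business logic in pure functions like this is what makes the "write unit and integration tests" responsibility practical: a test can build a tiny DataFrame, call the function, and assert on the collected rows.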
About the job:
- You will work with data scientists to architect, code and deploy ML models
- You will solve problems of storing and analysing large-scale data in milliseconds
- You will architect and develop data processing and warehouse systems
- You will code, drink, breathe and live Python, sklearn and pandas. Experience with these is good to have but not a necessity, as long as you're super comfortable in a language of your choice
- You will develop tools and products that give analysts ready access to the data
About you:
- Strong CS fundamentals
- You have strong experience working with production environments
- You write code that is clean, readable and tested
- Instead of doing something a second time, you automate it
- You have worked with some of the commonly used databases and computing frameworks (PostgreSQL, S3, Hadoop, Hive, Presto, Spark, etc.)
- It would be great if you have a Kaggle or GitHub profile to share
- You are an expert in one or more programming languages (Python preferred); experience with Python-based application development and data science libraries is also good to have
- Ideally, you have 2+ years of experience in tech and/or data
- Degree in CS/Maths from a Tier-1 institute