Data Science Jobs in Mumbai

33+ Data Science Jobs in Mumbai | Data Science Job openings in Mumbai

Apply to 33+ Data Science Jobs in Mumbai on CutShort.io. Explore the latest Data Science Job opportunities across top companies like Google, Amazon & Adobe.

Rust developer

at HaystackAnalytics

Posted by Careers Hr

Navi Mumbai

1 - 4 yrs

₹6L - ₹12L / yr

Rust

Python

Artificial Intelligence (AI)

Machine Learning (ML)

Data Science

+2 more

Position – Python Developer

Location – Navi Mumbai

Who are we

Based out of IIT Bombay, HaystackAnalytics is a HealthTech company creating clinical genomics products, which enable diagnostic labs and hospitals to offer accurate and personalized diagnostics. Supported by India's most respected science agencies (DST, BIRAC, DBT), we created and launched a portfolio of products to offer genomics in infectious diseases. Our genomics-based diagnostic solution for Tuberculosis was recognized as one of the top innovations supported by BIRAC in the past 10 years, and was launched by the Prime Minister of India in the BIRAC Showcase event in Delhi, 2022.

Objectives of this Role:

Design and implement efficient, scalable backend services using Python.
Work closely with healthcare domain experts to create innovative and accurate diagnostics solutions.
Build APIs, services, and scripts to support data processing pipelines and front-end applications.
Automate recurring tasks and ensure robust integration with cloud services.
Maintain high standards of software quality and performance using clean coding principles and testing practices.
Collaborate within the team to upskill and unblock each other for faster and better outcomes.

Primary Skills – Python Development

Proficient in Python 3 and its ecosystem
Frameworks: Flask / Django / FastAPI
RESTful API development
Understanding of OOPs and SOLID design principles
Asynchronous programming (asyncio, aiohttp)
Experience with task queues (Celery, RQ)
Rust programming experience for systems-level or performance-critical components

Testing & Automation

Unit Testing: PyTest / unittest
Automation tools: Ansible / Terraform (good to have)
CI/CD pipelines

DevOps & Cloud

Docker, Kubernetes (basic knowledge expected)
Cloud platforms: AWS / Azure / GCP
GIT and GitOps workflows
Familiarity with containerized deployment & serverless architecture

Bonus Skills

Data handling libraries: Pandas / NumPy
Experience with scripting: Bash / PowerShell
Functional programming concepts
Familiarity with front-end integration (REST API usage, JSON handling)

Other Skills

Innovation and thought leadership
Interest in learning new tools, languages, workflows
Strong communication and collaboration skills
Basic understanding of UI/UX principles

To know more about us – https://haystackanalytics.in

Position – Python Developer

Location – Navi Mumbai

Who are we

Objectives of this Role:

Design and implement efficient, scalable backend services using Python.
Work closely with healthcare domain experts to create innovative and accurate diagnostics solutions.
Build APIs, services, and scripts to support data processing pipelines and front-end applications.
Automate recurring tasks and ensure robust integration with cloud services.
Maintain high standards of software quality and performance using clean coding principles and testing practices.
Collaborate within the team to upskill and unblock each other for faster and better outcomes.

Primary Skills – Python Development

Proficient in Python 3 and its ecosystem
Frameworks: Flask / Django / FastAPI
RESTful API development
Understanding of OOPs and SOLID design principles
Asynchronous programming (asyncio, aiohttp)
Experience with task queues (Celery, RQ)
Rust programming experience for systems-level or performance-critical components

Testing & Automation

Unit Testing: PyTest / unittest
Automation tools: Ansible / Terraform (good to have)
CI/CD pipelines

DevOps & Cloud

Docker, Kubernetes (basic knowledge expected)
Cloud platforms: AWS / Azure / GCP
GIT and GitOps workflows
Familiarity with containerized deployment & serverless architecture

Bonus Skills

Data handling libraries: Pandas / NumPy
Experience with scripting: Bash / PowerShell
Functional programming concepts
Familiarity with front-end integration (REST API usage, JSON handling)

Other Skills

Innovation and thought leadership
Interest in learning new tools, languages, workflows
Strong communication and collaboration skills
Basic understanding of UI/UX principles

To know more about us – https://haystackanalytics.in

ELearning Head IT

at KGISL EDU

2 recruiters

Posted by Dhivya V

Coimbatore, Tamil nadu, Bengaluru (Bangalore), Mumbai, Delhi, Gurugram, Noida, Ghaziabad, Faridabad, Pune, Hyderabad

12 - 15 yrs

₹12L - ₹20L / yr

Bachelor of Computer Science

Management Information System (MIS)

Artificial Intelligence (AI)

Data Science

Head of the Department

AI and Data Science

12 to 15 years of Experience

Salary negotiable for immediate Joiners

hrkiteatkgkitedotacdotin

Head of the Department

AI and Data Science

12 to 15 years of Experience

Salary negotiable for immediate Joiners

hrkiteatkgkitedotacdotin

Senior Data Engineer

at TechMynd Consulting

2 candid answers

Posted by Suraj N

Bengaluru (Bangalore), Gurugram, Mumbai

4 - 8 yrs

₹10L - ₹24L / yr

Data Science

PostgreSQL

Python

Apache

Amazon Web Services (AWS)

+5 more

Senior Data Engineer

Location: Bangalore, Gurugram (Hybrid)

Experience: 4-8 Years

Type: Full Time | Permanent

Job Summary:

We are looking for a results-driven Senior Data Engineer to join our engineering team. The ideal candidate will have hands-on expertise in data pipeline development, cloud infrastructure, and BI support, with a strong command of modern data stacks. You’ll be responsible for building scalable ETL/ELT workflows, managing data lakes and marts, and enabling seamless data delivery to analytics and business intelligence teams.

This role requires deep technical know-how in PostgreSQL, Python scripting, Apache Airflow, AWS or other cloud environments, and a working knowledge of modern data and BI tools.

Key Responsibilities:

PostgreSQL & Data Modeling

· Design and optimize complex SQL queries, stored procedures, and indexes

· Perform performance tuning and query plan analysis

· Contribute to schema design and data normalization

Data Migration & Transformation

· Migrate data from multiple sources to cloud or ODS platforms

· Design schema mapping and implement transformation logic

· Ensure consistency, integrity, and accuracy in migrated data

Python Scripting for Data Engineering

· Build automation scripts for data ingestion, cleansing, and transformation

· Handle file formats (JSON, CSV, XML), REST APIs, cloud SDKs (e.g., Boto3)

· Maintain reusable script modules for operational pipelines

Data Orchestration with Apache Airflow

· Develop and manage DAGs for batch/stream workflows

· Implement retries, task dependencies, notifications, and failure handling

· Integrate Airflow with cloud services, data lakes, and data warehouses

Cloud Platforms (AWS / Azure / GCP)

· Manage data storage (S3, GCS, Blob), compute services, and data pipelines

· Set up permissions, IAM roles, encryption, and logging for security

· Monitor and optimize cost and performance of cloud-based data operations

Data Marts & Analytics Layer

· Design and manage data marts using dimensional models

· Build star/snowflake schemas to support BI and self-serve analytics

· Enable incremental load strategies and partitioning

Modern Data Stack Integration

· Work with tools like DBT, Fivetran, Redshift, Snowflake, BigQuery, or Kafka

· Support modular pipeline design and metadata-driven frameworks

· Ensure high availability and scalability of the stack

BI & Reporting Tools (Power BI / Superset / Supertech)

· Collaborate with BI teams to design datasets and optimize queries

· Support development of dashboards and reporting layers

· Manage access, data refreshes, and performance for BI tools

Required Skills & Qualifications:

· 4–6 years of hands-on experience in data engineering roles

· Strong SQL skills in PostgreSQL (tuning, complex joins, procedures)

· Advanced Python scripting skills for automation and ETL

· Proven experience with Apache Airflow (custom DAGs, error handling)

· Solid understanding of cloud architecture (especially AWS)

· Experience with data marts and dimensional data modeling

· Exposure to modern data stack tools (DBT, Kafka, Snowflake, etc.)

· Familiarity with BI tools like Power BI, Apache Superset, or Supertech BI

· Version control (Git) and CI/CD pipeline knowledge is a plus

· Excellent problem-solving and communication skills

Senior Data Engineer

Location: Bangalore, Gurugram (Hybrid)

Experience: 4-8 Years

Type: Full Time | Permanent

Job Summary:

This role requires deep technical know-how in PostgreSQL, Python scripting, Apache Airflow, AWS or other cloud environments, and a working knowledge of modern data and BI tools.

Key Responsibilities:

PostgreSQL & Data Modeling

· Design and optimize complex SQL queries, stored procedures, and indexes

· Perform performance tuning and query plan analysis

· Contribute to schema design and data normalization

Data Migration & Transformation

· Migrate data from multiple sources to cloud or ODS platforms

· Design schema mapping and implement transformation logic

· Ensure consistency, integrity, and accuracy in migrated data

Python Scripting for Data Engineering

· Build automation scripts for data ingestion, cleansing, and transformation

· Handle file formats (JSON, CSV, XML), REST APIs, cloud SDKs (e.g., Boto3)

· Maintain reusable script modules for operational pipelines

Data Orchestration with Apache Airflow

· Develop and manage DAGs for batch/stream workflows

· Implement retries, task dependencies, notifications, and failure handling

· Integrate Airflow with cloud services, data lakes, and data warehouses

Cloud Platforms (AWS / Azure / GCP)

· Manage data storage (S3, GCS, Blob), compute services, and data pipelines

· Set up permissions, IAM roles, encryption, and logging for security

· Monitor and optimize cost and performance of cloud-based data operations

Data Marts & Analytics Layer

· Design and manage data marts using dimensional models

· Build star/snowflake schemas to support BI and self-serve analytics

· Enable incremental load strategies and partitioning

Modern Data Stack Integration

· Work with tools like DBT, Fivetran, Redshift, Snowflake, BigQuery, or Kafka

· Support modular pipeline design and metadata-driven frameworks

· Ensure high availability and scalability of the stack

BI & Reporting Tools (Power BI / Superset / Supertech)

· Collaborate with BI teams to design datasets and optimize queries

· Support development of dashboards and reporting layers

· Manage access, data refreshes, and performance for BI tools

Required Skills & Qualifications:

· 4–6 years of hands-on experience in data engineering roles

· Strong SQL skills in PostgreSQL (tuning, complex joins, procedures)

· Advanced Python scripting skills for automation and ETL

· Proven experience with Apache Airflow (custom DAGs, error handling)

· Solid understanding of cloud architecture (especially AWS)

· Experience with data marts and dimensional data modeling

· Exposure to modern data stack tools (DBT, Kafka, Snowflake, etc.)

· Familiarity with BI tools like Power BI, Apache Superset, or Supertech BI

· Version control (Git) and CI/CD pipeline knowledge is a plus

· Excellent problem-solving and communication skills

Senior Data Scientist

at Unicornis AI

Posted by Sachin Anbhule

Navi Mumbai

5 - 7 yrs

₹9L - ₹15L / yr

Python

Data Science

OpenAI

Retrieval Augmented Generation (RAG)

Large Language Models (LLM)

Note: We are looking for immediate joiners with 6+ years of experience.

Job Description

UnicornisAI is seeking a Senior Data Scientist with expertise in chatbot development using Retrieval-Augmented Generation (RAG) and OpenAI. This role is ideal for someone with a strong background in machine learning, natural language processing (NLP), and AI model deployment. If you are passionate about developing cutting-edge AI-driven solutions, we’d love to have you on our team.

Key Responsibilities

- Design and develop AI-powered chatbots using Retrieval-Augmented Generation (RAG), OpenAI models (GPT-4, etc.), and vector databases

- Build and fine-tune large language models (LLMs) to improve chatbot performance

- Implement document retrieval and knowledge management systems for chatbot responses

- Optimize NLP pipelines and model performance using state-of-the-art techniques

- Work with structured and unstructured data to enhance chatbot intelligence

- Deploy and maintain AI models in cloud environments such as AWS, Azure, or GCP

- Collaborate with engineering teams to integrate AI solutions into products

- Stay updated with the latest advancements in AI, NLP, and RAG-based architectures

Required Skills & Qualifications

- 6+ years of experience in data science, AI, or a related field

- Strong knowledge of RAG, OpenAI APIs (GPT-4, GPT-3.5, etc.), LLM fine-tuning, and embeddings

- Proficiency in Python, TensorFlow, PyTorch, and other ML frameworks

- Experience with vector databases such as FAISS, Pinecone, or Weaviate

- Expertise in NLP techniques such as Named Entity Recognition (NER), text summarization, and semantic search

- Hands-on experience in building and deploying AI models in production

- Knowledge of cloud platforms like AWS Sagemaker, Azure AI, or Google Vertex AI

- Strong problem-solving and analytical skills

Nice-to-Have Skills

- Experience with MLOps tools for model monitoring and retraining

- Understanding of prompt engineering and LLM chaining techniques

- Exposure to LangChain or similar frameworks for RAG-based chatbots

Location & Work Mode

- Open to remote or hybrid work, based on location

Interested candidates can email their resumes to Sachin at unicornisai.com