41+ Apache Spark Jobs in Bangalore (Bengaluru)
Apply to 41+ Apache Spark jobs in Bangalore (Bengaluru) on CutShort.io. Explore the latest Apache Spark job opportunities across top companies like Google, Amazon & Adobe.
Responsibilities:
• Build customer-facing solutions for a Data Observability product that monitors data pipelines.
• Work on POCs to build new data pipeline monitoring capabilities.
• Build next-generation scalable, reliable, flexible, high-performance data pipeline capabilities for ingesting data from multiple sources containing complex datasets.
• Continuously improve services you own, making them more performant and using resources in the most optimised way.
• Collaborate closely with the engineering, data science, and product teams to propose an optimal solution for a given problem statement.
• Work closely with the DevOps team on performance monitoring and MLOps.
Required Skills:
• 3+ years of experience with data-related technologies
• Good understanding of distributed computing principles
• Experience in Apache Spark
• Hands-on programming with Python
• Knowledge of Hadoop v2, MapReduce, HDFS
• Experience building stream-processing systems using technologies such as Apache Storm, Spark Streaming, or Flink (see the sketch after this list)
• Experience with messaging systems, such as Kafka or RabbitMQ
• Good understanding of Big Data querying tools, such as Hive
• Experience with integration of data from multiple data sources
• Good understanding of SQL queries, joins, stored procedures, relational schemas
• Experience with NoSQL databases, such as HBase, Cassandra/Scylla, MongoDB
• Knowledge of ETL techniques and frameworks
• Performance tuning of Spark Jobs
• A general understanding of data quality is a plus
• Experience with Databricks, Snowflake, BigQuery, or similar lakehouse platforms would be a big plus
• Some knowledge of DevOps is nice to have
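As a hedged illustration of the stream-processing requirement above, the sketch below consumes events from Kafka with Spark Structured Streaming and lands them as Parquet. This is not the employer's actual stack or code; the broker address, topic, and paths are placeholders, and the spark-sql-kafka connector package is assumed to be available.

```python
# Minimal sketch only: read a Kafka topic with Spark Structured Streaming and
# persist micro-batches as Parquet. Broker, topic, and paths are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql.functions import col

spark = SparkSession.builder.appName("stream-monitoring-sketch").getOrCreate()

events = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")  # placeholder broker
    .option("subscribe", "pipeline-events")            # placeholder topic
    .load()
    .select(
        col("key").cast("string"),
        col("value").cast("string"),
        col("timestamp"),
    )
)

query = (
    events.writeStream
    .format("parquet")
    .option("path", "/data/pipeline-events")               # placeholder output path
    .option("checkpointLocation", "/chk/pipeline-events")  # enables recovery on restart
    .outputMode("append")
    .start()
)
query.awaitTermination()
```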
We are actively seeking a self-motivated Data Engineer with expertise in Azure cloud and Databricks and a thorough understanding of Delta Lake and Lakehouse architecture. The ideal candidate should excel in developing scalable data solutions, crafting platform tools, and integrating systems, while demonstrating proficiency in cloud-native database solutions and distributed data processing.
Key Responsibilities:
- Contribute to the development and upkeep of a scalable data platform, incorporating tools and frameworks that leverage Azure and Databricks capabilities.
- Exhibit proficiency in various RDBMS databases such as MySQL and SQL Server, emphasizing their integration in applications and pipeline development.
- Design and maintain high-caliber code, including data pipelines and applications, utilizing Python, Scala, and PHP.
- Implement effective data processing solutions via Apache Spark, optimizing Spark applications for large-scale data handling.
- Optimize data storage using formats like Parquet and Delta Lake to ensure efficient data accessibility and reliable performance (see the sketch after this list).
- Demonstrate understanding of Hive Metastore, Unity Catalog Metastore, and the operational dynamics of external tables.
- Collaborate with diverse teams to convert business requirements into precise technical specifications.
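As a brief, hedged sketch of the Parquet/Delta Lake point referenced above, the snippet below shows one way a partitioned Delta table might be written in a Databricks-style Spark environment. The paths, table name, and partition column are assumptions invented for this example, not part of the posting.

```python
# Illustrative sketch, assuming a Spark environment with Delta Lake available
# (e.g. Databricks). Paths, the table name, and the order_date column are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("delta-sketch").getOrCreate()

# Raw CSV landed by an upstream pipeline (placeholder location).
raw = spark.read.option("header", "true").csv("/mnt/raw/orders")

# Store it as a partitioned Delta table for reliable, efficient downstream reads.
(
    raw.write
    .format("delta")
    .mode("overwrite")
    .partitionBy("order_date")  # assumes such a column exists in the data
    .save("/mnt/curated/orders")
)

# Register the location as a table and compact small files.
spark.sql(
    "CREATE TABLE IF NOT EXISTS orders USING DELTA LOCATION '/mnt/curated/orders'"
)
spark.sql("OPTIMIZE orders")  # supported on Databricks and recent Delta releases
```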
Requirements:
- Bachelor’s degree in Computer Science, Engineering, or a related discipline.
- Demonstrated hands-on experience with Azure cloud services and Databricks.
- Proficient programming skills in Python, Scala, and PHP.
- In-depth knowledge of SQL, NoSQL databases, and data warehousing principles.
- Familiarity with distributed data processing and external table management.
- Insight into enterprise data solutions for PIM, CDP, MDM, and ERP applications.
- Exceptional problem-solving acumen and meticulous attention to detail.
Additional Qualifications :
- Acquaintance with data security and privacy standards.
- Experience in CI/CD pipelines and version control systems, notably Git.
- Familiarity with Agile methodologies and DevOps practices.
- Competence in technical writing for comprehensive documentation.
Publicis Sapient Overview:
As a Senior Associate L1 in Data Engineering, you will translate client requirements into technical designs and implement components for data engineering solutions. Utilize a deep understanding of data integration and big data design principles to create custom solutions or implement package solutions. You will independently drive design discussions to ensure the necessary health of the overall solution.
Job Summary:
As a Senior Associate L2 in Data Engineering, you will translate client requirements into technical designs and implement components for data engineering solutions. Utilize a deep understanding of data integration and big data design principles to create custom solutions or implement package solutions. You will independently drive design discussions to ensure the necessary health of the overall solution.
The role requires a hands-on technologist with a strong programming background in Java / Scala / Python, experience in data ingestion, integration, data wrangling, computation, and analytics pipelines, and exposure to Hadoop ecosystem components. You are also required to have hands-on knowledge of at least one of the AWS, GCP, or Azure cloud platforms.
Role & Responsibilities:
Your role is focused on the design, development, and delivery of solutions involving:
• Data Integration, Processing & Governance
• Data Storage and Computation Frameworks, Performance Optimizations
• Analytics & Visualizations
• Infrastructure & Cloud Computing
• Data Management Platforms
• Implement scalable architectural models for data processing and storage
• Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time mode
• Build functionality for data analytics, search and aggregation
Experience Guidelines:
Mandatory Experience and Competencies:
1. Overall 5+ years of IT experience, with 3+ years in data-related technologies
2. Minimum 2.5 years of experience in Big Data technologies and working exposure to related data services on at least one cloud platform (AWS / Azure / GCP)
3. Hands-on experience with the Hadoop stack – HDFS, Sqoop, Kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, Hive, Oozie, Airflow, and other components required to build end-to-end data pipelines
4. Strong experience in at least one of the programming languages Java, Scala, or Python; Java preferred
5. Hands-on working knowledge of NoSQL and MPP data platforms like HBase, MongoDB, Cassandra, AWS Redshift, Azure SQL DW, GCP BigQuery, etc.
6. Well-versed, working knowledge of data-platform-related services on at least one cloud platform, IAM, and data security
Preferred Experience and Knowledge (Good to Have):
1. Good knowledge of traditional ETL tools (Informatica, Talend, etc.) and database technologies (Oracle, MySQL, SQL Server, Postgres), with hands-on experience
2. Knowledge of data governance processes (security, lineage, catalog) and tools like Collibra, Alation, etc.
3. Knowledge of distributed messaging frameworks like ActiveMQ / RabbitMQ / Solace, search & indexing, and microservices architectures
4. Performance tuning and optimization of data pipelines
5. CI/CD – infrastructure provisioning on cloud, automated build & deployment pipelines, code quality
6. Cloud data specialty and other related Big Data technology certifications
Personal Attributes:
• Strong written and verbal communication skills
• Articulation skills
• Good team player
• Self-starter who requires minimal oversight
• Ability to prioritize and manage multiple tasks
• Process orientation and the ability to define and set up processes
A LEADING US BASED MNC
Data Engineering : Senior Engineer / Manager
As a Senior Engineer / Manager in Data Engineering, you will translate client requirements into technical designs and implement components for data engineering solutions. Utilize a deep understanding of data integration and big data design principles to create custom solutions or implement package solutions. You will independently drive design discussions to ensure the necessary health of the overall solution.
Must Have skills :
1. GCP
2. Spark Streaming: live data streaming experience is desired.
3. Any one coding language: Java / Python / Scala
Skills & Experience :
- Overall experience of a minimum of 5+ years, with at least 4 years of relevant experience in Big Data technologies
- Hands-on experience with the Hadoop stack – HDFS, Sqoop, Kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, Hive, Oozie, Airflow, and other components required to build end-to-end data pipelines. Working knowledge of real-time data pipelines is an added advantage.
- Strong experience in at least one of the programming languages Java, Scala, or Python; Java preferred
- Hands-on working knowledge of NoSQL and MPP data platforms like HBase, MongoDB, Cassandra, AWS Redshift, Azure SQL DW, GCP BigQuery, etc.
- Well-versed and working knowledge with data platform related services on GCP
- Bachelor's degree and 6 to 12 years of work experience, or any combination of education, training, and/or experience that demonstrates the ability to perform the duties of the position
Your Impact :
- Data Ingestion, Integration and Transformation
- Data Storage and Computation Frameworks, Performance Optimizations
- Analytics & Visualizations
- Infrastructure & Cloud Computing
- Data Management Platforms
- Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time
- Build functionality for data analytics, search and aggregation
Lead Data Engineer
Data Engineers develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions. You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems. On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product. It could also be a software delivery project where you're equally happy coding and tech-leading the team to implement the solution.
Job responsibilities
· You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems
· You will partner with teammates to create complex data processing pipelines in order to solve our clients' most ambitious challenges
· You will collaborate with Data Scientists in order to design scalable implementations of their models
· You will pair to write clean and iterative code based on TDD
· Leverage various continuous delivery practices to deploy, support and operate data pipelines
· Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available
· Develop and operate modern data architecture approaches to meet key business objectives and provide end-to-end data solutions
· Create data models and speak to the tradeoffs of different modeling approaches
· On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product
· Seamlessly incorporate data quality into your day-to-day work as well as into the delivery process
· Assure effective collaboration between Thoughtworks' and the client's teams, encouraging open communication and advocating for shared outcomes
Job qualifications Technical skills
· You are equally happy coding and leading a team to implement a solution
· You have a track record of innovation and expertise in Data Engineering
· You're passionate about craftsmanship and have applied your expertise across a range of industries and organizations
· You have a deep understanding of data modelling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop
· You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting
· Hands-on experience with MapR, Cloudera, Hortonworks and/or cloud-based (AWS EMR, Azure HDInsight, Qubole etc.) Hadoop distributions
· You are comfortable taking data-driven approaches and applying data security strategy to solve business problems
· You're genuinely excited about data infrastructure and operations with a familiarity working in cloud environments
· Working with data excites you: you have created Big data architecture, you can build and operate data pipelines, and maintain data storage, all within distributed systems
Professional skills
· Advocate your data engineering expertise to the broader tech community outside of Thoughtworks, speaking at conferences and acting as a mentor for more junior-level data engineers
· You're resilient and flexible in ambiguous situations and enjoy solving problems from technical and business perspectives
· An interest in coaching others, sharing your experience and knowledge with teammates
· You enjoy influencing others and always advocate for technical excellence while being open to change when needed
Roles and Responsibilities:
- Design, develop, and maintain the end-to-end MLOps infrastructure from the ground up, leveraging open-source systems across the entire MLOps landscape.
- Create pipelines for data ingestion, data transformation, and building, testing, and deploying machine learning models, as well as monitoring and maintaining the performance of these models in production (a brief sketch follows this list).
- Manage the MLOps stack, including version control systems, continuous integration and deployment tools, containerization, orchestration, and monitoring systems.
- Ensure that the MLOps stack is scalable, reliable, and secure.
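As a hedged sketch of the experiment-tracking slice of this work (referenced in the pipeline item above), the snippet below logs a model run with MLflow, one of the open-source MLOps tools named under Primary Skills. The experiment name, model, and metric are placeholders, not the team's actual setup.

```python
# Illustrative only: track a training run with MLflow. The experiment name,
# model choice, and logged metric are placeholders for this sketch.
import mlflow
import mlflow.sklearn
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

mlflow.set_experiment("demo-experiment")  # placeholder experiment name

with mlflow.start_run():
    X, y = load_iris(return_X_y=True)
    model = RandomForestClassifier(n_estimators=50, random_state=42)
    model.fit(X, y)

    # Log hyperparameters, a metric, and the trained model artifact.
    mlflow.log_param("n_estimators", 50)
    mlflow.log_metric("train_accuracy", model.score(X, y))
    mlflow.sklearn.log_model(model, "model")
```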
Skills Required:
- 3-6 years of MLOps experience
- Preferably worked in the startup ecosystem
Primary Skills:
- Experience with E2E MLOps systems like ClearML, Kubeflow, MLflow, etc.
- Technical expertise in MLOps: Should have a deep understanding of the MLOps landscape and be able to leverage open-source systems to build scalable, reliable, and secure MLOps infrastructure.
- Programming skills: Proficient in at least one programming language, such as Python, and have experience with data science libraries, such as TensorFlow, PyTorch, or Scikit-learn.
- DevOps experience: Should have experience with DevOps tools and practices, such as Git, Docker, Kubernetes, and Jenkins.
Secondary Skills:
- Version Control Systems (VCS) tools like Git and Subversion
- Containerization technologies like Docker and Kubernetes
- Cloud Platforms like AWS, Azure, and Google Cloud Platform
- Data Preparation and Management tools like Apache Spark, Apache Hadoop, and SQL databases like PostgreSQL and MySQL
- Machine Learning Frameworks like TensorFlow, PyTorch, and Scikit-learn
- Monitoring and Logging tools like Prometheus, Grafana, and Elasticsearch
- Continuous Integration and Continuous Deployment (CI/CD) tools like Jenkins, GitLab CI, and CircleCI
- Explainability and interpretability tools like LIME and SHAP
Have you streamed a program on Disney+, watched your favorite binge-worthy series on Peacock or cheered your favorite team on during the World Cup from one of the 20 top streaming platforms around the globe? If the answer is yes, you’ve already benefitted from Conviva technology, helping the world’s leading streaming publishers deliver exceptional streaming experiences and grow their businesses.
Conviva is the only global streaming analytics platform for big data that collects, standardizes, and puts trillions of cross-screen, streaming data points in context, in real time. The Conviva platform provides comprehensive, continuous, census-level measurement through real-time, server side sessionization at unprecedented scale. If this sounds important, it is! We measure a global footprint of more than 500 million unique viewers in 180 countries watching 220 billion streams per year across 3 billion applications streaming on devices. With Conviva, customers get a unique level of actionability and scale from continuous streaming measurement insights and benchmarking across every stream, every screen, every second.
What you get to do in this role:
Work on extremely high-scale Rust web services or backend systems.
Design and develop solutions for highly scalable web and backend systems.
Proactively identify and solve performance issues.
Maintain a high bar on code quality and unit testing.
What you bring to the role:
5+ years of hands-on software development experience.
At least 2+ years of Rust development experience.
Knowledge of Cargo packages (crates) for Kafka, Redis, etc.
Strong CS fundamentals, including system design, data structures and algorithms.
Expertise in backend and web services development.
Good analytical and troubleshooting skills.
What will help you stand out:
Experience working with large scale web services and applications.
Exposure to Golang, Scala or Java
Exposure to Big data systems like Kafka, Spark, Hadoop etc.
Underpinning the Conviva platform is a rich history of innovation. More than 60 patents represent award-winning technologies and standards, including first-of-its kind-innovations like time-state analytics and AI-automated data modeling, that surfaces actionable insights. By understanding real-world human experiences and having the ability to act within seconds of observation, our customers can solve business-critical issues and focus on growing their business ahead of the competition. Examples of the brands Conviva has helped fuel streaming growth for include: DAZN, Disney+, HBO, Hulu, NBCUniversal, Paramount+, Peacock, Sky, Sling TV, Univision and Warner Bros Discovery.
Privately held, Conviva is headquartered in Silicon Valley, California with offices and people around the globe. For more information, visit us at www.conviva.com. Join us to help extend our leadership position in big data streaming analytics to new audiences and markets!
As Conviva is expanding, we are building products providing deep insights into end user experience for our customers.
Platform and TLB Team
The vision for the TLB team is to build data processing software that works on terabytes of streaming data in real time. Engineer the next-gen Spark-like system for in-memory computation of large time-series datasets – both the Spark-like backend infrastructure and a library-based programming model. Build horizontally and vertically scalable systems that analyse trillions of events per day within sub-second latencies. Utilize the latest and greatest big data technologies to build solutions for use cases across multiple verticals. Lead technology innovation and advancement that will have big business impact for years to come. Be part of a worldwide team building software using the latest technologies and the best software development tools and processes.
What You’ll Do
This is an individual contributor position. Expectations will be on the below lines:
- Design, build and maintain the stream processing, and time-series analysis system which is at the heart of Conviva's products
- Responsible for the architecture of the Conviva platform
- Build features, enhancements, new services, and bug fixing in Scala and Java on a Jenkins-based pipeline to be deployed as Docker containers on Kubernetes
- Own the entire lifecycle of your microservice including early specs, design, technology choice, development, unit-testing, integration-testing, documentation, deployment, troubleshooting, enhancements etc.
- Lead a team to develop a feature or parts of the product
- Adhere to the Agile model of software development to plan, estimate, and ship per business priority
What you need to succeed
- 9+ years of work experience in software development of data processing products.
- Engineering degree in software or equivalent from a premier institute.
- Excellent knowledge of fundamentals of Computer Science like algorithms and data structures. Hands-on with functional programming and know-how of its concepts
- Excellent programming and debugging skills on the JVM. Proficient in writing code in Scala/Java/Rust/Haskell/Erlang that is reliable, maintainable, secure, and performant
- Experience with big data technologies like Spark, Flink, Kafka, Druid, HDFS, etc.
- Deep understanding of distributed systems concepts and scalability challenges including multi-threading, concurrency, sharding, partitioning, etc.
- Experience/knowledge of Akka/Lagom framework and/or stream processing technologies like RxJava or Project Reactor will be a big plus. Knowledge of design patterns like event-streaming, CQRS and DDD to build large microservice architectures will be a big plus
- Excellent communication skills. Willingness to work under pressure. Hunger to learn and succeed. Comfortable with ambiguity. Comfortable with complexity
Job Title: Data Engineer
Job Summary: As a Data Engineer, you will be responsible for designing, building, and maintaining the infrastructure and tools necessary for data collection, storage, processing, and analysis. You will work closely with data scientists and analysts to ensure that data is available, accessible, and in a format that can be easily consumed for business insights.
Responsibilities:
- Design, build, and maintain data pipelines to collect, store, and process data from various sources.
- Create and manage data warehousing and data lake solutions.
- Develop and maintain data processing and data integration tools.
- Collaborate with data scientists and analysts to design and implement data models and algorithms for data analysis.
- Optimize and scale existing data infrastructure to ensure it meets the needs of the business.
- Ensure data quality and integrity across all data sources.
- Develop and implement best practices for data governance, security, and privacy.
- Monitor data pipeline performance and errors, and troubleshoot issues as needed.
- Stay up-to-date with emerging data technologies and best practices.
Requirements:
Bachelor's degree in Computer Science, Information Systems, or a related field.
Experience with ETL tools like Matillion, SSIS, Informatica
Experience with SQL and relational databases such as SQL Server, MySQL, PostgreSQL, or Oracle
Experience writing complex SQL queries
Strong programming skills in languages such as Python, Java, or Scala.
Experience with data modeling, data warehousing, and data integration.
Strong problem-solving skills and ability to work independently.
Excellent communication and collaboration skills.
Familiarity with big data technologies such as Hadoop, Spark, or Kafka.
Familiarity with data warehouse / data lake technologies like Snowflake or Databricks
Familiarity with cloud computing platforms such as AWS, Azure, or GCP.
Familiarity with Reporting tools
Teamwork / growth contribution
- Helping the team conduct interviews and identify the right candidates
- Adhering to timelines
- Timely status communication and upfront communication of any risks
- Teach, train, and share knowledge with peers
- Good Communication skills
- Proven abilities to take initiative and be innovative
- Analytical mind with a problem-solving aptitude
Good to have :
Master's degree in Computer Science, Information Systems, or a related field.
Experience with NoSQL databases such as MongoDB or Cassandra.
Familiarity with data visualization and business intelligence tools such as Tableau or Power BI.
Knowledge of machine learning and statistical modeling techniques.
If you are passionate about data and want to work with a dynamic team of data scientists and analysts, we encourage you to apply for this position.
About Kloud9:
Kloud9 exists with the sole purpose of providing cloud expertise to the retail industry. Our team of cloud architects, engineers and developers help retailers launch a successful cloud initiative so you can quickly realise the benefits of cloud technology. Our standardised, proven cloud adoption methodologies reduce the cloud adoption time and effort so you can directly benefit from lower migration costs.
Kloud9 was founded with the vision of bridging the gap between E-commerce and cloud. The E-commerce of any industry is limiting and poses a huge challenge in terms of the finances spent on physical data structures.
At Kloud9, we know migrating to the cloud is the single most significant technology shift your company faces today. We are your trusted advisors in transformation and are determined to build a deep partnership along the way. Our cloud and retail experts will ease your transition to the cloud.
Our sole focus is to provide cloud expertise to the retail industry, giving our clients the empowerment that will take their business to the next level. Our team of proficient architects, engineers, and developers has been designing, building, and implementing solutions for retailers for an average of more than 20 years.
We are a cloud vendor that is both platform and technology independent. Our vendor independence not only provides us with a unique perspective into the cloud market but also ensures that we deliver the available cloud solutions that best meet our clients' requirements.
What we are looking for:
● 3+ years’ experience developing Data & Analytic solutions
● Experience building data lake solutions leveraging one or more of the following: AWS, EMR, S3, Hive & Spark
● Experience with relational SQL
● Experience with scripting languages such as Shell, Python
● Experience with source control tools such as GitHub and related dev process
● Experience with workflow scheduling tools such as Airflow
● In-depth knowledge of scalable cloud
● Has a passion for data solutions
● Strong understanding of data structures and algorithms
● Strong understanding of solution and technical design
● Has a strong problem-solving and analytical mindset
● Experience working with Agile Teams.
● Able to influence and communicate effectively, both verbally and written, with team members and business stakeholders
● Able to quickly pick up new programming languages, technologies, and frameworks
● Bachelor’s Degree in computer science
Why Explore a Career at Kloud9:
With job opportunities in prime locations in the US, London, Poland, and Bengaluru, we help build your career path in cutting-edge technologies of AI, Machine Learning, and Data Science. Be part of an inclusive and diverse workforce that's changing the face of retail technology with its creativity and innovative solutions. Our vested interest in our employees translates into delivering the best products and solutions to our customers.
Data Engineer- Senior
Cubera is a data company revolutionizing big data analytics and Adtech through data share value principles wherein the users entrust their data to us. We refine the art of understanding, processing, extracting, and evaluating the data that is entrusted to us. We are a gateway for brands to increase their lead efficiency as the world moves towards web3.
What are you going to do?
Design & develop high-performance and scalable solutions that meet the needs of our customers.
Work closely with Product Management, Architects, and cross-functional teams.
Build and deploy large-scale systems in Java/Python.
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Create data tools for analytics and data scientist team members that assist them in building and optimizing their algorithms.
Follow best practices that can be adopted across the Big Data stack.
Use your engineering experience and technical skills to drive the features and mentor the engineers.
What are we looking for ( Competencies) :
Bachelor’s degree in computer science, computer engineering, or related technical discipline.
Overall 5 to 8 years of programming experience in Java or Python, including object-oriented design.
Data handling frameworks: should have a working knowledge of one or more data handling frameworks like Hive, Spark, Storm, Flink, Beam, Airflow, NiFi, etc.
Data infrastructure: should have experience in building, deploying, and maintaining applications on popular cloud infrastructure like AWS, GCP, etc.
Data stores: must have expertise in one of the general-purpose NoSQL data stores like Elasticsearch, MongoDB, Redis, Redshift, etc.
Strong sense of ownership, focus on quality, responsiveness, efficiency, and innovation.
Ability to work with distributed teams in a collaborative and productive manner.
Benefits:
Competitive Salary Packages and benefits.
A collaborative, lively, and upbeat work environment with young professionals.
Job Category: Development
Job Type: Full Time
Job Location: Bangalore
The thrill of working at a start-up that is starting to scale massively is something else. Simpl (FinTech startup of the year - 2020) was formed in 2015 by Nitya Sharma, an investment banker from Wall Street, and Chaitra Chidanand, a tech executive from the Valley, when they teamed up with a very clear mission - to make money simple so that people can live well and do amazing things. Simpl is the payment platform for the mobile-first world. We're backed by some of the best names in fintech globally (folks who have invested in Visa, Square and Transferwise), and Joe Saunders, ex-Chairman and CEO of Visa, is a board member.
Everyone at Simpl is an internal entrepreneur who is given a lot of bandwidth and resources to create the next breakthrough towards the long-term vision of "making money Simpl". Our first product is a payment platform that lets people buy instantly, anywhere online, and pay later. In the background, Simpl uses big data for credit underwriting, risk and fraud modelling, all without any paperwork, and enables Banks and Non-Bank Financial Companies to access a whole new consumer market.
Skillset:
Workflow manager/scheduler such as Airflow, Luigi, or Oozie (a brief Airflow sketch follows the lists below)
Good handle on Python
ETL Experience
Batch processing frameworks like Spark, MapReduce/Pig
File formats: Parquet, JSON, XML, Thrift, Avro, Protobuf
Rule engines (Drools – business rule management system)
Distributed file systems like HDFS, NFS, AWS S3 and equivalents
Experience building/configuring dashboards
Nice to have:
Data platform experience, e.g. building data lakes, working with near-real-time applications/frameworks like Storm, Flink, Spark.
AWS
File encoding types: Thrift, Avro, Protobuf, Parquet, JSON, XML
Hive, HBase
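The workflow scheduler item above references a sketch; below is a minimal Apache Airflow DAG (assuming Airflow 2.4+), with placeholder task callables, showing the kind of extract-transform-load scheduling this skillset implies. It is an illustration only, not part of the posting.

```python
# Minimal Airflow sketch (assumes Airflow 2.4+); the DAG id, schedule, and
# task bodies are placeholders for illustration only.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    print("pull raw data from the source")    # placeholder extract step


def transform():
    print("clean and reshape the batch")      # placeholder transform step


def load():
    print("write results to the warehouse")   # placeholder load step


with DAG(
    dag_id="daily_etl",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    # Classic extract -> transform -> load dependency chain.
    t1 = PythonOperator(task_id="extract", python_callable=extract)
    t2 = PythonOperator(task_id="transform", python_callable=transform)
    t3 = PythonOperator(task_id="load", python_callable=load)
    t1 >> t2 >> t3
```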
XpressBees – a logistics company started in 2015 – is amongst the fastest growing companies of its sector. While we started off rather humbly in the space of ecommerce B2C logistics, the last 5 years have seen us steadily progress towards expanding our presence. Our vision to evolve into a strong full-service logistics organization reflects itself in our new lines of business like 3PL, B2B Xpress and cross-border operations. Our strong domain expertise and constant focus on meaningful innovation have helped us rapidly evolve as the most trusted logistics partner of India. We have progressively carved our way towards best-in-class technology platforms, an extensive network reach, and a seamless last mile management system. While on this aggressive growth path, we seek to become the one-stop-shop for end-to-end logistics solutions. Our big focus areas for the very near future include strengthening our presence as service providers of choice and leveraging the power of technology to improve efficiencies for our clients.
Job Profile
As a Lead Data Engineer in the Data Platform Team at XpressBees, you will build the data platform and infrastructure to support high-quality and agile decision-making in our supply chain and logistics workflows. You will define the way we collect and operationalize data (structured / unstructured), and build production pipelines for our machine learning models and (RT, NRT, Batch) reporting & dashboarding requirements. As a Senior Data Engineer in the XB Data Platform Team, you will use your experience with modern cloud and data frameworks to build products (with storage and serving systems) that drive optimisation and resilience in the supply chain via data visibility, intelligent decision making, insights, anomaly detection and prediction.
What You Will Do
• Design and develop the data platform and data pipelines for reporting, dashboarding and machine learning models. These pipelines would productionize machine learning models and integrate with agent review tools.
• Meet data completeness, correctness and freshness requirements.
• Evaluate and identify the data store and data streaming technology choices.
• Lead the design of the logical model and implement the physical model to support business needs. Come up with logical and physical database designs across platforms (MPP, MR, Hive/Pig) that are optimal for different use cases (structured / semi-structured). Envision & implement the optimal data modelling, physical design and performance optimization technique/approach required for the problem.
• Support your colleagues by reviewing code and designs.
• Diagnose and solve issues in our existing data pipelines, and envision and build their successors.
Qualifications & Experience relevant for the role
• A bachelor's degree in Computer Science or a related field with 6 to 9 years of technology experience.
• Knowledge of relational and NoSQL data stores, stream processing and micro-batching, to make technology & design choices.
• Strong experience in System Integration, Application Development, ETL and Data-Platform projects. Talented across technologies used in the enterprise space.
• Software development experience using:
• Expertise in relational and dimensional modelling
• Exposure across the entire SDLC process
• Experience in cloud architecture (AWS)
• Proven track record of keeping existing technical skills current and developing new ones, so that you can make strong contributions to deep architecture discussions around systems and applications in the cloud (AWS).
• Characteristics of a forward thinker and self-starter who flourishes with new challenges and adapts quickly to learning new knowledge.
• Ability to work with cross-functional teams of consulting professionals across multiple projects.
• A knack for helping an organization understand application architectures and integration approaches, architect advanced cloud-based solutions, and help launch the build-out of those systems.
• Passion for educating, training, designing, and building end-to-end systems.
What you'll do:
Design and development of scalable applications.
Collaborate with tech leads to gain maximum understanding of the underlying infrastructure.
Contribute to continual improvement by suggesting improvements to the software system.
Ensure high scalability and performance
You will advocate for good, clean, well-documented and performant code; follow standards and best practices.
We'd love for you to have:
Education: Bachelor/Master Degree in Computer Science
Experience: 1-3 years of relevant experience in BI/Big-Data with hands-on coding experience
Mandatory Skills
Strong in problem-solving
Good exposure to Big Data technologies: Hive, Hadoop, Impala, HBase, Kafka, Spark
Strong experience in Data Engineering
Able to comprehend challenges related to database and data warehousing technologies, and to understand complex designs and system architectures
Experience with the software development lifecycle, design, develop, review, debug, document, and deliver (especially in a multi-location organization)
Working knowledge of Java, Python
Desired Skills
Experience with reporting tools like Tableau, QlikView
Awareness of CI/CD pipelines
Inclination to work on cloud platforms, e.g. AWS
Crisp communication skills with team members and business owners
Be able to work in a challenging, dynamic environment and meet tight deadlines
We are looking for an exceptionally talented Lead Data Engineer who has exposure to implementing AWS services to build data pipelines, API integrations and data warehouse designs. A candidate with both hands-on and leadership capabilities will be ideal for this position.
Qualification: At least a bachelor's degree in Science, Engineering or Applied Mathematics; a master's degree is preferred.
Job Responsibilities:
• Total 6+ years of experience as a Data Engineer, including 2+ years of experience managing a team
• A minimum of 3 years of AWS cloud experience
• Well versed in languages such as Python, PySpark, SQL, NodeJS, etc.
• Extensive experience in the Spark ecosystem, having worked on both real-time and batch processing
• Experience with AWS Glue, EMR, DMS, Lambda, S3, DynamoDB, Step Functions, Airflow, RDS, Aurora, etc.
• Experience with modern database systems such as Redshift, Presto, Hive, etc.
• Worked on building data lakes in the past on S3 or Apache Hudi
• Solid understanding of Data Warehousing Concepts
• Good to have: experience with tools such as Kafka or Kinesis
• Good to have: AWS Developer Associate or Solutions Architect Associate certification
• Have experience in managing a team
Leading StartUp Focused On Employee Growth
● Proficiency in Linux.
● Experience working with AWS cloud services: EC2, S3, RDS, Redshift.
● Must have SQL knowledge and experience working with relational databases, query authoring (SQL), as well as familiarity with databases including MySQL, MongoDB, Cassandra, and Athena.
● Must have experience with Python/Scala.
● Must have experience with Big Data technologies like Apache Spark.
● Must have experience with Apache Airflow.
● Experience with data pipelines and ETL tools like AWS Glue.
AI-powered cloud-based SaaS solution provider
● Able to contribute to gathering functional requirements, developing technical specifications, and test case planning
● Demonstrate technical expertise, solving challenging programming and design problems
● 60% hands-on coding with architecture ownership of one or more products
● Ability to articulate architectural and design options, and educate development teams and business users
● Resolve defects/bugs during QA testing, pre-production, production, and post-release patches
● Mentor and guide team members
● Work cross-functionally with various bidgely teams including product management, QA/QE, various product lines, and/or business units to drive forward results
Requirements
● BS/MS in computer science or equivalent work experience
● 8-12 years' experience designing and developing applications in Data Engineering
● Hands-on experience with Big Data ecosystems
● Past experience with Hadoop, HDFS, MapReduce, YARN, AWS Cloud, EMR, S3, Spark, Cassandra, Kafka, Zookeeper
● Expertise with any of the following object-oriented languages (OOD): Java/J2EE, Scala, Python
● Ability to lead and mentor technical team members
● Expertise with the entire Software Development Life Cycle (SDLC)
● Excellent communication skills: demonstrated ability to explain complex technical issues to both technical and non-technical audiences
● Expertise in the Software design/architecture process
● Expertise with unit testing & Test-Driven Development (TDD)
● Business Acumen - strategic thinking & strategy development
● Experience on Cloud or AWS is preferable
● A good understanding of, and the ability to develop, software, prototypes, or proofs of concept (POCs) for various Data Engineering requirements
● Experience with Agile Development, SCRUM, or Extreme Programming methodologies
World's fastest growing consumer internet company
Data Engineer JD:
- Designing, developing, constructing, installing, testing and maintaining complete data management & processing systems.
- Building a highly scalable, robust, fault-tolerant & secure user data platform that adheres to data protection laws.
- Taking care of the complete ETL (Extract, Transform & Load) process.
- Ensuring architecture is planned in such a way that it meets all the business requirements.
- Exploring new ways of using existing data to derive more insights from it.
- Proposing ways to improve data quality, reliability & efficiency of the whole system.
- Creating data models to reduce system complexity and hence increase efficiency & reduce cost.
- Introducing new data management tools & technologies into the existing system to make it more efficient.
- Setting up monitoring and alerting on data pipeline jobs to detect failures and anomalies (a brief sketch follows this list)
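The monitoring and alerting item above references a sketch; below is a hedged, stack-agnostic example of a data-freshness check that flags a pipeline whose output has stopped updating. The output path, threshold, and alert sink are placeholders, not this company's actual tooling.

```python
# Hedged sketch: alert when a pipeline's output directory has not been updated
# recently. The path, threshold, and alert sink are placeholders.
import time
from pathlib import Path


def latest_modification_ts(output_dir: str) -> float:
    """Return the most recent modification time of any file under output_dir."""
    files = (f for f in Path(output_dir).rglob("*") if f.is_file())
    return max((f.stat().st_mtime for f in files), default=0.0)


def send_alert(message: str) -> None:
    # In a real pipeline this would page on-call or post to a chat channel.
    print(f"ALERT: {message}")


def check_freshness(output_dir: str, max_lag_seconds: int = 3600) -> None:
    lag = time.time() - latest_modification_ts(output_dir)
    if lag > max_lag_seconds:
        send_alert(f"{output_dir} is stale: no update for {int(lag)} seconds")


if __name__ == "__main__":
    check_freshness("/data/warehouse/orders", max_lag_seconds=6 * 3600)
```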
What do we expect from you?
- BS/MS in Computer Science or equivalent experience
- 5 years of recent experience in Big Data Engineering.
- Good experience in working with Hadoop and Big Data technologies like HDFS, Pig, Hive, Zookeeper, Storm, Spark, Airflow and NoSQL systems
- Excellent programming and debugging skills in Java or Python.
- Apache Spark, Python, and hands-on experience in deploying ML models
- Has worked on streaming and real-time pipelines
- Experience with Apache Kafka, or has worked with any of Spark Streaming, Flume or Storm
Focus Area:
- R1: Data Structures & Algorithms
- R2: Problem Solving + Coding
- R3: Design (LLD)
- Work in collaboration with the application team and integration team to design, create, and maintain optimal data pipeline architecture and data structures for Data Lake/Data Warehouse.
- Work with stakeholders including the Sales, Product, and Customer Support teams to assist with data-related technical issues and support their data analytics needs.
- Assemble large, complex data sets from third-party vendors to meet business requirements.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Elasticsearch, MongoDB, and AWS technology.
- Streamline existing and introduce enhanced reporting and analysis solutions that leverage complex data sources derived from multiple internal systems.
Requirements
- 5+ years of experience in a Data Engineer role.
- Proficiency in Linux.
- Must have SQL knowledge and experience working with relational databases, query authoring (SQL), as well as familiarity with databases including MySQL, MongoDB, Cassandra, and Athena.
- Must have experience with Python/Scala.
- Must have experience with Big Data technologies like Apache Spark.
- Must have experience with Apache Airflow.
- Experience with data pipeline and ETL tools like AWS Glue.
- Experience working with AWS cloud services: EC2, S3, RDS, Redshift.
upGrad is an online education platform building the careers of tomorrow by offering the most industry-relevant programs in an immersive learning experience. Our mission is to create a new digital-first learning experience to deliver tangible career impact to individuals at scale. upGrad currently offers programs in Data Science, Machine Learning, Product Management, Digital Marketing, and Entrepreneurship, etc. upGrad is looking for people passionate about management and education to help design learning programs for working professionals to stay sharp and stay relevant and help build the careers of tomorrow.
- upGrad was awarded the Best Tech for Education by IAMAI for 2018-19,
- upGrad was also ranked as one of the LinkedIn Top Startups 2018: The 25 most sought-after startups in India.
- upGrad was earlier selected as one of the top ten most innovative companies in India by FastCompany.
- We were also covered by the Financial Times along with other disruptors in Ed-Tech.
- upGrad is the official education partner for Government of India - Startup India program.
- Our program with IIIT B has been ranked #1 program in the country in the domain of Artificial Intelligence and Machine Learning.
About the Role
A highly motivated individual who has experience in architecting end-to-end web-based ecommerce/online/SaaS products and systems, bringing them to production quickly and with high quality. Able to understand expected business results and map architecture to drive the business forward. Passionate about building world-class solutions.
Role and Responsibilities
- Work with Product Managers and Business to understand business/product requirements and vision.
- Provide a clear architectural vision in line with business and product vision.
- Lead a team of architects, developers, and data engineers to provide platform services to other engineering teams.
- Provide architectural oversight to engineering teams across the organization.
- Hands on design and development of platform services and features owned by self - this is a hands-on coding role.
- Define guidelines for best practices covering design, unit testing, secure coding etc.
- Ensure quality by reviewing design, code, test plans, load test plans etc. as appropriate.
- Work closely with the QA and Support teams to track quality and proactively identify improvement opportunities.
- Work closely with DevOps and IT to ensure highly secure and cost optimized operations in the cloud.
- Grow technical skills in the team - identify skill gaps with plans to address them, participate in hiring, mentor other architects and engineers.
- Support other engineers in resolving complex technical issues as a go-to person.
Skills/Experience
- 12+ years of experience in design and development of ecommerce scale systems and highly scalable SaaS or enterprise products.
- Extensive experience in developing extensible and scalable web applications with
- Java, Spring Boot, Go
- Web Services - REST, OAuth, OData
- Database/Caching - MySQL, Cassandra, MongoDB, Memcached/Redis
- Queue/Broker services - RabbitMQ/Kafka
- Microservices architecture via Docker on AWS or Azure.
- Experience with web front end technologies - HTML5, CSS3, JavaScript libraries and frameworks such as jQuery, AngularJS, React, Vue.js, Bootstrap etc.
- Extensive experience with cloud based architectures and how to optimize design for cost.
- Expert level understanding of secure application design practices and a working understanding of cloud infrastructure security.
- Experience with CI/CD processes and design for testability.
- Experience working with big data technologies such as Spark/Storm/Hadoop/Data Lake Architectures is a big plus.
- Action and result-oriented problem-solver who works well both independently and as part of a team; able to foster and develop others' ideas as well as his/her own.
- Ability to organize, prioritize and schedule a high workload and multiple parallel projects efficiently.
- Excellent verbal and written communication with stakeholders in a matrixed environment.
- Long term experience with at least one product from inception to completion and evolution of the product over multiple years.
B.Tech/MCA (IT/Computer Science) from a premier institution (IIT/NIT/BITS) and/or a US Master's degree in Computer Science.
at Happymonk AI labs
Work across the full stack, building highly scalable distributed solutions that enable positive user experiences and measurable business growth
Develop new features and infrastructure in support of rapidly emerging business and project requirements
Assume leadership of new projects from conceptualization to deployment
Ensure application performance, uptime, and scale, maintaining high standards of code quality and thoughtful application design
Work with agile development methodologies, adhering to best practices and pursuing continued learning opportunities
Visualize, design, and develop creative and innovative software platforms, as we continue to experience dramatic growth in the usage and visibility of our products
Create scalable software platforms and applications, and efficient networking solutions, that are unit tested, code reviewed and checked regularly for continuous integration
Examine existing systems, identifying flaws and creating solutions to improve service uptime and time-to-resolve through monitoring and automated remediation
Plan and execute full software development life cycles (SDLC) for each assigned project, adhering to company standards and expectations
Special Skills Required
Bachelor's degree in software engineering or information technology
2+ years of experience engineering software and networking platforms
2+ years of experience in building large-scale software applications
Proven ability to document design processes, including development, tests, analytics, and troubleshooting
Experience with rapid development cycles in a web-based environment
Strong scripting and test automation abilities; ability to drive a Test-Driven Development model
Working knowledge of relational databases as well as ORMs, Postgres, and other SQL technologies
Proficiency with JavaScript, TypeScript, React.js, Babylon, Node.js, HTML5, CSS3, and order management systems
Proven experience designing interactive applications and networking platforms
Web application development experience with multiple frameworks, including Blockchain, Hyperledger, Spark, Kafka, Elasticsearch, Neo4j, GraphQL
Desire to continue to grow professional capabilities with ongoing training and educational opportunities
Additional knowledge of computer vision, embedded, or blockchain technologies is a plus
Experience designing and integrating RESTful APIs
Excellent debugging and optimization skills
Unit/integration testing experience
Interest in learning new tools, languages, workflows, and philosophies to grow
Professional certifications
Location: Bengaluru, Karnataka, India - 560072
- Performs analytics to extract insights from the organization's raw historical data.
- Generates usable training datasets for any/all MV projects with the help of annotators, if needed.
- Analyses user trends and identifies their biggest bottlenecks in the Hammoq workflow.
- Tests the short/long-term impact of productized MV models on those trends.
- Skills: NumPy, Pandas, Apache Spark (PySpark), and ETL are mandatory (a brief sketch follows this list).
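The skills line above references a sketch; here is a small, illustrative Pandas pass over raw historical event data of the kind this role analyzes. The file name and column names are assumptions invented for the example, not part of the posting.

```python
# Illustrative only: find the slowest workflow steps in raw historical events.
# The CSV file and the workflow_step / duration_seconds columns are placeholders.
import pandas as pd

events = pd.read_csv("events.csv", parse_dates=["event_time"])

# Average time spent per workflow step; the slowest steps are candidate bottlenecks.
step_latency = (
    events.groupby("workflow_step")["duration_seconds"]
    .agg(["mean", "count"])
    .sort_values("mean", ascending=False)
)
print(step_latency.head(10))
```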
• Create and maintain data pipeline
• Build and deploy ETL infrastructure for optimal data delivery
• Work with various teams, including the product, design and executive teams, to troubleshoot data-related issues
• Create tools for data analysts and scientists to help them build and optimise the product
• Implement systems and processes for data access controls and guarantees
• Distill knowledge from experts in the field outside the org and optimise internal data systems
Preferred qualifications/skills:
• 5+ years of experience
• Strong analytical skills
Freight Commerce Solutions Pvt Ltd.
• Degree in Computer Science, Statistics, Informatics, Information Systems
• Strong project management and organisational skills
• Experience supporting and working with cross-functional teams in a dynamic environment
• SQL guru with hands-on experience across various databases
• NoSQL databases like Cassandra, MongoDB
• Experience with Snowflake, Redshift
• Experience with tools like Airflow, Hevo
• Experience with Hadoop, Spark, Kafka, Flink
• Programming experience in Python, Java, Scala
Recko Inc. is looking for data engineers to join our kick-ass engineering team. We are looking for smart, dynamic individuals to connect all the pieces of the data ecosystem.
What are we looking for:
- 3+ years of development experience in at least one of MySQL, Oracle, PostgreSQL or MSSQL, and experience working with Big Data frameworks/platforms/data stores like Hadoop, HDFS, Spark, Oozie, Hue, EMR, Scala, Hive, Glue, Kerberos, etc.
- Strong experience setting up data warehouses, data modeling, data wrangling and dataflow architecture on the cloud
- 2+ years of experience with public cloud services such as AWS, Azure, or GCP and languages like Java / Python, etc.
- 2+ years of development experience with Amazon Redshift, Google BigQuery or Azure data warehouse platforms preferred
- Knowledge of statistical analysis tools like R, SAS, etc.
- Familiarity with any data visualization software
- A growth mindset and a passion for building things from the ground up, and most importantly, you should be fun to work with
As a data engineer at Recko, you will:
- Create and maintain optimal data pipeline architecture
- Assemble large, complex data sets that meet functional / non-functional business requirements
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS 'big data' technologies
- Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics
- Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
- Keep our data separated and secure across national boundaries through multiple data centers and AWS regions
- Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
- Work with data and analytics experts to strive for greater functionality in our data systems
About Recko:
Recko was founded in 2017 to organise the world's transactional information and provide intelligent applications to finance and product teams to make sense of the vast amount of data available. With the proliferation of digital transactions over the past two decades, enterprises, banks and financial institutions are finding it difficult to keep track of the money flowing across their systems. With the Recko Platform, businesses can build, integrate and adapt innovative and complex financial use cases within the organization and across external payment ecosystems with agility, confidence and at scale. Today, customer-obsessed brands such as Deliveroo, Meesho, Grofers, Dunzo, Acommerce, etc. use Recko so their finance teams can optimize resources with automation and prioritize growth over repetitive and time-consuming tasks around day-to-day operations.
Recko is a Series A funded startup, backed by marquee investors like Vertex Ventures, Prime Venture Partners and Locus Ventures. Traditionally enterprise software is always built around functionality. We believe software is an extension of one’s capability, and it should be delightful and fun to use.
Working at Recko:
We believe that great companies are built by amazing people. At Recko, we are a group of young Engineers, Product Managers, Analysts and Business folks on a mission to bring consumer-tech DNA to enterprise fintech applications. The current team at Recko is 60+ members strong, with stellar experience across fintech, e-commerce and digital domains at companies like Flipkart, PhonePe, Ola Money, Belong, Razorpay, Grofers, Jio, Oracle, etc. We are growing aggressively across verticals.
The Platform Data Science team works at the intersection of data science and engineering. Domain experts develop and advance platforms, including the data platform, the machine learning platform, and other platforms for Forecasting, Experimentation, Anomaly Detection, Conversational AI, Underwriting of Risk, Portfolio Management, Fraud Detection & Prevention and many more. We are also the Data Science and Analytics partners for Product and provide Behavioural Science insights across Jupiter.
About the role:
We’re looking for strong Software Engineers who can combine EMR, Redshift, Hadoop, Spark, Kafka, Elasticsearch, TensorFlow, PyTorch and other technologies to build the next-generation Data Platform, ML Platform and Experimentation Platform. If this sounds interesting, we’d love to hear from you!
This role will involve designing and developing software products that impact many areas of our business. The individual in this role will be responsible for helping define requirements, creating software designs, implementing code to these specifications, providing thorough unit and integration testing, and supporting products while they are deployed and used by our stakeholders.
Key Responsibilities:
Participate in, own and influence the architecture and design of systems
Collaborate with other engineers, data scientists and product managers
Build intelligent systems that drive decisions
Build systems that enable us to perform experiments and iterate quickly
Build platforms that enable scientists to train, deploy and monitor models at scale
Build analytical systems that drive better decision making
Required Skills:
Programming experience with at least one modern language such as Java or Scala, including object-oriented design
Experience in contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
Bachelor’s degree in Computer Science or related field
Computer Science fundamentals in object-oriented design
Computer Science fundamentals in data structures
Computer Science fundamentals in algorithm design, problem solving, and complexity analysis
Experience in databases, analytics, big data systems or business intelligence products:
Data lake, data warehouse, ETL, ML platform
Big data technologies like Hadoop and Apache Spark (a short Spark sketch follows below)
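As a short Spark sketch of the kind of work described above, the snippet below shows two everyday patterns on an experimentation platform: broadcasting a small dimension table into a join and caching a DataFrame that several aggregations reuse. The paths, column names and table shapes are assumptions for illustration only.

from pyspark.sql import SparkSession
from pyspark.sql.functions import broadcast

spark = SparkSession.builder.appName("experiment-metrics").getOrCreate()

events = spark.read.parquet("/data/experiment_events")       # large fact table (placeholder path)
variants = spark.read.parquet("/data/experiment_variants")   # small dimension table (placeholder path)

# Broadcasting the small side avoids shuffling the large events table.
joined = events.join(broadcast(variants), "experiment_id")

# Cache once, reuse across several downstream aggregations.
joined.cache()
joined.groupBy("variant").count().show()
joined.groupBy("variant").avg("metric_value").show()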
A Series A funded, Silicon Valley based BI startup that helps companies uncover the 3% of active buyers in their target market. It evaluates over 100 billion data points and analyzes factors such as buyer journeys, technology adoption patterns, and other digital footprints to deliver market & sales intelligence. Its customers have access to the buying patterns and contact information of more than 17 million companies and 70 million decision makers across the world.
Role – Data Engineer
Responsibilities
- Work in collaboration with the application team and integration team to design, create, and maintain optimal data pipeline architecture and data structures for Data Lake/Data Warehouse.
- Work with stakeholders including the Sales, Product, and Customer Support teams to assist with data-related technical issues and support their data analytics needs.
- Assemble large, complex data sets from third-party vendors to meet business requirements.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Elasticsearch, MongoDB, and AWS technology.
- Streamline existing and introduce enhanced reporting and analysis solutions that leverage complex data sources derived from multiple internal systems.
Requirements
- 5+ years of experience in a Data Engineer role.
- Proficiency in Linux.
- Must have SQL knowledge and experience working with relational databases, query authoring (SQL), as well as familiarity with databases including MySQL, MongoDB, Cassandra, and Athena.
- Must have experience with Python/Scala.
- Must have experience with Big Data technologies like Apache Spark.
- Must have experience with Apache Airflow (an illustrative DAG sketch follows this list).
- Experience with data pipeline and ETL tools like AWS Glue.
- Experience working with AWS cloud services: EC2, S3, RDS, Redshift.
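For the Apache Airflow item above, here is an illustrative Airflow 2.x style DAG sketch: a daily pipeline with a Python extract step followed by a spark-submit transform. The DAG id, schedule, callable and job path are hypothetical.

from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator
from airflow.operators.bash import BashOperator

def extract(**context):
    # Placeholder extract step; real code would pull from an API or database.
    print("extracting data for", context["ds"])

with DAG(
    dag_id="daily_orders_pipeline",     # hypothetical DAG id
    start_date=datetime(2023, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    transform_task = BashOperator(
        task_id="spark_transform",
        bash_command="spark-submit /opt/jobs/transform_orders.py {{ ds }}",  # placeholder job path
    )
    extract_task >> transform_task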
The Data Engineer would be responsible for selecting and integrating the required Big Data tools and frameworks, and would implement Data Ingestion and ETL/ELT processes.
Required Experience, Skills and Qualifications:
- Hands-on experience with Big Data tools/technologies like Spark, Databricks, MapReduce, Hive, HDFS.
- Expertise and excellent understanding of the big data toolset, such as Sqoop, Spark Streaming, Kafka, NiFi.
- Proficiency in any of the programming languages Python/Scala/Java, with 4+ years’ experience.
- Experience with cloud infrastructure like MS Azure, Data Lake, etc.
- Good working knowledge of NoSQL DBs (MongoDB, HBase, Cassandra).
- Owns the end-to-end implementation of the assigned data processing components/product features, i.e. design, development, deployment, and testing of the data processing components and associated flows, conforming to best coding practices
- Creation and optimization of data engineering pipelines for analytics projects
- Support data and cloud transformation initiatives
- Contribute to our cloud strategy based on prior experience
- Independently work with all stakeholders across the organization to deliver enhanced functionalities
- Create and maintain automated ETL processes with a special focus on data flow, error recovery, and exception handling and reporting (see the sketch after this list)
- Gather and understand data requirements, work with the team to achieve high-quality data ingestion, and build systems that can process and transform the data
- Be able to comprehend the application of database indexes and transactions
- Be involved in the design and development of a Big Data predictive analytics SaaS-based customer data platform using object-oriented analysis, design and programming skills, and design patterns
- Implement ETL workflows for data matching, data cleansing, data integration, and management
- Maintain existing data pipelines, and develop new data pipelines using big data technologies
- Responsible for leading the effort of continuously improving the reliability, scalability, and stability of microservices and the platform
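To make the error-recovery and exception-reporting bullet concrete, here is a generic Python sketch of a retry wrapper around an ETL step, with logging and a failure notification hook; the function names and the alerting call are hypothetical and stand in for whatever tooling the team actually uses.

import logging
import time

logger = logging.getLogger("etl")

def notify_on_call(step_name):
    # Placeholder alerting hook; in practice this might post to Slack or PagerDuty.
    logger.error("alert: ETL step %s exhausted its retries", step_name)

def run_with_retries(step_fn, max_attempts=3, backoff_seconds=30):
    """Run one ETL step, retrying on failure and reporting the final error."""
    for attempt in range(1, max_attempts + 1):
        try:
            return step_fn()
        except Exception:
            logger.exception("step %s failed (attempt %d/%d)",
                             step_fn.__name__, attempt, max_attempts)
            if attempt == max_attempts:
                notify_on_call(step_fn.__name__)
                raise
            time.sleep(backoff_seconds * attempt)  # simple linear backoff between retries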
Roles & Responsibilities
- Proven experience deploying and tuning open source components into enterprise-ready production tooling
- Experience with data centre (Metal as a Service – MAAS) and cloud deployment technologies (AWS or GCP Architect certificates required)
- Deep understanding of Linux, from kernel mechanisms through user-space management
- Experience with CI/CD (Continuous Integration and Deployment) system solutions (Jenkins)
- Experience using monitoring tools (local and on public cloud platforms) such as Nagios, Prometheus, Sensu, ELK, CloudWatch, Splunk, New Relic, etc. to trigger instant alerts, reports and dashboards
- Work closely with the development and infrastructure teams to analyze and design solutions with four-nines (99.99%) up-time across globally distributed, clustered, production and non-production virtualized infrastructure
- Wide understanding of IP networking as well as data centre infrastructure
Skills
- Expert with software development tools and source code management: understanding and managing issues and code changes, and grouping them into deployment releases in a stable and measurable way to maximize production
- Must be an expert at developing and using Ansible roles and configuring deployment templates with Jinja2
- Solid understanding of data collection tools like Flume, Filebeat, Metricbeat, JMX Exporter agents
- Extensive experience operating and tuning the Kafka streaming data platform, specifically as a message queue for big data processing
- Strong understanding of, and hands-on experience with:
  - the Apache Spark framework, specifically Spark Core and Spark Streaming
  - orchestration platforms: Mesos and Kubernetes
  - data storage platforms: Elastic Stack, Carbon, ClickHouse, Cassandra, Ceph, HDFS
  - core presentation technologies: Kibana and Grafana
- Excellent scripting and programming skills (Bash, Python, Java, Go, Rust). Must have previous experience with Rust in order to support and improve in-house developed products
Certification
Red Hat Certified Architect certificate or equivalent required. CCNA certificate required. 3-5 years of experience running open source big data platforms.
We are looking for candidates who have good BI/DW experience of 3-6 years, with Spark, Scala and SQL expertise, and Azure. An Azure background is needed.
* Spark hands on : Must have
* Scala hands on : Must have
* SQL expertise : Expert
* Azure background : Must have
* Python hands on : Good to have
* ADF, Databricks: Good to have
* Should be able to communicate effectively and deliver technology implementation end to end
Looking for candidates who can join within 15 to 30 days, or who are available immediately.
Regards
Gayatri P
Fragma Data Systems
- Expert software implementation and automated testing
- Promoting development standards, code reviews, mentoring, knowledge sharing
- Improving our Agile methodology maturity
- Product and feature design, scrum story writing
- Build, release, and deployment automation
- Product support & troubleshooting
Who we have in mind:
- Demonstrated experience as a Java developer
- Should have a deep understanding of Enterprise/Distributed Architecture patterns and should be able to demonstrate the relevant usage of the same
- Turn high-level project requirements into application-level architecture and collaborate with the team members to implement the solution
- Strong experience and knowledge in Spring boot framework and microservice architecture
- Experience in working with Apache Spark
- Solid demonstrated object-oriented software development experience with Java, SQL, Maven, relational/NoSQL databases and testing frameworks
- Strong working experience with developing RESTful services
- Should have experience working on Application frameworks such as Spring, Spring Boot, AOP
- Exposure to tools – Jira, Bamboo, Git, Confluence would be an added advantage
- Excellent grasp of the current technology landscape, trends and emerging technologies
Job Description
This role requires experience with AWS as well as programming experience in Python and Spark.
Roles & Responsibilities
You Will:
- Translate functional requirements into technical design
- Interact with clients and internal stakeholders to understand the data and platform requirements in detail and determine core cloud services needed to fulfil the technical design
- Design, develop and deliver data integration interfaces in AWS
- Design, develop and deliver data provisioning interfaces to fulfil consumption needs
- Deliver data models on the cloud platform; this could be on AWS Redshift, SQL
- Design, develop and deliver data integration interfaces at scale using Python/Spark (an illustrative sketch follows this list)
- Automate core activities to minimize delivery lead times and improve overall quality
- Optimize platform cost by selecting the right platform services and architecting the solution in a cost-effective manner
- Manage code and deploy DevOps and CI/CD processes
- Deploy logging and monitoring across the different integration points for critical alerts
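As an illustrative sketch of the integration and provisioning interfaces above, the snippet below lands curated Parquet on S3 with Spark and then loads it into Redshift with a COPY statement; the bucket, table, IAM role and cluster connection details are placeholders.

import psycopg2
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("provision-customers").getOrCreate()

# Export the curated dataset to S3 as Parquet (placeholder paths and bucket).
curated = spark.read.parquet("/data/curated/customers")
curated.write.mode("overwrite").parquet("s3a://example-bucket/exports/customers/")

# Load the exported files into Redshift via COPY (placeholder cluster, table and IAM role).
conn = psycopg2.connect(host="redshift-host", port=5439,
                        dbname="analytics", user="loader", password="***")
with conn, conn.cursor() as cur:
    cur.execute("""
        COPY analytics.customers
        FROM 's3://example-bucket/exports/customers/'
        IAM_ROLE 'arn:aws:iam::123456789012:role/redshift-loader'
        FORMAT AS PARQUET
    """)
conn.close()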
You Have:
- Minimum 5 years of software development experience
- Bachelor's and/or Master’s degree in computer science
- Strong Consulting skills in data management including data governance, data quality, security, data integration, processing and provisioning
- Delivered data management projects in any of the AWS
- Translated complex analytical requirements into technical design including data models, ETLs and Dashboards / Reports
- Experience deploying dashboards and self-service analytics solutions on both relational and non-relational databases
- Experience with different computing paradigms in databases such as In-Memory, Distributed, Massively Parallel Processing
- Successfully delivered large scale data management initiatives covering Plan, Design, Build and Deploy phases leveraging different delivery methodologies including Agile
- Strong knowledge of continuous integration, static code analysis and test-driven development
- Experience in delivering projects in a highly collaborative delivery model with teams at onsite and offshore
- Excellent analytical and problem-solving skills are a must
- Delivered change management initiatives focused on driving data platforms adoption across the enterprise
- Strong verbal and written communications skills are a must, as well as the ability to work effectively across internal and external organizations
Knowledge of Hadoop ecosystem installation, initial-configuration and performance tuning.
Expert with Apache Ambari, Spark, Unix Shell scripting, Kubernetes and Docker
Knowledge of Python would be desirable.
Experience with HDP Manager/clients and various dashboards.
Understanding of Hadoop security (Kerberos, Ranger and Knox), encryption and data masking.
Experience with automation/configuration management using Chef, Ansible or an equivalent.
Strong experience with any Linux distribution.
Basic understanding of network technologies, CPU, memory and storage.
Database administration is a plus.
Qualifications and Education Requirements
2 to 4 years of experience with, and detailed knowledge of, Core Hadoop Components solutions and dashboards running on Big Data technologies such as Hadoop/Spark.
Bachelor's degree or equivalent in Computer Science, Information Technology or related fields.
Responsibilities for Data Architect
- Research and properly evaluate sources of information to determine possible limitations in reliability or usability
- Apply sampling techniques to effectively determine and define ideal categories to be questioned
- Compare and analyze provided statistical information to identify patterns, relationships and problems
- Define and utilize statistical methods to solve industry-specific problems in varying fields, such as economics and engineering
- Prepare detailed reports for management and other departments by analyzing and interpreting data
- Train assistants and other members of the team how to properly organize findings and read data collected
- Design computer code using various languages to improve and update software and applications
- Refer to previous instances and findings to determine the ideal method for gathering data
Data Platform engineering at Uber is looking for a strong Technical Lead (Level 5a Engineer) who has built high quality platforms and services that can operate at scale. 5a Engineer at Uber exhibits following qualities:
- Demonstrate tech expertise: Demonstrate technical skills to go very deep or broad in solving classes of problems or creating broadly leverageable solutions.
- Execute large scale projects: Define, plan and execute complex and impactful projects. You communicate the vision to peers and stakeholders.
- Collaborate across teams: Act as a domain resource to engineers outside your team and help them leverage the right solutions. Facilitate technical discussions and drive to a consensus.
- Coach engineers: Coach and mentor less experienced engineers and deeply invest in their learning and success. You give and solicit feedback, both positive and negative, to others you work with to help improve the entire team.
- Tech leadership: Lead the effort to define the best practices in your immediate team, and help the broader organization establish better technical or business processes.
What You’ll Do
- Build a scalable, reliable, operable and performant data analytics platform for Uber’s engineers, data scientists, products and operations teams.
- Work alongside the pioneers of big data systems such as Hive, Yarn, Spark, Presto, Kafka, Flink to build out a highly reliable, performant, easy to use software system for Uber’s planet scale of data.
- Become proficient in the multi-tenancy, resource isolation, abuse prevention, and self-serve debuggability aspects of a high-performance, large-scale service while building these capabilities for Uber's engineers and operations folks.
What You’ll Need
- 7+ years experience in building large scale products, data platforms, distributed systems in a high caliber environment.
- Architecture: Identify and solve major architectural problems by going deep in your field or broad across different teams. Extend, improve, or, when needed, build solutions to address architectural gaps or technical debt.
- Software Engineering/Programming: Create frameworks and abstractions that are reliable and reusable. You have advanced knowledge of at least one programming language and are happy to learn more. Our core languages are Java, Python, Go, and Scala.
- Data Engineering: Expertise in one of the big data analytics technologies we currently use such as Apache Hadoop (HDFS and YARN), Apache Hive, Impala, Drill, Spark, Tez, Presto, Calcite, Parquet, Arrow etc. Under the hood experience with similar systems such as Vertica, Apache Impala, Drill, Google Borg, Google BigQuery, Amazon EMR, Amazon RedShift, Docker, Kubernetes, Mesos etc.
- Execution & Results: You tackle large technical projects/problems that are not clearly defined. You anticipate roadblocks and have strategies to de-risk timelines. You orchestrate work that spans multiple teams and keep your stakeholders informed.
- A team player: You believe that you can achieve more on a team, that the whole is greater than the sum of its parts. You rely on others’ candid feedback for continuous improvement.
- Business acumen: You understand requirements beyond the written word. Whether you’re working on an API used by other developers, an internal tool consumed by our operation teams, or a feature used by millions of customers, your attention to details leads to a delightful user experience.
Spark / Scala experience should be more than 2 years.
A combination of Java & Scala is fine, or we are even fine with a Big Data Developer with strong Core Java concepts - Scala / Spark Developer.
Strong proficiency in Scala on Spark (Hadoop); Scala + Java is also preferred.
Complete SDLC process and Agile Methodology (Scrum)
Version control / Git
We are looking for BE/BTech graduates (2018/2019 pass out) who want to build their career as Data Engineers covering technologies like Hadoop, NoSQL, RDBMS, Spark, Kafka, Hive, ETL, MDM & Data Quality. You should be willing to learn, explore, experiment and develop POCs/solutions using these technologies with guidance and support from highly experienced industry leaders. You should be passionate about your work and willing to go the extra mile to achieve results.
We are looking for candidates who believe in commitment and in building strong relationships. We need people who are passionate about solving problems through software and are flexible.
Required Experience, Skills and Qualifications
Passionate to learn and explore new technologies
Any RDBMS experience (SQL Server/Oracle/MySQL)
Any ETL tool experience (Informatica/Talend/Kettle/SSIS)
Understanding of Big Data technologies
Good Communication Skills
Excellent Mathematical / Logical / Reasoning Skills