24+ HDFS Jobs in India
Experience: 12-15 Years
Key Responsibilities:
- Client Engagement & Requirements Gathering: Independently engage with client stakeholders to understand data landscapes and requirements, translating them into functional and technical specifications.
- Data Architecture & Solution Design: Architect and implement Hadoop-based Cloudera CDP solutions, including data integration, data warehousing, and data lakes.
- Data Processes & Governance: Develop data ingestion and ETL/ELT frameworks, ensuring robust data governance and quality practices.
- Performance Optimization: Provide SQL expertise and optimize Hadoop ecosystems (HDFS, Ozone, Kudu, Spark Streaming, etc.) for maximum performance.
- Coding & Development: Hands-on coding in relevant technologies and frameworks, ensuring project deliverables meet stringent quality and performance standards.
- API & Database Management: Integrate APIs and manage databases (e.g., PostgreSQL, Oracle) to support seamless data flows.
- Leadership & Mentoring: Guide and mentor a team of data engineers and analysts, fostering collaboration and technical excellence.
Skills Required:
a. Technical Proficiency:
- Extensive experience with Hadoop ecosystem tools and services (HDFS, YARN, Cloudera Manager, Impala, Kudu, Hive, Spark Streaming, etc.).
- Proficiency in Spark and programming languages such as Python and Scala, and a strong grasp of SQL performance tuning.
- ETL tool expertise (e.g., Informatica, Talend, Apache NiFi) and data modelling knowledge.
- API integration skills for effective data flow management.
b. Project Management & Communication:
- Proven ability to lead large-scale data projects and manage project timelines.
- Excellent communication, presentation, and critical thinking skills.
c. Client & Team Leadership:
- Engage effectively with clients and partners, leading onsite and offshore teams.
Secondary Skills: Streaming, Archiving, AWS/Azure/Cloud
Role:
- Should have strong programming and support experience in Java and J2EE technologies
- Should have good experience in Core Java, JSP, Servlets, and JDBC
- Good exposure to Hadoop development (HDFS, MapReduce, Hive, HBase, Spark)
- Should have 2+ years of Java experience and 1+ years of experience in Hadoop
- Should possess good communication skills
- Architectural Leadership:
- Design and architect robust, scalable, and high-performance Hadoop solutions.
- Define and implement data architecture strategies, standards, and processes.
- Collaborate with senior leadership to align data strategies with business goals.
- Technical Expertise:
- Develop and maintain complex data processing systems using Hadoop and its ecosystem (HDFS, YARN, MapReduce, Hive, HBase, Pig, etc.).
- Ensure optimal performance and scalability of Hadoop clusters.
- Oversee the integration of Hadoop solutions with existing data systems and third-party applications.
- Strategic Planning:
- Develop long-term plans for data architecture, considering emerging technologies and future trends.
- Evaluate and recommend new technologies and tools to enhance the Hadoop ecosystem.
- Lead the adoption of big data best practices and methodologies.
- Team Leadership and Collaboration:
- Mentor and guide data engineers and developers, fostering a culture of continuous improvement.
- Work closely with data scientists, analysts, and other stakeholders to understand requirements and deliver high-quality solutions.
- Ensure effective communication and collaboration across all teams involved in data projects.
- Project Management:
- Lead large-scale data projects from inception to completion, ensuring timely delivery and high quality.
- Manage project resources, budgets, and timelines effectively.
- Monitor project progress and address any issues or risks promptly.
- Data Governance and Security:
- Implement robust data governance policies and procedures to ensure data quality and compliance.
- Ensure data security and privacy by implementing appropriate measures and controls.
- Conduct regular audits and reviews of data systems to ensure compliance with industry standards and regulations.
Title: Platform Engineer | Location: Chennai | Work Mode: Hybrid (Remote and Chennai Office) | Experience: 4+ years | Budget: 16-18 LPA
Responsibilities:
- Parse data using Python and create dashboards in Tableau.
- Utilize Jenkins for Airflow pipeline creation and CI/CD maintenance.
- Migrate DataStage jobs to Snowflake and optimize performance.
- Work with HDFS, Hive, Kafka, and basic Spark.
- Develop Python scripts for data parsing, quality checks, and visualization.
- Conduct unit testing and web application testing.
- Implement Apache Airflow and handle production migration (a minimal DAG sketch follows this list).
- Apply data warehousing techniques for data cleansing and dimension modeling.
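To make the Airflow responsibility concrete, here is a minimal sketch of a daily DAG that runs a Python parsing task. It assumes Airflow 2.4+; the DAG id, schedule, and `parse_records` logic are hypothetical placeholders, not details from the posting.

```python
# Minimal Airflow 2.x (2.4+) DAG sketch: a daily pipeline with one Python parsing task.
# DAG id, schedule, and the parsing logic are illustrative placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def parse_records():
    """Hypothetical parsing step: keep only well-formed CSV rows."""
    raw_rows = ["1,42", "2,17", "bad-row"]
    parsed = [row.split(",") for row in raw_rows if row.count(",") == 1]
    print(f"parsed {len(parsed)} of {len(raw_rows)} rows")


with DAG(
    dag_id="daily_data_parse",        # hypothetical name
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    PythonOperator(task_id="parse_records", python_callable=parse_records)
```

In practice, a Jenkins job would lint, test, and deploy DAG files like this one to the Airflow environment as part of CI/CD.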
Requirements:
- 4+ years of experience as a Platform Engineer.
- Strong Python skills, knowledge of Tableau.
- Experience with Jenkins, Snowflake, HDFS, Hive, and Kafka.
- Proficient in Unix Shell Scripting and SQL.
- Familiarity with ETL tools like DataStage and DMExpress.
- Understanding of Apache Airflow.
- Strong problem-solving and communication skills.
Note: Only candidates willing to work in Chennai and available for immediate joining will be considered. Budget for this position is 16 - 18 LPA.
As Conviva is expanding, we are building products providing deep insights into end-user experience for our customers.
Platform and TLB Team
The vision for the TLB team is to build data processing software that works on terabytes of streaming data in real-time. Engineer the next-gen Spark-like system for in-memory computation of large time-series datasets – both Spark-like backend infra and library-based programming model. Build a horizontally and vertically scalable system that analyses trillions of events per day within sub-second latencies. Utilize the latest and greatest big data technologies to build solutions for use cases across multiple verticals. Lead technology innovation and advancement that will have a big business impact for years to come. Be part of a worldwide team building software using the latest technologies and the best of software development tools and processes.
What You’ll Do
This is an individual contributor position. Expectations will be on the below lines:
- Design, build, and maintain the stream processing and time-series analysis system that is at the heart of Conviva’s products
- Responsible for the architecture of the Conviva platform
- Build features, enhancements, new services, and bug fixing in Scala and Java on a Jenkins-based pipeline to be deployed as Docker containers on Kubernetes
- Own the entire lifecycle of your microservice including early specs, design, technology choice, development, unit-testing, integration-testing, documentation, deployment, troubleshooting, enhancements, etc.
- Lead a team to develop a feature or parts of a product
- Adhere to the Agile model of software development to plan, estimate, and ship per business priority
What you need to succeed
- 5+ years of work experience in software development of data processing products.
- Engineering degree in software or equivalent from a premier institute.
- Excellent knowledge of fundamentals of Computer Science like algorithms and data structures. Hands-on with functional programming and know-how of its concepts
- Excellent programming and debugging skills on the JVM. Proficient in writing code in Scala/Java/Rust/Haskell/Erlang that is reliable, maintainable, secure, and performant
- Experience with big data technologies like Spark, Flink, Kafka, Druid, HDFS, etc.
- Deep understanding of distributed systems concepts and scalability challenges including multi-threading, concurrency, sharding, partitioning, etc.
- Experience/knowledge of Akka/Lagom framework and/or stream processing technologies like RxJava or Project Reactor will be a big plus. Knowledge of design patterns like event-streaming, CQRS and DDD to build large microservice architectures will be a big plus
- Excellent communication skills. Willingness to work under pressure. Hunger to learn and succeed. Comfortable with ambiguity. Comfortable with complexity
Underpinning the Conviva platform is a rich history of innovation. More than 60 patents represent award-winning technologies and standards, including first-of-its-kind innovations like time-state analytics and AI-automated data modeling that surface actionable insights. By understanding real-world human experiences and having the ability to act within seconds of observation, our customers can solve business-critical issues and focus on growing their business ahead of the competition. Examples of the brands Conviva has helped fuel streaming growth for include: DAZN, Disney+, HBO, Hulu, NBCUniversal, Paramount+, Peacock, Sky, Sling TV, Univision and Warner Bros Discovery.
Privately held, Conviva is headquartered in Silicon Valley, California with offices and people around the globe. For more information, visit us at www.conviva.com. Join us to help extend our leadership position in big data streaming analytics to new audiences and markets!
A multinational company providing energy and automation digital solutions
Roles and Responsibilities
Our company is seeking to hire a skilled software developer to help with the development of our AI/ML platform. Your duties will primarily revolve around building the platform by writing code in Scala, as well as modifying the platform to fix errors, work on distributed computing, adapt it to new cloud services, improve its performance, or upgrade interfaces. To be successful in this role, you will need extensive knowledge of programming languages and the software development life cycle.
Responsibilities:
- Analyze, design, develop, troubleshoot, and debug the platform.
- Write code, guide other team members on best practices, and perform testing and debugging of applications.
- Specify, design, and implement minor changes to existing software architecture. Build highly complex enhancements and resolve complex bugs. Build and execute unit tests and unit test plans.
- Duties and tasks are varied and complex, needing independent judgment. Fully competent in own area of expertise.
Experience:
The candidate should have about 2+ years of experience with design and development in Java/Scala. Experience in algorithms, distributed systems, data structures, databases, and distributed system architectures is mandatory.
Required Skills:
1. In-depth knowledge of Hadoop and Spark architecture and their components such as HDFS, YARN, and executor, core, and memory parameters (see the configuration sketch after this list).
2. Knowledge of Scala/Java.
3. Extensive experience in developing Spark jobs. Should possess good OOP knowledge and be aware of enterprise application design patterns.
4. Good knowledge of Unix/Linux.
5. Experience working on large-scale software projects.
6. Keep an eye out for technological trends and open-source projects that can be used.
7. Knowledge of common programming languages and frameworks.
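To make item 1 concrete, here is a minimal, hypothetical sketch of how executor, core, and memory parameters are commonly set when building a Spark session from PySpark. The values are placeholders to be tuned per cluster and workload, not recommendations from this posting.

```python
# Minimal sketch: common executor/core/memory parameters for a Spark job.
# All values are illustrative placeholders.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("example-job")                        # hypothetical app name
    .config("spark.executor.instances", "4")       # number of executors
    .config("spark.executor.cores", "4")           # cores per executor
    .config("spark.executor.memory", "8g")         # heap per executor
    .config("spark.driver.memory", "4g")           # driver heap
    .config("spark.sql.shuffle.partitions", "200") # shuffle parallelism
    .getOrCreate()
)

df = spark.range(1_000_000)  # toy dataset just to exercise the session
print(df.groupBy((df.id % 10).alias("bucket")).count().collect())
spark.stop()
```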
Job Description
Mandatory Requirements
- Experience in AWS Glue
- Experience in Apache Parquet
- Proficient in AWS S3 and data lake
- Knowledge of Snowflake
- Understanding of file-based ingestion best practices.
- Scripting languages - Python & PySpark
CORE RESPONSIBILITIES
- Create and manage cloud resources in AWS.
- Data ingestion from different data sources that expose data using different technologies, such as RDBMS, flat files, streams, and time-series data based on various proprietary systems. Implement data ingestion and processing with the help of big data technologies.
- Data processing/transformation using various technologies such as Spark and cloud services. You will need to understand your part of the business logic and implement it using the language supported by the base data platform.
- Develop automated data quality checks to make sure the right data enters the platform and to verify the results of calculations (a minimal sketch follows this list).
- Develop an infrastructure to collect, transform, combine, and publish/distribute customer data.
- Define process improvement opportunities to optimize data collection, insights, and displays.
- Ensure data and results are accessible, scalable, efficient, accurate, complete, and flexible.
- Identify and interpret trends and patterns from complex data sets.
- Construct a framework utilizing data visualization tools and techniques to present consolidated analytical and actionable results to relevant stakeholders.
- Key participant in regular Scrum ceremonies with the agile teams.
- Proficient at developing queries, writing reports, and presenting findings.
- Mentor junior members and bring best industry practices.
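As an illustration of the ingestion and data quality responsibilities above, here is a minimal, hypothetical PySpark sketch that reads a Parquet dataset from S3, applies a simple completeness check, and writes the curated output. The bucket, paths, column name, and threshold are assumptions for illustration only.

```python
# Minimal sketch: ingest Parquet data from S3 and run a basic quality check
# before publishing. Bucket, paths, column name, and threshold are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("ingest-with-quality-check").getOrCreate()

df = spark.read.parquet("s3://example-bucket/raw/customers/")   # hypothetical path

total = df.count()
null_ids = df.filter(F.col("customer_id").isNull()).count()     # hypothetical key column

# Simple rule: fail the load if more than 1% of rows are missing the key.
if total == 0 or null_ids / total > 0.01:
    raise ValueError(f"quality check failed: {null_ids}/{total} rows missing customer_id")

df.write.mode("overwrite").parquet("s3://example-bucket/curated/customers/")
spark.stop()
```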
QUALIFICATIONS
- 5-7+ years’ experience as a data engineer in consumer finance or an equivalent industry (consumer loans, collections, servicing, optional products, and insurance sales)
- Strong background in math, statistics, computer science, data science, or a related discipline
- Advanced knowledge of one of the following languages: Java, Scala, Python, C#
- Production experience with: HDFS, YARN, Hive, Spark, Kafka, Oozie / Airflow, Amazon Web Services (AWS), Docker / Kubernetes, Snowflake
- Proficient with:
  - Data mining/programming tools (e.g., SAS, SQL, R, Python)
  - Database technologies (e.g., PostgreSQL, Redshift, Snowflake, and Greenplum)
  - Data visualization tools (e.g., Tableau, Looker, MicroStrategy)
- Comfortable learning about and deploying new technologies and tools.
- Organizational skills and the ability to handle multiple projects and priorities simultaneously and meet established deadlines.
- Good written and oral communication skills and the ability to present results to non-technical audiences.
- Knowledge of business intelligence and analytical tools, technologies, and techniques.
Familiarity and experience in the following is a plus:
- AWS certification
- Spark Streaming
- Kafka Streams / Kafka Connect
- ELK Stack
- Cassandra / MongoDB
- CI/CD: Jenkins, GitLab, Jira, Confluence, and other related tools
Responsibilities :
- Provide Support Services to our Gold & Enterprise customers using our flagship product suites. This may include assistance provided during the engineering and operations of distributed systems as well as responses for mission-critical systems and production customers.
- Lead end-to-end delivery and customer success of next-generation features related to scalability, reliability, robustness, usability, security, and performance of the product
- Lead and mentor others about concurrency, parallelization to deliver scalability, performance, and resource optimization in a multithreaded and distributed environment
- Demonstrate the ability to actively listen to customers and show empathy to the customer’s business impact when they experience issues with our products
Required Skills:
- 10+ years of experience with a highly scalable, distributed, multi-node environment (100+ nodes)
- Hadoop operation including Zookeeper, HDFS, YARN, Hive, and related components like the Hive metastore, Cloudera Manager/Ambari, etc
- Authentication and security configuration and tuning (KNOX, LDAP, Kerberos, SSL/TLS, second priority: SSO/OAuth/OIDC, Ranger/Sentry)
- Java troubleshooting, e.g., collection and evaluation of jstacks, heap dumps
- Linux, NFS, Windows, including application installation, scripting, basic command line
- Docker and Kubernetes configuration and troubleshooting, including Helm charts, storage options, logging, and basic kubectl CLI
- Experience working with scripting languages (Bash, PowerShell, Python)
- Working knowledge of application, server, and network security management concepts
- Familiarity with virtual machine technologies
- Knowledge of databases like MySQL and PostgreSQL
- Certification on any of the leading cloud providers (AWS, Azure, GCP) and/or Kubernetes is a big plus
Senior SRE - Acceldata (IC3 Level)
About the Job
You will join a team of highly skilled engineers who are responsible for delivering Acceldata’s support services. Our Site Reliability Engineers are trained to be active listeners and demonstrate empathy when customers encounter product issues. In our fun and collaborative environment, Site Reliability Engineers develop strong business, interpersonal, and technical skills to deliver high-quality service to our valued customers.
When you arrive for your first day, we’ll want you to have:
- Solid troubleshooting skills: using a logical, systematic search for the source of a problem in order to repair failed products or processes and make them operational again
- A strong ability to understand the feelings of our customers as we empathize with them on the issue at hand
- A strong desire to increase your product and technology skillset and your confidence supporting our products so you can help our customers succeed
In this position you will…
- Provide Support Services to our Gold & Enterprise customers using our flagship Acceldata Pulse, Flow & Torch product suites. This may include assistance provided during the engineering and operations of distributed systems as well as responses for mission-critical systems and production customers.
- Demonstrate the ability to actively listen to customers and show empathy to the customer’s business impact when they experience issues with our products
- Participate in the queue management and coordination process by owning customer escalations and managing the unassigned queue.
- Be involved with and work on other support-related activities, such as performing POCs and assisting with onboarding deployments of Acceldata and Hadoop distribution products.
- Triage, diagnose and escalate customer inquiries when applicable during their engineering and operations efforts.
- Collaborate and share solutions with both customers and the Internal team.
- Investigate product related issues both for particular customers and for common trends that may arise
- Study and understand critical system components and large cluster operations
- Differentiate between issues that arise in operations, user code, or product
- Coordinate enhancement and feature requests with product management and Acceldata engineering team.
- Be flexible about working in shifts.
- Participate in a rotational weekend on-call roster for critical support needs.
- Participate as a designated or dedicated engineer for specific customers. Aspects of this engagement translate to building long-term successful relationships with customers, leading weekly status calls, and occasional visits to customer sites.
In this position, you should have…
- A strong desire and aptitude to become a well-rounded support professional. Acceldata Support considers the service we deliver as our core product.
- A positive attitude towards feedback and continual improvement
- A willingness to give direct feedback to and partner with management to improve team operations
- A tenacity to bring calm and order to the often stressful situations of customer cases
- A mental capability to multi-task across many customer situations simultaneously
- Bachelor’s degree in Computer Science or Engineering or equivalent experience. A Master’s degree is a plus.
- 2+ years of experience with at least one of the following cloud platforms: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform (GCP), including experience managing and supporting a cloud infrastructure on any of the three platforms. Knowledge of Kubernetes and Docker is also a must.
- Strong troubleshooting skills (e.g., TCP/IP, DNS, file systems, load balancing, databases, Java)
- Excellent communication skills in English (written and verbal)
- Prior enterprise support experience in a technical environment strongly preferred
Strong Hands-on Experience Working With Or Supporting The Following
- 8-12 years of experience with a highly scalable, distributed, multi-node environment (50+ nodes)
- Hadoop operation including Zookeeper, HDFS, YARN, Hive, and related components like the Hive metastore, Cloudera Manager/Ambari, etc
- Authentication and security configuration and tuning (KNOX, LDAP, Kerberos, SSL/TLS, second priority: SSO/OAuth/OIDC, Ranger/Sentry)
- Java troubleshooting, e.g., collection and evaluation of jstacks, heap dumps
You might also have…
- Linux, NFS, Windows, including application installation, scripting, basic command line
- Docker and Kubernetes configuration and troubleshooting, including Helm charts, storage options, logging, and basic kubectl CLI
- Experience working with scripting languages (Bash, PowerShell, Python)
- Working knowledge of application, server, and network security management concepts
- Familiarity with virtual machine technologies
- Knowledge of databases like MySQL and PostgreSQL
- Certification on any of the leading cloud providers (AWS, Azure, GCP) and/or Kubernetes is a big plus
The right person in this role has an opportunity to make a huge impact at Acceldata and add value to our future decisions. If this position has piqued your interest and you have what we described - we invite you to apply! An adventure in data awaits.
Learn more at https://www.acceldata.io/about-us
AI-powered cloud-based SaaS solution provider
● Able to contribute to the gathering of functional requirements, developing technical specifications, and test case planning
● Demonstrating technical expertise, and solving challenging programming and design problems
● 60% hands-on coding with architecture ownership of one or more products
● Ability to articulate architectural and design options, and educate development teams and business users
● Resolve defects/bugs during QA testing, pre-production, production, and post-release patches
● Mentor and guide team members
● Work cross-functionally with various Bidgely teams including product management, QA/QE, various product lines, and/or business units to drive forward results
Requirements
● BS/MS in computer science or equivalent work experience
● 8-12 years’ experience designing and developing applications in Data Engineering
● Hands-on experience with big data ecosystems.
● Past experience with Hadoop, HDFS, MapReduce, YARN, AWS Cloud, EMR, S3, Spark, Cassandra, Kafka, and Zookeeper
● Expertise with any of the following object-oriented languages (OOD): Java/J2EE, Scala, Python
● Ability to lead and mentor technical team members
● Expertise with the entire Software Development Life Cycle (SDLC)
● Excellent communication skills: Demonstrated ability to explain complex technical issues to both technical and non-technical audiences
● Expertise in the Software design/architecture process
● Expertise with unit testing & Test-Driven Development (TDD)
● Business Acumen - strategic thinking & strategy development
● Experience on Cloud or AWS is preferable
● Have a good understanding and ability to develop software, prototypes, or proofs of concept (POCs) for various Data Engineering requirements.
● Experience with Agile Development, SCRUM, or Extreme Programming methodologies
● Able to contribute to the gathering of functional requirements, developing technical specifications, and project & test planning
● Demonstrating technical expertise, and solving challenging programming and design problems
● Roughly 80% hands-on coding
● Generate technical documentation and PowerPoint presentations to communicate architectural and design options, and educate development teams and business users
● Resolve defects/bugs during QA testing, pre-production, production, and post-release patches
● Work cross-functionally with various Bidgely teams including: product management, QA/QE, various product lines, and/or business units to drive forward results
Requirements
● BS/MS in computer science or equivalent work experience
● 2-4 years’ experience designing and developing applications in Data Engineering
● Hands-on experience with big data ecosystems.
● Hadoop, HDFS, MapReduce, YARN, AWS Cloud, EMR, S3, Spark, Cassandra, Kafka, Zookeeper
● Expertise with any of the following object-oriented languages (OOD): Java/J2EE, Scala, Python
● Strong leadership experience: Leading meetings, presenting if required
● Excellent communication skills: Demonstrated ability to explain complex technical issues to both technical and non-technical audiences
● Expertise in the Software design/architecture process
● Expertise with unit testing & Test-Driven Development (TDD)
● Experience on Cloud or AWS is preferable
● Have a good understanding and ability to develop software, prototypes, or proofs of concept (POCs) for various Data Engineering requirements.
Location: Bangalore
Function: Software Engineering → Backend Development
We are looking for an extraordinary and dynamic Director of Engineering to be part of its Engineering team in Bangalore. You must have a good record of architecting scalable solutions, hiring and mentoring talented teams and working with product managers to build great products. You must be highly analytical and a good problem solver. You will be part of a highly energetic and innovative team that believes nothing is impossible with some creativity and hard work.
Responsibilities:
- Own the overall solution design and implementation for backend systems. This includes requirement analysis, scope discussion, design, architecture, implementation, delivery and resolving production issues related to engineering.
- Owner of the technology roadmap of our products from a core backend engineering perspective.
- Ability to guide the team in debugging production issues and write best-of-breed code.
- Drive engineering excellence (defects, productivity through automation, performance of products, etc.) through clearly defined metrics.
- Stay current with the latest tools, technology ideas and methodologies; share knowledge by clearly articulating results and ideas to key decision makers.
- Hiring, mentoring, and retaining a very talented team.
Requirements:
- 12 - 20 years of strong experience in product development.
- Strong experience in building data-engineering-intensive backends (NoSQL DBs, HDFS, Kafka, Cassandra, Elasticsearch, Spark, etc.).
- Excellent track record of designing and delivering system architecture, implementation, and deployment of successful solutions in a customer-facing role.
- Strong problem-solving and analytical skills.
- Ability to influence decision making through data and be metric driven.
- Strong understanding of non-functional requirements like security, test automation etc.
- Fluency in Java, Spring, Hibernate, J2EE, REST Services.
- Ability to hire, mentor and retain best-of-breed engineers.
- Exposure to Agile development methodologies.
- Ability to collaborate across teams and strong interpersonal skills.
- SaaS experience is a plus.
We are looking for an outstanding Big Data Engineer with experience setting up and maintaining Data Warehouses and Data Lakes for an organization. This role will closely collaborate with the Data Science team and help them build and deploy machine learning and deep learning models on big data analytics platforms.
Roles and Responsibilities:
- Develop and maintain scalable data pipelines and build out new integrations and processes required for optimal extraction, transformation, and loading of data from a wide variety of data sources using 'Big Data' technologies.
- Develop programs in Scala and Python as part of data cleaning and processing.
- Assemble large, complex data sets that meet functional and non-functional business requirements, fostering data-driven decision making across the organization.
- Responsible for designing and developing distributed, high-volume, high-velocity, multi-threaded event processing systems.
- Implement processes and systems to validate data and monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
- Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
- Provide high operational excellence, guaranteeing high availability and platform stability.
- Closely collaborate with the Data Science team and help them build and deploy machine learning and deep learning models on big data analytics platforms.
Skills:
- Experience with Big Data pipeline, Big Data analytics, Data warehousing.
- Experience with SQL/No-SQL, schema design and dimensional data modeling.
- Strong understanding of Hadoop architecture and the HDFS ecosystem, and experience with a big data technology stack such as HBase, Hadoop, Hive, MapReduce.
- Experience in designing systems that process structured as well as unstructured data at large scale.
- Experience in AWS/Spark/Java/Scala/Python development.
- Should have strong skills in PySpark (Python and Spark). Ability to create, manage, and manipulate Spark DataFrames. Expertise in Spark query tuning and performance optimization (a minimal sketch follows this skills list).
- Experience in developing efficient software code/frameworks for multiple use cases leveraging Python and big data technologies.
- Prior exposure to streaming data sources such as Kafka.
- Should have knowledge on Shell Scripting and Python scripting.
- High proficiency in database skills (e.g., Complex SQL), for data preparation, cleaning, and data wrangling/munging, with the ability to write advanced queries and create stored procedures.
- Experience with NoSQL databases such as Cassandra / MongoDB.
- Solid experience in all phases of Software Development Lifecycle - plan, design, develop, test, release, maintain and support, decommission.
- Experience with DevOps tools (GitHub, Travis CI, and JIRA) and methodologies (Lean, Agile, Scrum, Test Driven Development).
- Experience building and deploying applications on on-premise and cloud-based infrastructure.
- Having a good understanding of machine learning landscape and concepts.
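To illustrate the PySpark skills called out above, here is a minimal, hypothetical sketch that creates a DataFrame, manipulates it, and inspects the physical plan, which is a typical first step when tuning a query. Column names and data are illustrative only.

```python
# Minimal PySpark sketch: create a DataFrame, transform it, and inspect the plan.
# Column names and rows are illustrative placeholders.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("pyspark-dataframe-demo").getOrCreate()

events = spark.createDataFrame(
    [("u1", "click", 3), ("u1", "view", 1), ("u2", "click", 7)],
    ["user_id", "event_type", "count"],
)

clicks_per_user = (
    events.filter(F.col("event_type") == "click")
          .groupBy("user_id")
          .agg(F.sum("count").alias("clicks"))
)

clicks_per_user.cache()    # reuse across actions instead of recomputing
clicks_per_user.explain()  # inspect the physical plan when tuning
clicks_per_user.show()
spark.stop()
```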
Qualifications and Experience:
Engineering graduates and postgraduates, preferably in Computer Science from premier institutions, with 3-5 years of proven work experience as a Big Data Engineer or in a similar role.
Certifications:
Good to have at least one of the Certifications listed here:
AZ-900 - Azure Fundamentals
DP-200, DP-201, DP-203, AZ-204 - Data Engineering
AZ-400 - DevOps Certification
Responsibilities
- Responsible for implementation and ongoing administration of Hadoop infrastructure.
- Aligning with the systems engineering team to propose and deploy new hardware and software environments required for Hadoop and to expand existing environments.
- Working with data delivery teams to set up new Hadoop users. This includes setting up Linux users, setting up Kerberos principals, and testing HDFS, Hive, Pig, and MapReduce access for the new users (a minimal onboarding sketch follows this list).
- Cluster maintenance as well as creation and removal of nodes using tools like Ganglia, Nagios, Cloudera Manager Enterprise, Dell OpenManage, and other tools.
- Performance tuning of Hadoop clusters and Hadoop MapReduce routines.
- Screen Hadoop cluster job performance and capacity planning.
- Monitor Hadoop cluster connectivity and security.
- Manage and review Hadoop log files.
- File system management and monitoring.
- Diligently teaming with the infrastructure, network, database, application, and business intelligence teams to guarantee high data quality and availability.
- Collaboration with application teams to install operating system and Hadoop updates, patches, and version upgrades when required.
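To make the user-onboarding bullet concrete, here is a minimal, hypothetical Python sketch that wraps the usual CLI steps on a kerberized cluster: create the Kerberos principal, export a keytab, and smoke-test HDFS access as the new user. The user name, realm, and keytab path are placeholders; a real rollout would also create the Linux account and set HDFS permissions and quotas, and creating the HDFS home directory typically requires superuser privileges.

```python
# Minimal onboarding sketch for a kerberized Hadoop cluster. All names and paths
# are hypothetical placeholders; run the kadmin steps on the KDC as an admin.
import subprocess

USER = "analyst1"                     # hypothetical new user
REALM = "EXAMPLE.COM"                 # hypothetical Kerberos realm
KEYTAB = f"/etc/security/keytabs/{USER}.keytab"

def run(cmd):
    print("+", " ".join(cmd))
    subprocess.run(cmd, check=True)

# Create the principal and export its keytab.
run(["kadmin.local", "-q", f"addprinc -randkey {USER}@{REALM}"])
run(["kadmin.local", "-q", f"ktadd -k {KEYTAB} {USER}@{REALM}"])

# Authenticate as the new user and smoke-test HDFS access.
run(["kinit", "-kt", KEYTAB, f"{USER}@{REALM}"])
run(["hdfs", "dfs", "-mkdir", "-p", f"/user/{USER}"])
run(["hdfs", "dfs", "-ls", f"/user/{USER}"])
```

A similar smoke test against Hive, Pig, and MapReduce (for example, running a trivial query or sample job as the new principal) would complete the access verification.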
Qualifications
- Bachelor's degree in Information Technology, Computer Science, or other relevant fields
- General operational expertise such as good troubleshooting skills and an understanding of system capacity, bottlenecks, and the basics of memory, CPU, OS, storage, and networks.
- Hadoop skills like HBase, Hive, Pig, Mahout
- Ability to deploy a Hadoop cluster, add and remove nodes, keep track of jobs, monitor critical parts of the cluster, configure NameNode high availability, schedule and configure jobs, and take backups.
- Good knowledge of Linux, as Hadoop runs on Linux.
- Familiarity with open source configuration management and deployment tools such as Puppet or Chef, and Linux scripting.
Nice to Have
- Knowledge of troubleshooting core Java applications is a plus.
About the Role
The Dremio India team owns the DataLake Engine along with the cloud infrastructure and services that power it. With a focus on next-generation data analytics supporting modern table formats like Iceberg and Delta Lake, open source initiatives such as Apache Arrow and Project Nessie, and hybrid-cloud infrastructure, this team provides many opportunities to learn, deliver, and grow in your career. We are looking for technical leaders with passion and experience in architecting and delivering high-quality distributed systems at massive scale.
Responsibilities & ownership
- Lead end-to-end delivery and customer success of next-generation features related to scalability, reliability, robustness, usability, security, and performance of the product
- Lead and mentor others about concurrency, parallelization to deliver scalability, performance and resource optimization in a multithreaded and distributed environment
- Propose and promote strategic company-wide tech investments taking care of business goals, customer requirements, and industry standards
- Lead the team to solve complex, unknown and ambiguous problems, and customer issues cutting across team and module boundaries with technical expertise, and influence others
- Review and influence designs of other team members
- Design and deliver architectures that run optimally on public clouds like GCP, AWS, and Azure
- Partner with other leaders to nurture innovation and engineering excellence in the team
- Drive priorities with others to facilitate timely accomplishments of business objectives
- Perform RCA of customer issues and drive investments to avoid similar issues in future
- Collaborate with Product Management, Support, and field teams to ensure that customers are successful with Dremio
- Proactively suggest learning opportunities about new technology and skills, and be a role model for constant learning and growth
Requirements
- B.S./M.S/Equivalent in Computer Science or a related technical field or equivalent experience
- Fluency in Java/C++ with 15+ years of experience developing production-level software
- Strong foundation in data structures, algorithms, multi-threaded and asynchronous programming models and their use in developing distributed and scalable systems
- 8+ years of experience in developing complex and scalable distributed systems and delivering, deploying, and managing microservices successfully
- Subject Matter Expert in one or more of query processing or optimization, distributed systems, concurrency, microservice-based architectures, data replication, networking, storage systems
- Experience in taking company-wide initiatives, convincing stakeholders, and delivering them
- Expert in solving complex, unknown and ambiguous problems spanning across teams and taking initiative in planning and delivering them with high quality
- Ability to anticipate and propose plan/design changes based on changing requirements
- Passion for quality, zero downtime upgrades, availability, resiliency, and uptime of the platform
- Passion for learning and delivering using latest technologies
- Hands-on experience working on projects on AWS, Azure, and GCP
- Experience with containers and Kubernetes for orchestration and container management in private and public clouds (AWS, Azure, and GCP)
- Understanding of distributed file systems such as S3, ADLS or HDFS
- Excellent communication skills and affinity for collaboration and teamwork
2. Perform data migration and conversion activities.
3. Develop and integrate software applications using suitable development methodologies and standards, applying standard architectural patterns, taking into account critical performance characteristics and security measures.
4. Collaborate with Business Analysts, Architects and Senior Developers to establish the physical application framework (e.g. libraries, modules, execution environments).
5. Perform end-to-end automation of ETL processes for various datasets that are being ingested into the big data platform.
Good understanding of, or hands-on experience with, Kafka administration / Apache Kafka Streaming.
Implementing, managing, and administering the overall Hadoop infrastructure.
Takes care of the day-to-day running of Hadoop clusters.
A Hadoop administrator will have to work closely with the database team, network team, BI team, and application teams to make sure that all the big data applications are highly available and performing as expected.
If working with the open source Apache distribution, Hadoop admins have to manually set up all the configuration files - core-site.xml, hdfs-site.xml, yarn-site.xml, and mapred-site.xml. However, when working with a popular Hadoop distribution like Hortonworks, Cloudera, or MapR, the configuration files are set up on startup and the Hadoop admin need not configure them manually.
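For reference, here is a minimal sketch of what such hand-edited site files contain on an Apache-distribution cluster; the hostname, port, and values are illustrative placeholders only.

```xml
<!-- core-site.xml (illustrative fragment; hostname and port are placeholders) -->
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://namenode.example.com:8020</value>
  </property>
</configuration>

<!-- hdfs-site.xml (illustrative fragment) -->
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>3</value>
  </property>
</configuration>
```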
The Hadoop admin is responsible for capacity planning and estimating the requirements for lowering or increasing the capacity of the Hadoop cluster.
The Hadoop admin is also responsible for deciding the size of the Hadoop cluster based on the data to be stored in HDFS.
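As a back-of-the-envelope illustration of that sizing exercise, the sketch below estimates raw HDFS capacity from the expected data volume, the replication factor, and a headroom allowance. All numbers are assumptions for illustration, not figures from this posting.

```python
# Rough HDFS sizing sketch; every number here is an illustrative assumption.
data_tb = 100          # logical data expected in HDFS, in TB
replication = 3        # HDFS replication factor
headroom = 0.25        # fraction of capacity kept free for growth and temp data

raw_tb = data_tb * replication / (1 - headroom)
print(f"raw capacity needed: {raw_tb:.0f} TB")   # 100 * 3 / 0.75 = 400 TB
```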
Ensure that the Hadoop cluster is up and running at all times.
Monitoring the cluster connectivity and performance.
Manage and review Hadoop log files.
Backup and recovery tasks
Resource and security management
Troubleshooting application errors and ensuring that they do not occur again.
REQUIREMENT:
- Previous experience of working in large-scale data engineering.
- 4+ years of experience working in data engineering and/or backend technologies; cloud experience (any) is mandatory.
- Previous experience of architecting and designing backends for large-scale data processing.
- Familiarity and experience working with different technologies related to data engineering - different database technologies, Hadoop, Spark, Storm, Hive, etc.
- Hands-on and have the ability to contribute a key portion of data engineering backend.
- Self-inspired and motivated to drive for exceptional results.
- Familiarity and experience working with different stages of data engineering – data acquisition, data refining, large scale data processing, efficient data storage for business analysis.
- Familiarity and experience working with different DB technologies and how to scale them.
RESPONSIBILITY:
- End-to-end responsibility for data engineering architecture, design, development, and implementation.
- Build data engineering workflow for large scale data processing.
- Discover opportunities in data acquisition.
- Bring industry best practices for data engineering workflow.
- Develop data set processes for data modelling, mining and production.
- Take additional tech responsibilities for driving an initiative to completion
- Recommend ways to improve data reliability, efficiency and quality
- Goes out of their way to reduce complexity.
- Humble and outgoing - engineering cheerleaders.