Apache hadoop jobs

17+ Apache Hadoop Jobs in India

Apply to 17+ Apache Hadoop Jobs on CutShort.io. Find your next job, effortlessly. Browse Apache Hadoop Jobs and apply today!

Senior data architect

at Talent Pro

Posted by Mayank choudhary

Remote only

11 - 18 yrs

₹70L - ₹80L / yr

Java

Go Programming (Golang)

NodeJS (Node.js)

Python

Apache Kafka

+7 more

Role & Responsibilities

Lead and mentor a team of data engineers, ensuring high performance and career growth.

Architect and optimize scalable data infrastructure, ensuring high availability and reliability.

Drive the development and implementation of data governance frameworks and best practices.

Work closely with cross-functional teams to define and execute a data roadmap.

Optimize data processing workflows for performance and cost efficiency.

Ensure data security, compliance, and quality across all data platforms.

Foster a culture of innovation and technical excellence within the data team.

Role & Responsibilities

Lead and mentor a team of data engineers, ensuring high performance and career growth.

Architect and optimize scalable data infrastructure, ensuring high availability and reliability.

Drive the development and implementation of data governance frameworks and best practices.

Work closely with cross-functional teams to define and execute a data roadmap.

Optimize data processing workflows for performance and cost efficiency.

Ensure data security, compliance, and quality across all data platforms.

Foster a culture of innovation and technical excellence within the data team.

ML Ops Engineer

at JK Technosoft Ltd

Posted by Nishu Gupta

Bengaluru (Bangalore)

3 - 5 yrs

₹5L - ₹15L / yr

Data Science

Machine Learning (ML)

Natural Language Processing (NLP)

Computer Vision

recommendation algorithm

+13 more

Roles and Responsibilities:

Design, develop, and maintain the end-to-end MLOps infrastructure from the ground up, leveraging open-source systems across the entire MLOps landscape.
Creating pipelines for data ingestion, data transformation, building, testing, and deploying machine learning models, as well as monitoring and maintaining the performance of these models in production.
Managing the MLOps stack, including version control systems, continuous integration and deployment tools, containerization, orchestration, and monitoring systems.
Ensure that the MLOps stack is scalable, reliable, and secure.

Skills Required:

3-6 years of MLOps experience
Preferably worked in the startup ecosystem

Primary Skills:

Experience with E2E MLOps systems like ClearML, Kubeflow, MLFlow etc.
Technical expertise in MLOps: Should have a deep understanding of the MLOps landscape and be able to leverage open-source systems to build scalable, reliable, and secure MLOps infrastructure.
Programming skills: Proficient in at least one programming language, such as Python, and have experience with data science libraries, such as TensorFlow, PyTorch, or Scikit-learn.
DevOps experience: Should have experience with DevOps tools and practices, such as Git, Docker, Kubernetes, and Jenkins.

Secondary Skills:

Version Control Systems (VCS) tools like Git and Subversion
Containerization technologies like Docker and Kubernetes
Cloud Platforms like AWS, Azure, and Google Cloud Platform
Data Preparation and Management tools like Apache Spark, Apache Hadoop, and SQL databases like PostgreSQL and MySQL
Machine Learning Frameworks like TensorFlow, PyTorch, and Scikit-learn
Monitoring and Logging tools like Prometheus, Grafana, and Elasticsearch
Continuous Integration and Continuous Deployment (CI/CD) tools like Jenkins, GitLab CI, and CircleCI
Explain ability and Interpretability tools like LIME and SHAP

Roles and Responsibilities:

Design, develop, and maintain the end-to-end MLOps infrastructure from the ground up, leveraging open-source systems across the entire MLOps landscape.
Creating pipelines for data ingestion, data transformation, building, testing, and deploying machine learning models, as well as monitoring and maintaining the performance of these models in production.
Managing the MLOps stack, including version control systems, continuous integration and deployment tools, containerization, orchestration, and monitoring systems.
Ensure that the MLOps stack is scalable, reliable, and secure.

Skills Required:

3-6 years of MLOps experience
Preferably worked in the startup ecosystem

Primary Skills:

Experience with E2E MLOps systems like ClearML, Kubeflow, MLFlow etc.
Technical expertise in MLOps: Should have a deep understanding of the MLOps landscape and be able to leverage open-source systems to build scalable, reliable, and secure MLOps infrastructure.
Programming skills: Proficient in at least one programming language, such as Python, and have experience with data science libraries, such as TensorFlow, PyTorch, or Scikit-learn.
DevOps experience: Should have experience with DevOps tools and practices, such as Git, Docker, Kubernetes, and Jenkins.

Secondary Skills:

Version Control Systems (VCS) tools like Git and Subversion
Containerization technologies like Docker and Kubernetes
Cloud Platforms like AWS, Azure, and Google Cloud Platform
Data Preparation and Management tools like Apache Spark, Apache Hadoop, and SQL databases like PostgreSQL and MySQL
Machine Learning Frameworks like TensorFlow, PyTorch, and Scikit-learn
Monitoring and Logging tools like Prometheus, Grafana, and Elasticsearch
Continuous Integration and Continuous Deployment (CI/CD) tools like Jenkins, GitLab CI, and CircleCI
Explain ability and Interpretability tools like LIME and SHAP

Sr. Software Engineer, Rust

at Conviva

1 recruiter

Posted by Adarsh Sikarwar

Bengaluru (Bangalore)

4 - 8 yrs

₹15L - ₹40L / yr

Apache Kafka

Redis

Systems design

Data Structures

Algorithms

+5 more

Have you streamed a program on Disney+, watched your favorite binge-worthy series on Peacock or cheered your favorite team on during the World Cup from one of the 20 top streaming platforms around the globe? If the answer is yes, you’ve already benefitted from Conviva technology, helping the world’s leading streaming publishers deliver exceptional streaming experiences and grow their businesses.

Conviva is the only global streaming analytics platform for big data that collects, standardizes, and puts trillions of cross-screen, streaming data points in context, in real time. The Conviva platform provides comprehensive, continuous, census-level measurement through real-time, server side sessionization at unprecedented scale. If this sounds important, it is! We measure a global footprint of more than 500 million unique viewers in 180 countries watching 220 billion streams per year across 3 billion applications streaming on devices. With Conviva, customers get a unique level of actionability and scale from continuous streaming measurement insights and benchmarking across every stream, every screen, every second.

What you get to do in this role:

Work on extremely high scale RUST web services or backend systems.

Design and develop solutions for highly scalable web and backend systems.

Proactively identify and solve performance issues.

Maintain a high bar on code quality and unit testing.

What you bring to the role:

5+ years of hands-on software development experience.

At least 2+ years of RUST development experience.

Knowledge of cargo packages for kafka, redis etc.

Strong CS fundamentals, including system design, data structures and algorithms.

Expertise in backend and web services development.

Good analytical and troubleshooting skills.

What will help you stand out:

Experience working with large scale web services and applications.

Exposure to Golang, Scala or Java

Exposure to Big data systems like Kafka, Spark, Hadoop etc.

Underpinning the Conviva platform is a rich history of innovation. More than 60 patents represent award-winning technologies and standards, including first-of-its kind-innovations like time-state analytics and AI-automated data modeling, that surfaces actionable insights. By understanding real-world human experiences and having the ability to act within seconds of observation, our customers can solve business-critical issues and focus on growing their business ahead of the competition. Examples of the brands Conviva has helped fuel streaming growth for include: DAZN, Disney+, HBO, Hulu, NBCUniversal, Paramount+, Peacock, Sky, Sling TV, Univision and Warner Bros Discovery.

Privately held, Conviva is headquartered in Silicon Valley, California with offices and people around the globe. For more information, visit us at www.conviva.com. Join us to help extend our leadership position in big data streaming analytics to new audiences and markets!

What you get to do in this role:

Work on extremely high scale RUST web services or backend systems.

Design and develop solutions for highly scalable web and backend systems.

Proactively identify and solve performance issues.

Maintain a high bar on code quality and unit testing.

What you bring to the role:

5+ years of hands-on software development experience.

At least 2+ years of RUST development experience.

Knowledge of cargo packages for kafka, redis etc.

Strong CS fundamentals, including system design, data structures and algorithms.

Expertise in backend and web services development.

Good analytical and troubleshooting skills.

What will help you stand out:

Experience working with large scale web services and applications.

Exposure to Golang, Scala or Java

Exposure to Big data systems like Kafka, Spark, Hadoop etc.

software engineer

hiring for a leading client

Agency job

via Jobaajcom by Saksham Agarwal

Bengaluru (Bangalore)

1 - 3 yrs

₹12L - ₹15L / yr

Big Data

Apache Hadoop

Apache Impala

Apache Kafka

Apache Spark

+5 more

We are seeking a self motivated Software Engineer with hands-on experience to build sustainable data solutions, identifying and addressing performance bottlenecks, collaborating with other team members, and implementing best practices for data engineering. Our engineering process is fully agile, and has a really fast release cycle - which keeps our environment very energetic and fun.

What you'll do:

Design and development of scalable applications.
Collaborate with tech leads to get maximum understanding of underlying infrastructure.
Contribute to continual improvement by suggesting improvements to the software system.
Ensure high scalability and performance
You will advocate for good, clean, well documented and performing code; follow standards and best practices.
We'd love for you to have:

Education: Bachelor/Master Degree in Computer Science
Experience: 1-3 years of relevant experience in BI/Big-Data with hands-on coding experience
Mandatory Skills

Strong in problem-solving
Good exposure to Big Data technologies, Hive, Hadoop, Impala, Hbase, Kafka, Spark
Strong experience of Data Engineering
Able to comprehend challenges related to Database and Data Warehousing technologies and ability to understand complex design, system architecture
Experience with the software development lifecycle, design, develop, review, debug, document, and deliver (especially in a multi-location organization)
Working knowledge of Java, python
Desired Skills

Experience with reporting tools like Tableau, QlikView
Awareness of CI-CD pipeline
Inclination to work on cloud platform ex:- AWS
Crisp communication skills with team members, Business owners.
Be able to work in a challenging, dynamic environment and meet tight deadlines

Data Engineer

at Planet Spark

5 recruiters

Posted by Maneesh Dhooper

Gurugram

2 - 5 yrs

₹7L - ₹18L / yr

Data engineering

Data Engineer

Data Warehouse (DWH)

Python

SQL

+4 more

Responsibilities :

Create and maintain optimal data pipeline architecture
Assemble large, complex data sets that meet business requirements
Identifying, designing, and implementing internal process improvements including redesigning infrastructure for greater scalability, optimizing data delivery, and automating manual processes
Work with Data, Analytics & Tech team to extract, arrange and analyze data
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS technologies
Building analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics including operational efficiency and customer acquisition
Works closely with all business units and engineering teams to develop a strategy for long-term data platform architecture.
Working with stakeholders including data, design, product, and executive teams, and assisting them with data-related technical issues
Working with stakeholders including the Executive, Product, Data, and Design teams to support their data infrastructure needs while assisting with data-related technical issues.

Skill Requirements

SQL
Ruby or Python(Ruby preferred)
Apache-Hadoop based analytics
Data warehousing
Data architecture
Schema design
ML

Experience Requirement

Prior experience of 2 to 5 years as a Data Engineer.
Ability in managing and communicating data warehouse plans to internal teams.
Experience designing, building, and maintaining data processing systems.
Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions.
Excellent analytic skills associated with working on unstructured datasets.
Ability to build processes that support data transformation, workload management, data structures, dependency, and metadata.

Responsibilities :

Create and maintain optimal data pipeline architecture
Assemble large, complex data sets that meet business requirements
Identifying, designing, and implementing internal process improvements including redesigning infrastructure for greater scalability, optimizing data delivery, and automating manual processes
Work with Data, Analytics & Tech team to extract, arrange and analyze data
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS technologies
Building analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics including operational efficiency and customer acquisition
Works closely with all business units and engineering teams to develop a strategy for long-term data platform architecture.
Working with stakeholders including data, design, product, and executive teams, and assisting them with data-related technical issues
Working with stakeholders including the Executive, Product, Data, and Design teams to support their data infrastructure needs while assisting with data-related technical issues.

Skill Requirements

SQL
Ruby or Python(Ruby preferred)
Apache-Hadoop based analytics
Data warehousing
Data architecture
Schema design
ML

Experience Requirement

Prior experience of 2 to 5 years as a Data Engineer.
Ability in managing and communicating data warehouse plans to internal teams.
Experience designing, building, and maintaining data processing systems.
Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions.
Excellent analytic skills associated with working on unstructured datasets.
Ability to build processes that support data transformation, workload management, data structures, dependency, and metadata.

Data Engineer

A logistic Company

Agency job

via Anzy by Dattatraya Kolangade

Bengaluru (Bangalore)

5 - 7 yrs

₹18L - ₹25L / yr

Data engineering

ETL

SQL

Hadoop

Apache Spark

+13 more

Key responsibilities:
• Create and maintain data pipeline
• Build and deploy ETL infrastructure for optimal data delivery
• Work with various including product, design and executive team to troubleshoot data
related issues
• Create tools for data analysts and scientists to help them build and optimise the product
• Implement systems and process for data access controls and guarantees
• Distill the knowledge from experts in the field outside the org and optimise internal data
systems
Preferred qualifications/skills:
• 5+ years experience
• Strong analytical skills

____ 04

Freight Commerce Solutions Pvt Ltd.

• Degree in Computer Science, Statistics, Informatics, Information Systems
• Strong project management and organisational skills
• Experience supporting and working with cross-functional teams in a dynamic environment
• SQL guru with hands on experience on various databases
• NoSQL databases like Cassandra, MongoDB
• Experience with Snowflake, Redshift
• Experience with tools like Airflow, Hevo
• Experience with Hadoop, Spark, Kafka, Flink
• Programming experience in Python, Java, Scala

Senior Software Engineer

Digital Banking Firm

Agency job

via Qrata by Prajakta Kulkarni

Bengaluru (Bangalore)

5 - 10 yrs

₹20L - ₹40L / yr

Apache Kafka

Hadoop

Spark

Apache Hadoop

Big Data

+5 more

Location - Bangalore (Remote for now)

Designation - Sr. SDE (Platform Data Science)

About Platform Data Science Team

The Platform Data Science team works at the intersection of data science and engineering. Domain experts develop and advance platforms, including the data platforms, machine learning platform, other platforms for Forecasting, Experimentation, Anomaly Detection, Conversational AI, Underwriting of Risk, Portfolio Management, Fraud Detection & Prevention and many more. We also are the Data Science and Analytics partners for Product and provide Behavioural Science insights across Jupiter.

About the role:

We’re looking for strong Software Engineers that can combine EMR, Redshift, Hadoop, Spark, Kafka, Elastic Search, Tensorflow, Pytorch and other technologies to build the next generation Data Platform, ML Platform, Experimentation Platform. If this sounds interesting we’d love to hear from you!
This role will involve designing and developing software products that impact many areas of our business. The individual in this role will have responsibility help define requirements, create software designs, implement code to these specifications, provide thorough unit and integration testing, and support products while deployed and used by our stakeholders.

Key Responsibilities:

Participate, Own & Influence in architecting & designing of systems
Collaborate with other engineers, data scientists, product managers
Build intelligent systems that drive decisions
Build systems that enable us to perform experiments and iterate quickly
Build platforms that enable scientists to train, deploy and monitor models at scale
Build analytical systems that drives better decision making

Required Skills:

Programming experience with at least one modern language such as Java, Scala including object-oriented design
Experience in contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
Bachelor’s degree in Computer Science or related field
Computer Science fundamentals in object-oriented design
Computer Science fundamentals in data structures
Computer Science fundamentals in algorithm design, problem solving, and complexity analysis
Experience in databases, analytics, big data systems or business intelligence products:
Data lake, data warehouse, ETL, ML platform
Big data tech like: Hadoop, Apache Spark

Location - Bangalore (Remote for now)

Designation - Sr. SDE (Platform Data Science)

Required Skills:

Hadoop Admin

MNC

Agency job

via Fragma Data Systems by Harpreet kour

Bengaluru (Bangalore)

5 - 9 yrs

₹16L - ₹20L / yr

Apache Hadoop

Hadoop

Apache Hive

HDFS

SSL

+1 more

Responsibilities
     - Responsible for implementation and ongoing administration of Hadoop
infrastructure.
     - Aligning with the systems engineering team to propose and deploy new
hardware and software environments required for Hadoop and to expand existing
environments.
     - Working with data delivery teams to setup new Hadoop users. This job includes
setting up Linux users, setting up Kerberos principals and testing HDFS, Hive, Pig
and MapReduce access for the new users.
     - Cluster maintenance as well as creation and removal of nodes using tools like
Ganglia, Nagios, Cloudera Manager Enterprise, Dell Open Manage and other tools
     - Performance tuning of Hadoop clusters and Hadoop MapReduce routines
     - Screen Hadoop cluster job performances and capacity planning
     - Monitor Hadoop cluster connectivity and security
     - Manage and review Hadoop log files.
     - File system management and monitoring.
     - Diligently teaming with the infrastructure, network, database, application and
business intelligence teams to guarantee high data quality and availability
     - Collaboration with application teams to install operating system and Hadoop
updates, patches, version upgrades when required.

READ MORE OF THE JOB DESCRIPTION
Qualifications
Qualifications
     - Bachelors Degree in Information Technology, Computer Science or other
relevant fields
     - General operational expertise such as good troubleshooting skills,
understanding of systems capacity, bottlenecks, basics of memory, CPU, OS,
storage, and networks.
     - Hadoop skills like HBase, Hive, Pig, Mahout
     - Ability to deploy Hadoop cluster, add and remove nodes, keep track of jobs,
monitor critical parts of the cluster, configure name node high availability, schedule
and configure it and take backups.
     - Good knowledge of Linux as Hadoop runs on Linux.
     - Familiarity with open source configuration management and deployment tools
such as Puppet or Chef and Linux scripting.
     Nice to Have
     - Knowledge of Troubleshooting Core Java Applications is a plus.

Responsibilities
     - Responsible for implementation and ongoing administration of Hadoop
infrastructure.
     - Aligning with the systems engineering team to propose and deploy new
hardware and software environments required for Hadoop and to expand existing
environments.
     - Working with data delivery teams to setup new Hadoop users. This job includes
setting up Linux users, setting up Kerberos principals and testing HDFS, Hive, Pig
and MapReduce access for the new users.
     - Cluster maintenance as well as creation and removal of nodes using tools like
Ganglia, Nagios, Cloudera Manager Enterprise, Dell Open Manage and other tools
     - Performance tuning of Hadoop clusters and Hadoop MapReduce routines
     - Screen Hadoop cluster job performances and capacity planning
     - Monitor Hadoop cluster connectivity and security
     - Manage and review Hadoop log files.
     - File system management and monitoring.
     - Diligently teaming with the infrastructure, network, database, application and
business intelligence teams to guarantee high data quality and availability
     - Collaboration with application teams to install operating system and Hadoop
updates, patches, version upgrades when required.

READ MORE OF THE JOB DESCRIPTION
Qualifications
Qualifications
     - Bachelors Degree in Information Technology, Computer Science or other
relevant fields
     - General operational expertise such as good troubleshooting skills,
understanding of systems capacity, bottlenecks, basics of memory, CPU, OS,
storage, and networks.
     - Hadoop skills like HBase, Hive, Pig, Mahout
     - Ability to deploy Hadoop cluster, add and remove nodes, keep track of jobs,
monitor critical parts of the cluster, configure name node high availability, schedule
and configure it and take backups.
     - Good knowledge of Linux as Hadoop runs on Linux.
     - Familiarity with open source configuration management and deployment tools
such as Puppet or Chef and Linux scripting.
     Nice to Have
     - Knowledge of Troubleshooting Core Java Applications is a plus.

Hadoop Developer

MNC

Agency job

via Fragma Data Systems by Harpreet kour

Bengaluru (Bangalore)

3 - 6 yrs

₹6L - ₹15L / yr

Apache Hadoop

Hadoop

HDFS

Apache Sqoop

Apache Flume

+5 more

1. Design and development of data ingestion pipelines.
2. Perform data migration and conversion activities.
3. Develop and integrate software applications using suitable development
methodologies and standards, applying standard architectural patterns, taking
into account critical performance characteristics and security measures.
4. Collaborate with Business Analysts, Architects and Senior Developers to
establish the physical application framework (e.g. libraries, modules, execution
environments).
5. Perform end to end automation of ETL process for various datasets that are
being ingested into the big data platform.

Data Engineer

at DemandMatrix

4 recruiters

Posted by Harwinder Singh

Remote only

9 - 12 yrs

₹25L - ₹30L / yr

Big Data

PySpark

Apache Hadoop

Spark

Python

+3 more

Only a solid grounding in computer engineering, Unix, data structures and algorithms would enable you to meet this challenge.

7+ years of experience architecting, developing, releasing, and maintaining large-scale big data platforms on AWS or GCP

Understanding of how Big Data tech and NoSQL stores like MongoDB, HBase/HDFS, ElasticSearch synergize to power applications in analytics, AI and knowledge graphs

Understandingof how data processing models, data location patterns, disk IO, network IO, shuffling affect large scale text processing - feature extraction, searching etc

Expertise with a variety of data processing systems, including streaming, event, and batch (Spark, Hadoop/MapReduce)

5+ years proficiency in configuring and deploying applications on Linux-based systems

5+ years of experience Spark - especially Pyspark for transforming large non-structured text data, creating highly optimized pipelines

Experience with RDBMS, ETL techniques and frameworks (Sqoop, Flume) and big data querying tools (Pig, Hive)

Stickler of world class best practices, uncompromising on the quality of engineering, understand standards and reference architectures and deep in Unix philosophy with appreciation of big data design patterns, orthogonal code design and functional computation models

Hadoop Administrator

at Indium Software

16 recruiters

Posted by Ivarajneasan S K

Chennai

9 - 14 yrs

₹12L - ₹18L / yr

Apache Hadoop

Hadoop

Cloudera

HDFS

MapReduce

+2 more

Deploying a Hadoop cluster, maintaining a hadoop cluster, adding and removing nodes using cluster monitoring tools like Ganglia Nagios or Cloudera Manager, configuring the NameNode high availability and keeping a track of all the running hadoop jobs.

Good understating or hand's on in Kafka Admin / Apache Kafka Streaming.

Implementing, managing, and administering the overall hadoop infrastructure.

Takes care of the day-to-day running of Hadoop clusters

A hadoop administrator will have to work closely with the database team, network team, BI team, and application teams to make sure that all the big data applications are highly available and performing as expected.

If working with open source Apache Distribution, then hadoop admins have to manually setup all the configurations- Core-Site, HDFS-Site, YARN-Site and Map Red-Site. However, when working with popular hadoop distribution like Hortonworks, Cloudera or MapR the configuration files are setup on startup and the hadoop admin need not configure them manually.

Hadoop admin is responsible for capacity planning and estimating the requirements for lowering or increasing the capacity of the hadoop cluster.

Hadoop admin is also responsible for deciding the size of the hadoop cluster based on the data to be stored in HDFS.

Ensure that the hadoop cluster is up and running all the time.

Monitoring the cluster connectivity and performance.

Manage and review Hadoop log files.

Backup and recovery tasks

Resource and security management

Troubleshooting application errors and ensuring that they do not occur again.

Senior Software Engineer (Architect), Data

at Uber

1 video

10 recruiters

Posted by Suvidha Chib

Bengaluru (Bangalore)

7 - 15 yrs

₹0L / yr

Big Data

Hadoop

kafka

Spark

Apache Hive

+9 more

Data Platform engineering at Uber is looking for a strong Technical Lead (Level 5a Engineer) who has built high quality platforms and services that can operate at scale. 5a Engineer at Uber exhibits following qualities:

Demonstrate tech expertise › Demonstrate technical skills to go very deep or broad in solving classes of problems or creating broadly leverageable solutions.
Execute large scale projects › Define, plan and execute complex and impactful projects. You communicate the vision to peers and stakeholders.
Collaborate across teams › Domain resource to engineers outside your team and help them leverage the right solutions. Facilitate technical discussions and drive to a consensus.
Coach engineers › Coach and mentor less experienced engineers and deeply invest in their learning and success. You give and solicit feedback, both positive and negative, to others you work with to help improve the entire team.
Tech leadership › Lead the effort to define the best practices in your immediate team, and help the broader organization establish better technical or business processes.

What You’ll Do

Build a scalable, reliable, operable and performant data analytics platform for Uber’s engineers, data scientists, products and operations teams.
Work alongside the pioneers of big data systems such as Hive, Yarn, Spark, Presto, Kafka, Flink to build out a highly reliable, performant, easy to use software system for Uber’s planet scale of data.
Become proficient of multi-tenancy, resource isolation, abuse prevention, self-serve debuggability aspects of a high performant, large scale, service while building these capabilities for Uber's engineers and operation folks.

What You’ll Need

7+ years experience in building large scale products, data platforms, distributed systems in a high caliber environment.
Architecture: Identify and solve major architectural problems by going deep in your field or broad across different teams. Extend, improve, or, when needed, build solutions to address architectural gaps or technical debt.
Software Engineering/Programming: Create frameworks and abstractions that are reliable and reusable. advanced knowledge of at least one programming language, and are happy to learn more. Our core languages are Java, Python, Go, and Scala.
Data Engineering: Expertise in one of the big data analytics technologies we currently use such as Apache Hadoop (HDFS and YARN), Apache Hive, Impala, Drill, Spark, Tez, Presto, Calcite, Parquet, Arrow etc. Under the hood experience with similar systems such as Vertica, Apache Impala, Drill, Google Borg, Google BigQuery, Amazon EMR, Amazon RedShift, Docker, Kubernetes, Mesos etc.
Execution & Results: You tackle large technical projects/problems that are not clearly defined. You anticipate roadblocks and have strategies to de-risk timelines. You orchestrate work that spans multiple teams and keep your stakeholders informed.
A team player: You believe that you can achieve more on a team that the whole is greater than the sum of its parts. You rely on others’ candid feedback for continuous improvement.
Business acumen: You understand requirements beyond the written word. Whether you’re working on an API used by other developers, an internal tool consumed by our operation teams, or a feature used by millions of customers, your attention to details leads to a delightful user experience.

Demonstrate tech expertise › Demonstrate technical skills to go very deep or broad in solving classes of problems or creating broadly leverageable solutions.
Execute large scale projects › Define, plan and execute complex and impactful projects. You communicate the vision to peers and stakeholders.
Collaborate across teams › Domain resource to engineers outside your team and help them leverage the right solutions. Facilitate technical discussions and drive to a consensus.
Coach engineers › Coach and mentor less experienced engineers and deeply invest in their learning and success. You give and solicit feedback, both positive and negative, to others you work with to help improve the entire team.
Tech leadership › Lead the effort to define the best practices in your immediate team, and help the broader organization establish better technical or business processes.

What You’ll Do

Build a scalable, reliable, operable and performant data analytics platform for Uber’s engineers, data scientists, products and operations teams.
Work alongside the pioneers of big data systems such as Hive, Yarn, Spark, Presto, Kafka, Flink to build out a highly reliable, performant, easy to use software system for Uber’s planet scale of data.
Become proficient of multi-tenancy, resource isolation, abuse prevention, self-serve debuggability aspects of a high performant, large scale, service while building these capabilities for Uber's engineers and operation folks.

What You’ll Need

7+ years experience in building large scale products, data platforms, distributed systems in a high caliber environment.
Architecture: Identify and solve major architectural problems by going deep in your field or broad across different teams. Extend, improve, or, when needed, build solutions to address architectural gaps or technical debt.
Software Engineering/Programming: Create frameworks and abstractions that are reliable and reusable. advanced knowledge of at least one programming language, and are happy to learn more. Our core languages are Java, Python, Go, and Scala.
Data Engineering: Expertise in one of the big data analytics technologies we currently use such as Apache Hadoop (HDFS and YARN), Apache Hive, Impala, Drill, Spark, Tez, Presto, Calcite, Parquet, Arrow etc. Under the hood experience with similar systems such as Vertica, Apache Impala, Drill, Google Borg, Google BigQuery, Amazon EMR, Amazon RedShift, Docker, Kubernetes, Mesos etc.
Execution & Results: You tackle large technical projects/problems that are not clearly defined. You anticipate roadblocks and have strategies to de-risk timelines. You orchestrate work that spans multiple teams and keep your stakeholders informed.
A team player: You believe that you can achieve more on a team that the whole is greater than the sum of its parts. You rely on others’ candid feedback for continuous improvement.
Business acumen: You understand requirements beyond the written word. Whether you’re working on an API used by other developers, an internal tool consumed by our operation teams, or a feature used by millions of customers, your attention to details leads to a delightful user experience.

Software Developer

at flockwits

1 recruiter

Posted by Arvindh Saravanabhavan

Coimbatore

0 - 2 yrs

₹1L - ₹2L / yr

Java

Big Data

Data storage

Intelligence

Spring

+3 more

FlockWits is looking for Software Engineers to join team which is destined to build the advanced unified marketing platform powered by machine learning, enabled by Big data . You will be responsible for helping design, build, and deliver a platform that accelerates sales of clients. This role requires thirst to learn cutting technologies on App development, Data storage and processing, Artificial Intelligence, security and dev ops. We look for desire to create new things, dive in wherever there's a need, eagerness to make an impact as an individual and the willingness to learn new things. You must be self-motivated, innovative, and proactive. The role offers significant opportunities for growth. You will... • Help build the leading platform using Java spring for app, NoSQL for storage , Hadoop for processing , python for machine learning and Jenkins for dev ops. • Design, code, and implement clean and elegant user interfaces and workflows • Design and build UI with best usable features . You have... • Any engineering degree or MCA • Thirst to learn. • Self-driven and motivated, with a strong sense of ownership and craftsmanship. • Enthusiasm to sharpen soft skills like written and verbal communication.

Software Developer

at flockwits

1 recruiter

Posted by Arvindh Saravanabhavan

Coimbatore

0 - 2 yrs

₹1L - ₹2L / yr

Java

Big Data

Data storage

Intelligence

Spring

+4 more

Assistant Manager - Analytics - Product Team

at LatentView Analytics

3 recruiters

Posted by Kannikanti madhuri

Chennai

5 - 8 yrs

₹5L - ₹8L / yr

Data Science

Analytics

Data Analytics

Data modeling

Data mining

+7 more

Job Overview :We are looking for an experienced Data Science professional to join our Product team and lead the data analytics team and manage the processes and people responsible for accurate data collection, processing, modelling and analysis. The ideal candidate has a knack for seeing solutions in sprawling data sets and the business mindset to convert insights into strategic opportunities for our clients. The incumbent will work closely with leaders across product, sales, and marketing to support and implement high-quality, data-driven decisions. They will ensure data accuracy and consistent reporting by designing and creating optimal processes and procedures for analytics employees to follow. They will use advanced data modelling, predictive modelling, natural language processing and analytical techniques to interpret key findings.Responsibilities for Analytics Manager :- Build, develop and maintain data models, reporting systems, data automation systems, dashboards and performance metrics support that support key business decisions.- Design and build technical processes to address business issues.- Manage and optimize processes for data intake, validation, mining and engineering as well as modelling, visualization and communication deliverables.- Examine, interpret and report results to stakeholders in leadership, technology, sales, marketing and product teams.- Develop and implement quality controls and standards to ensure quality standards- Anticipate future demands of initiatives related to people, technology, budget and business within your department and design/implement solutions to meet these needs.- Communicate results and business impacts of insight initiatives to stakeholders within and outside of the company.- Lead cross-functional projects using advanced data modelling and analysis techniques to discover insights that will guide strategic decisions and uncover optimization opportunities.Qualifications for Analytics Manager :- Working knowledge of data mining principles: predictive analytics, mapping, collecting data from multiple cloud-based data sources- Strong SQL skills, ability to perform effective querying- Understanding of and experience using analytical concepts and statistical techniques: hypothesis development, designing tests/experiments, analysing data, drawing conclusions, and developing actionable recommendations for business units.- Experience and knowledge of statistical modelling techniques: GLM multiple regression, logistic regression, log-linear regression, variable selection, etc.- Experience working with and creating databases and dashboards using all relevant data to inform decisions.- Strong problem solving, quantitative and analytical abilities.- Strong ability to plan and manage numerous processes, people and projects simultaneously.- Excellent communication, collaboration and delegation skills.- We- re looking for someone with at least 5 years of experience in a position monitoring, managing and drawing insights from data, and someone with at least 3 years of experience leading a team. The right candidate will also be proficient and experienced with the following tools/programs :- Strong programming skills with querying languages: R, Python etc.- Experience with big data tools like Hadoop- Experience with data visualization tools: Tableau, d3.js, etc.- Experience with Excel, Word, and PowerPoint.

Backend Engineer - Java/Scala/Distributed System/NoSQL

at Intellicar Telematics Pvt Ltd

2 recruiters

Posted by Lata Patil

Bengaluru (Bangalore)

3 - 7 yrs

₹8L - ₹9L / yr

Java

Scala

Distributed Systems

NOSQL Databases

Multithreading

+26 more

Systems EngineerAbout Intellicar Telematics Pvt LtdIntellicar Telematics Private Limited is a vehicular telematics organization founded in 2015 with the vision of connecting businesses and customers to their vehicles in a meaningful way. We provide vehicle owners with the ability to connect and diagnose vehicles remotely in real-time. Our team consists of individuals with an in-depth knowledge and understanding in automotive engineering, driver analytics and information technology. By leveraging our expertise in the automotive domain, we have created solutions to reduce operational and maintenance costs of large fleets, and ensure safety at all times.Solutions :- Enterprise Fleet Management, GPS Tracking- Remote engine diagnostics, Driver behavior & training- Technology Integration : GIS, GPS, GPRS, OBD, WEB, Accelerometer, RFID, On-board Storage.Intellicar's team of accomplished automotive Engineers, hardware manufacturers, Software Developers and Data Scientists have developed the best solutions to track vehicles and drivers, and ensure optimum performance, utilization and safety at all times.We cater to the needs of our clients across various industries such as: Self drive cars, Taxi cab rentals, Taxi cab aggregators, Logistics, Driver training, Bike Rentals, Construction, ecommerce, armored trucks, Manufacturing, dealership and more. Desired skills as a developer :- Education: BE/B.Tech in Computer Science or related field.- 4+ years of experience with scalable distributed systems applications and building scalable multi-threaded server applications.- Strong programming skills in Java or Scala on Linux or a Unix based OS.- Understanding of distributed systems like Hadoop, Spark, Cassandra, Kafka.- Good understanding of HTTP, SQL, Database internals.- Good understanding of Internet and how it works- Create new features from scratch, enhance existing features and optimize existing functionality, from conception and design through testing and deployment.- Work on projects that make our network more stable, faster, and secure.- Work with our development QA and system QA teams to come up with regression tests that cover new changes to our software

Big Data/Java Programming

at Dailyhunt

4 recruiters

Posted by khushboo jain

Bengaluru (Bangalore)

3 - 9 yrs

₹3L - ₹9L / yr

Java

Big Data

Hadoop

Pig

Apache Hive

+13 more

What You'll Do :- Develop analytic tools, working on BigData and Distributed Environment. Scalability will be the key- Provide architectural and technical leadership on developing our core Analytic platform- Lead development efforts on product features on Java- Help scale our mobile platform as we experience massive growthWhat we Need :- Passion to build analytics & personalisation platform at scale- 3 to 9 years of software engineering experience with product based company in data analytics/big data domain- Passion for the Designing and development from the scratch.- Expert level Java programming and experience leading full lifecycle of application Dev.- Exp in Analytics, Hadoop, Pig, Hive, Mapreduce, ElasticSearch, MongoDB is an additional advantage- Strong communication skills, verbal and written

Get to hear about interesting companies hiring right now

Follow Cutshort

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs

Get to hear about interesting companies hiring right now

Follow Cutshort