Big data Jobs in Chennai

32+ Big data Jobs in Chennai | Big data Job openings in Chennai

Apply to 32+ Big data Jobs in Chennai on CutShort.io. Explore the latest Big data Job opportunities across top companies like Google, Amazon & Adobe.

PySpark/Scala Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Bengaluru (Bangalore), Hyderabad, Pune, Delhi, Kolkata, Chennai

5 - 8 yrs

₹7L - ₹30L / yr

Scala

Python

PySpark

Apache Hive

Spark

+3 more

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

Skills and competencies:

Required:

· Strong analytical skills in conducting sophisticated statistical analysis using bureau/vendor data, customer performance

Data and macro-economic data to solve business problems.

· Working experience in languages PySpark & Scala to develop code to validate and implement models and codes in

Credit Risk/Banking

· Experience with distributed systems such as Hadoop/MapReduce, Spark, streaming data processing, cloud architecture.

Familiarity with machine learning frameworks and libraries (like scikit-learn, SparkML, tensorflow, pytorch etc.
Experience in systems integration, web services, batch processing
Experience in migrating codes to PySpark/Scala is big Plus
The ability to act as liaison conveying information needs of the business to IT and data constraints to the business

applies equal conveyance regarding business strategy and IT strategy, business processes and work flow

· Flexibility in approach and thought process

· Attitude to learn and comprehend the periodical changes in the regulatory requirement as per FED

IBM MDM Developer

at Tata Consultancy Services

2 recruiters

Agency job

via Risk Resources LLP hyd by susmitha o

Chennai, Hyderabad, Kolkata, Delhi, Pune, Bengaluru (Bangalore)

5 - 8 yrs

₹7L - ₹30L / yr

Informatica MDM

MDM

ETL

Big Data

• Technical expertise in the area of development of Master Data Management, data extraction, transformation, and load (ETL) applications, big data using existing and emerging technology platforms and cloud architecture

• Functions as lead developer• Support System Analysis, Technical/Data design, development, unit testing, and oversee end-to-end data solution.

• Technical SME in Master Data Management application, ETL, big data and cloud technologies

• Collaborate with IT teams to ensure technical designs and implementations account for requirements, standards, and best practices

• Performance tuning of end-to-end MDM, database, ETL, Big data processes or in the source/target database endpoints as needed.

• Mentor and advise junior members of team to provide guidance.

• Perform a technical lead and solution lead role for a team of onshore and offshore developers

• Functions as lead developer• Support System Analysis, Technical/Data design, development, unit testing, and oversee end-to-end data solution.

• Technical SME in Master Data Management application, ETL, big data and cloud technologies

• Collaborate with IT teams to ensure technical designs and implementations account for requirements, standards, and best practices

• Performance tuning of end-to-end MDM, database, ETL, Big data processes or in the source/target database endpoints as needed.

• Mentor and advise junior members of team to provide guidance.

• Perform a technical lead and solution lead role for a team of onshore and offshore developers

Senior Machine Learning Engineer

at NeoGenCode Technologies Pvt Ltd

2 candid answers

Posted by Akshay Patil

Chennai

8 - 12 yrs

₹10L - ₹26L / yr

Python

Machine Learning (ML)

Scikit-Learn

TensorFlow

PyTorch

+10 more

Job Title : Senior Machine Learning Engineer

Experience : 8+ Years

Location : Chennai

Notice Period : Immediate Joiners Only

Work Mode : Hybrid

Job Summary :

We are seeking an experienced Machine Learning Engineer with a strong background in Python, ML algorithms, and data-driven development.

The ideal candidate should have hands-on experience with popular ML frameworks and tools, solid understanding of clustering and classification techniques, and be comfortable working in Unix-based environments with Agile teams.

Mandatory Skills :

Programming Languages : Python
Machine Learning : Strong experience with ML algorithms, models, and libraries such as Scikit-learn, TensorFlow, and PyTorch
ML Concepts : Proficiency in supervised and unsupervised learning, including techniques such as K-Means, DBSCAN, and Fuzzy Clustering
Operating Systems : RHEL or any Unix-based OS
Databases : Oracle or any relational database
Version Control : Git
Development Methodologies : Agile

Desired Skills :

Experience with issue tracking tools such as Azure DevOps or JIRA.
Understanding of data science concepts.
Familiarity with Big Data algorithms, models, and libraries.

Job Title : Senior Machine Learning Engineer

Experience : 8+ Years

Location : Chennai

Notice Period : Immediate Joiners Only

Work Mode : Hybrid

Job Summary :

We are seeking an experienced Machine Learning Engineer with a strong background in Python, ML algorithms, and data-driven development.

Mandatory Skills :

Programming Languages : Python
Machine Learning : Strong experience with ML algorithms, models, and libraries such as Scikit-learn, TensorFlow, and PyTorch
ML Concepts : Proficiency in supervised and unsupervised learning, including techniques such as K-Means, DBSCAN, and Fuzzy Clustering
Operating Systems : RHEL or any Unix-based OS
Databases : Oracle or any relational database
Version Control : Git
Development Methodologies : Agile

Desired Skills :

Experience with issue tracking tools such as Azure DevOps or JIRA.
Understanding of data science concepts.
Familiarity with Big Data algorithms, models, and libraries.

Platform Engineer

at Mobile Programming LLC

1 video

34 recruiters

Posted by Sukhdeep Singh

Chennai

4 - 7 yrs

₹13L - ₹15L / yr

Data Analytics

Data Visualization

PowerBI

Tableau

Qlikview

+10 more

Title: Platform Engineer Location: Chennai Work Mode: Hybrid (Remote and Chennai Office) Experience: 4+ years Budget: 16 - 18 LPA

Responsibilities:

Parse data using Python, create dashboards in Tableau.
Utilize Jenkins for Airflow pipeline creation and CI/CD maintenance.
Migrate Datastage jobs to Snowflake, optimize performance.
Work with HDFS, Hive, Kafka, and basic Spark.
Develop Python scripts for data parsing, quality checks, and visualization.
Conduct unit testing and web application testing.
Implement Apache Airflow and handle production migration.
Apply data warehousing techniques for data cleansing and dimension modeling.

Requirements:

4+ years of experience as a Platform Engineer.
Strong Python skills, knowledge of Tableau.
Experience with Jenkins, Snowflake, HDFS, Hive, and Kafka.
Proficient in Unix Shell Scripting and SQL.
Familiarity with ETL tools like DataStage and DMExpress.
Understanding of Apache Airflow.
Strong problem-solving and communication skills.

Note: Only candidates willing to work in Chennai and available for immediate joining will be considered. Budget for this position is 16 - 18 LPA.

Title: Platform Engineer Location: Chennai Work Mode: Hybrid (Remote and Chennai Office) Experience: 4+ years Budget: 16 - 18 LPA

Responsibilities:

Parse data using Python, create dashboards in Tableau.
Utilize Jenkins for Airflow pipeline creation and CI/CD maintenance.
Migrate Datastage jobs to Snowflake, optimize performance.
Work with HDFS, Hive, Kafka, and basic Spark.
Develop Python scripts for data parsing, quality checks, and visualization.
Conduct unit testing and web application testing.
Implement Apache Airflow and handle production migration.
Apply data warehousing techniques for data cleansing and dimension modeling.

Requirements:

4+ years of experience as a Platform Engineer.
Strong Python skills, knowledge of Tableau.
Experience with Jenkins, Snowflake, HDFS, Hive, and Kafka.
Proficient in Unix Shell Scripting and SQL.
Familiarity with ETL tools like DataStage and DMExpress.
Understanding of Apache Airflow.
Strong problem-solving and communication skills.

Note: Only candidates willing to work in Chennai and available for immediate joining will be considered. Budget for this position is 16 - 18 LPA.

Kafka Developer

at iLink Systems

1 video

1 recruiter

Posted by Ganesh Sooriyamoorthu

Chennai, Pune, Noida, Bengaluru (Bangalore)

5 - 15 yrs

₹10L - ₹15L / yr

Apache Kafka

Big Data

Java

Spark

Hadoop

+1 more

KSQL
Data Engineering spectrum (Java/Spark)
Spark Scala / Kafka Streaming
Confluent Kafka components
Basic understanding of Hadoop

KSQL
Data Engineering spectrum (Java/Spark)
Spark Scala / Kafka Streaming
Confluent Kafka components
Basic understanding of Hadoop

Big Data Engineer

at Virtusa

2 recruiters

Posted by Priyanka Sathiyamoorthi

Chennai

11 - 15 yrs

₹15L - ₹33L / yr

Spark

Hadoop

Big Data

Data engineering

PySpark

+3 more

We are looking for a Big Data Engineer with java for Chennai Location

Location : Chennai

Exp : 11 to 15 Years

Job description

Required Skill:

1. Candidate should have minimum 7 years of experience as total

2. Candidate should have minimum 4 years of experience in Big Data design and development

3. Candidate should have experience in Java, Spark, Hive & Hadoop, Python

4. Candidate should have experience in any RDBMS.

Roles & Responsibility:

1. To create work plans, monitor and track the work schedule for on time delivery as per the defined quality standards.

2. To develop and guide the team members in enhancing their technical capabilities and increasing productivity.

3. To ensure process improvement and compliance in the assigned module, and participate in technical discussions or review.

4. To prepare and submit status reports for minimizing exposure and risks on the project or closure of escalation

Regards,

Priyanka S

7P8R9I9Y4A0N8K8A7S7

We are looking for a Big Data Engineer with java for Chennai Location

Location : Chennai

Exp : 11 to 15 Years

Job description

Required Skill:

1. Candidate should have minimum 7 years of experience as total

2. Candidate should have minimum 4 years of experience in Big Data design and development

3. Candidate should have experience in Java, Spark, Hive & Hadoop, Python

4. Candidate should have experience in any RDBMS.

Roles & Responsibility:

1. To create work plans, monitor and track the work schedule for on time delivery as per the defined quality standards.

2. To develop and guide the team members in enhancing their technical capabilities and increasing productivity.

3. To ensure process improvement and compliance in the assigned module, and participate in technical discussions or review.

4. To prepare and submit status reports for minimizing exposure and risks on the project or closure of escalation

Regards,

Priyanka S

7P8R9I9Y4A0N8K8A7S7

Senior Data Engineer

at Cubera Tech India Pvt Ltd

Posted by Surabhi Koushik

Bengaluru (Bangalore), Chennai

5 - 8 yrs

Best in industry

Data engineering

Big Data

Java

Python

Hibernate (Java)

+10 more

Data Engineer- Senior

Cubera is a data company revolutionizing big data analytics and Adtech through data share value principles wherein the users entrust their data to us. We refine the art of understanding, processing, extracting, and evaluating the data that is entrusted to us. We are a gateway for brands to increase their lead efficiency as the world moves towards web3.

What are you going to do?

Design & Develop high performance and scalable solutions that meet the needs of our customers.

Closely work with the Product Management, Architects and cross functional teams.

Build and deploy large-scale systems in Java/Python.

Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

Create data tools for analytics and data scientist team members that assist them in building and optimizing their algorithms.

Follow best practices that can be adopted in Bigdata stack.

Use your engineering experience and technical skills to drive the features and mentor the engineers.

What are we looking for ( Competencies) :

Bachelor’s degree in computer science, computer engineering, or related technical discipline.

Overall 5 to 8 years of programming experience in Java, Python including object-oriented design.

Data handling frameworks: Should have a working knowledge of one or more data handling frameworks like- Hive, Spark, Storm, Flink, Beam, Airflow, Nifi etc.

Data Infrastructure: Should have experience in building, deploying and maintaining applications on popular cloud infrastructure like AWS, GCP etc.

Data Store: Must have expertise in one of general-purpose No-SQL data stores like Elasticsearch, MongoDB, Redis, RedShift, etc.

Strong sense of ownership, focus on quality, responsiveness, efficiency, and innovation.

Ability to work with distributed teams in a collaborative and productive manner.

Benefits:

Competitive Salary Packages and benefits.

Collaborative, lively and an upbeat work environment with young professionals.

Job Category: Development

Job Type: Full Time

Job Location: Bangalore

Data Engineer- Senior

What are you going to do?

Design & Develop high performance and scalable solutions that meet the needs of our customers.

Closely work with the Product Management, Architects and cross functional teams.

Build and deploy large-scale systems in Java/Python.

Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

Create data tools for analytics and data scientist team members that assist them in building and optimizing their algorithms.

Follow best practices that can be adopted in Bigdata stack.

Use your engineering experience and technical skills to drive the features and mentor the engineers.

What are we looking for ( Competencies) :

Bachelor’s degree in computer science, computer engineering, or related technical discipline.

Overall 5 to 8 years of programming experience in Java, Python including object-oriented design.

Data handling frameworks: Should have a working knowledge of one or more data handling frameworks like- Hive, Spark, Storm, Flink, Beam, Airflow, Nifi etc.

Data Infrastructure: Should have experience in building, deploying and maintaining applications on popular cloud infrastructure like AWS, GCP etc.

Data Store: Must have expertise in one of general-purpose No-SQL data stores like Elasticsearch, MongoDB, Redis, RedShift, etc.

Strong sense of ownership, focus on quality, responsiveness, efficiency, and innovation.

Ability to work with distributed teams in a collaborative and productive manner.

Benefits:

Competitive Salary Packages and benefits.

Collaborative, lively and an upbeat work environment with young professionals.

Job Category: Development

Job Type: Full Time

Job Location: Bangalore

Big data Cloud

at Altimetrik

8 recruiters

Agency job

via SOT-Science of talent Acquisition consulting services Pvt Ltd by Mahesh Kumar

Chennai, Hyderabad

5 - 10 yrs

₹10L - ₹25L / yr

PySpark

Data engineering

Big Data

Hadoop

Spark

+2 more

Bigdata with cloud:

Experience : 5-10 years

Location : Hyderabad/Chennai

Notice period : 15-20 days Max

1. Expertise in building AWS Data Engineering pipelines with AWS Glue -> Athena -> Quick sight

2. Experience in developing lambda functions with AWS Lambda

3. Expertise with Spark/PySpark – Candidate should be hands on with PySpark code and should be able to do transformations with Spark

4. Should be able to code in Python and Scala.

5. Snowflake experience will be a plus

Bigdata with cloud:

Experience : 5-10 years

Location : Hyderabad/Chennai

Notice period : 15-20 days Max

1. Expertise in building AWS Data Engineering pipelines with AWS Glue -> Athena -> Quick sight

2. Experience in developing lambda functions with AWS Lambda

3. Expertise with Spark/PySpark – Candidate should be hands on with PySpark code and should be able to do transformations with Spark

4. Should be able to code in Python and Scala.

5. Snowflake experience will be a plus

GCP Developer

at Quess Corp Limited

6 recruiters

Posted by Anjali Singh

Noida, Delhi, Gurugram, Ghaziabad, Faridabad, Bengaluru (Bangalore), Chennai

5 - 8 yrs

₹1L - ₹15L / yr

Google Cloud Platform (GCP)

Python

Big Data

Data processing

Data Visualization

GCP Data Analyst profile must have below skills sets :

Knowledge of programming languages like https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.simplilearn.com%2Ftutorials%2Fsql-tutorial%2Fhow-to-become-sql-developer&;data=05%7C01%7Ca_anjali%40hcl.com%7C4ae720b3f3cc45c3e04608da3346b335%7C189de737c93a4f5a8b686f4ca9941912%7C0%7C0%7C637878675987971859%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=EImfaJAD1KHOyrBQ7FkbaPl1STtfnf4QdQlbjw72%2BmE%3D&reserved=0" target="_blank">SQL, Oracle, R, MATLAB, Java and https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.simplilearn.com%2Fwhy-learn-python-a-guide-to-unlock-your-python-career-article&;data=05%7C01%7Ca_anjali%40hcl.com%7C4ae720b3f3cc45c3e04608da3346b335%7C189de737c93a4f5a8b686f4ca9941912%7C0%7C0%7C637878675987971859%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Z2n1Xy%2F3YN6nQqSweU5T7EfUTa1kPAAjbCMTWxDCh%2FY%3D&reserved=0" target="_blank">Python
Data cleansing, data visualization, data wrangling
Data modeling , data warehouse concepts
Adapt to Big data platform like Hadoop, Spark for stream & batch processing
GCP (Cloud Dataproc, Cloud Dataflow, Cloud Datalab, Cloud Dataprep, BigQuery, Cloud Datastore, Cloud Datafusion, Auto ML etc)

Bigdata QA Lead

MNC

Agency job

via Eurka IT SOL by Srikanth a

Chennai

5 - 11 yrs

₹10L - ₹15L / yr

PySpark

SQL

Test Automation (QA)

Big Data

Data Science

Lead QA: more than 5 years experience , led the team of more than 5 people in big data platform, should have experience in Test Automation framework, should have experience of Test process documentation

Big Data Developer / Lead / Architect

Telecom Client

Agency job

via Eurka IT SOL by Srikanth a

Chennai

5 - 13 yrs

₹9L - ₹28L / yr

PySpark

Data engineering

Big Data

Hadoop

Spark

+6 more

Demonstrable experience owning and developing big data solutions, using Hadoop, Hive/Hbase, Spark, Databricks, ETL/ELT for 5+ years

· 10+ years of Information Technology experience, preferably with Telecom / wireless service providers.

· Experience in designing data solution following Agile practices (SAFe methodology); designing for testability, deployability and releaseability; rapid prototyping, data modeling, and decentralized innovation

DataOps mindset: allowing the architecture of a system to evolve continuously over time, while simultaneously supporting the needs of current users
Create and maintain Architectural Runway, and Non-Functional Requirements.
Design for Continuous Delivery Pipeline (CI/CD data pipeline) and enables Built-in Quality & Security from the start.

· To be able to demonstrate an understanding and ideally use of, at least one recognised architecture framework or standard e.g. TOGAF, Zachman Architecture Framework etc

· The ability to apply data, research, and professional judgment and experience to ensure our products are making the biggest difference to consumers

· Demonstrated ability to work collaboratively

· Excellent written, verbal and social skills - You will be interacting with all types of people (user experience designers, developers, managers, marketers, etc.)

· Ability to work in a fast paced, multiple project environment on an independent basis and with minimal supervision

· Technologies: .NET, AWS, Azure; Azure Synapse, Nifi, RDS, Apache Kafka, Azure Data bricks, Azure datalake storage, Power BI, Reporting Analytics, QlickView, SQL on-prem Datawarehouse; BSS, OSS & Enterprise Support Systems

Demonstrable experience owning and developing big data solutions, using Hadoop, Hive/Hbase, Spark, Databricks, ETL/ELT for 5+ years

· 10+ years of Information Technology experience, preferably with Telecom / wireless service providers.

DataOps mindset: allowing the architecture of a system to evolve continuously over time, while simultaneously supporting the needs of current users
Create and maintain Architectural Runway, and Non-Functional Requirements.
Design for Continuous Delivery Pipeline (CI/CD data pipeline) and enables Built-in Quality & Security from the start.

· To be able to demonstrate an understanding and ideally use of, at least one recognised architecture framework or standard e.g. TOGAF, Zachman Architecture Framework etc

· The ability to apply data, research, and professional judgment and experience to ensure our products are making the biggest difference to consumers

· Demonstrated ability to work collaboratively

· Excellent written, verbal and social skills - You will be interacting with all types of people (user experience designers, developers, managers, marketers, etc.)

· Ability to work in a fast paced, multiple project environment on an independent basis and with minimal supervision

Data Engineering Manager

at Amagi Media Labs

3 recruiters

Posted by Rajesh C

Bengaluru (Bangalore), Chennai

10 - 14 yrs

₹40L - ₹60L / yr

Engineering Management

Python

Spark

Java

Big Data

+3 more

Job Title: Engineering Manager

Job Location: Chennai, Bangalore
Job Summary
The Engineering Org is looking for a proficient Engineering Manager to join a team that is building exciting
and futuristic Data Products at Condé Nast to enable both internal and external marketers to target
audiences in real time. As an Engineering Manager, you will drive the day-to-day execution of technical
and architectural decisions. EM will own engineering deliverables inclusive of solving dependencies
such as architecture, solutions, sequencing, and working with other engineering delivery teams.This role
is also responsible for driving innovation, prototyping, and recommending solutions. Above all, you will
influence how users interact with Conde Nast’s industry-leading journalism.
● Primary Responsibilities
● Manage a high performing team of Software and Data Engineers within the Data & ML
Engineering team part of Engineering Data Organization.
● Provide leadership and guidance to the team in Data Discovery, Data Ingestion, Transformation
and Storage
● Utilizing product mindset to build, scale and deploy holistic data products after successful
prototyping and drive their engineering implementation
● Provide technical coaching and lead direct reports and other members of adjacent support teams
to the highest level of performance..
● Evaluate performance of direct reports and offer career development guidance.
● Meeting hiring and retention targets of the team & building a high-performance culture
● Handle escalations from internal stakeholders and manage critical issues to resolution.
● Collaborate with Architects, Product Manager, Project Manager and other teams to deliver high
quality products.
● Identify recurring system and application issues and enable engineers to work with release teams,
infra teams, product development, vendors and other stakeholders in investigating and resolving
the cause.
● Required Skills
● 4+ years of managing Software Development teams, preferably in ML and Data
Engineering teams.
● 4+ years of Agile Software development practices
● 12+ years of Software Development experience.
● Excellent Problem Solving and System Design skill
● Hands on: Writing and Reviewing code primarily in Spark, Python and/or Java
● Hand on: Architect & Design end to end Data Pipeline (noSQL databases, Job Schedulers, Big
Data Development preferably on Databricks / Cloud)
● Experience with SOA & Microservice architecture
● Knowledge of Software Engineering best practices with experience on implementing CI/CD,
Log aggregation/Monitoring/alerting for production system
● Working Knowledge of cloud and devops skills (AWS will be preferred)
● Strong verbal and written communication skills.
● Experience in evaluating team member performance and offering career development
guidance.
● Experience in providing technical coaching to direct reports.
● Experience in architecting highly scalable products.
● Experience in collaborating with global stakeholder teams.
● Experience in working on highly available production systems.
● Strong knowledge of software release process and release pipeline.
About Condé Nast
CONDÉ NAST INDIA (DATA)
Over the years, Condé Nast successfully expanded and diversified into digital, TV, and social
platforms - in other words, a staggering amount of user data. Condé Nast made the right move to
invest heavily in understanding this data and formed a whole new Data team entirely dedicated to
data processing, engineering, analytics, and visualization. This team helps drive engagement, fuel
process innovation, further content enrichment, and increase market revenue. The Data team
aimed to create a company culture where data was the common language and facilitate an
environment where insights shared in real-time could improve performance.
The Global Data team operates out of Los Angeles, New York, Chennai, and London. The team at
Condé Nast Chennai works extensively with data to amplify its brands' digital capabilities and boost
online revenue. We are broadly divided into four groups, Data Intelligence, Data Engineering, Data
Science, and Operations (including Product and Marketing Ops, Client Services) along with Data
Strategy and monetization. The teams built capabilities and products to create data-driven solutions
for better audience engagement.
What we look forward to:
We want to welcome bright, new minds into our midst and work together to create diverse forms of
self-expression. At Condé Nast, we encourage the imaginative and celebrate the extraordinary. We
are a media company for the future, with a remarkable past. We are Condé Nast, and It Starts Here.

Job Title: Engineering Manager

Data Engineer

at Ganit Business Solutions

3 recruiters

Posted by Viswanath Subramanian

Chennai, Bengaluru (Bangalore), Mumbai

4 - 6 yrs

₹7L - ₹15L / yr

SQL

Amazon Web Services (AWS)

Data Warehouse (DWH)

Informatica

ETL

+1 more

Responsibilities:

Must be able to write quality code and build secure, highly available systems.
Assemble large, complex datasets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing datadelivery, re-designing infrastructure for greater scalability, etc with the guidance.
Create datatools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
Monitoring performance and advising any necessary infrastructure changes.
Defining dataretention policies.
Implementing the ETL process and optimal data pipeline architecture
Build analytics tools that utilize the datapipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
Create design documents that describe the functionality, capacity, architecture, and process.
Develop, test, and implement datasolutions based on finalized design documents.
Work with dataand analytics experts to strive for greater functionality in our data
Proactively identify potential production issues and recommend and implement solutions

Skillsets:

Good understanding of optimal extraction, transformation, and loading of datafrom a wide variety of data sources using SQL and AWS ‘big data’ technologies.
Proficient understanding of distributed computing principles
Experience in working with batch processing/ real-time systems using various open-source technologies like NoSQL, Spark, Pig, Hive, Apache Airflow.
Implemented complex projects dealing with the considerable datasize (PB).
Optimization techniques (performance, scalability, monitoring, etc.)
Experience with integration of datafrom multiple data sources
Experience with NoSQL databases, such as HBase, Cassandra, MongoDB, etc.,
Knowledge of various ETL techniques and frameworks, such as Flume
Experience with various messaging systems, such as Kafka or RabbitMQ
Good understanding of Lambda Architecture, along with its advantages and drawbacks
Creation of DAGs for dataengineering
Expert at Python /Scala programming, especially for dataengineering/ ETL purposes

Responsibilities:

Must be able to write quality code and build secure, highly available systems.
Assemble large, complex datasets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing datadelivery, re-designing infrastructure for greater scalability, etc with the guidance.
Create datatools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
Monitoring performance and advising any necessary infrastructure changes.
Defining dataretention policies.
Implementing the ETL process and optimal data pipeline architecture
Build analytics tools that utilize the datapipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
Create design documents that describe the functionality, capacity, architecture, and process.
Develop, test, and implement datasolutions based on finalized design documents.
Work with dataand analytics experts to strive for greater functionality in our data
Proactively identify potential production issues and recommend and implement solutions

Skillsets:

Good understanding of optimal extraction, transformation, and loading of datafrom a wide variety of data sources using SQL and AWS ‘big data’ technologies.
Proficient understanding of distributed computing principles
Experience in working with batch processing/ real-time systems using various open-source technologies like NoSQL, Spark, Pig, Hive, Apache Airflow.
Implemented complex projects dealing with the considerable datasize (PB).
Optimization techniques (performance, scalability, monitoring, etc.)
Experience with integration of datafrom multiple data sources
Experience with NoSQL databases, such as HBase, Cassandra, MongoDB, etc.,
Knowledge of various ETL techniques and frameworks, such as Flume
Experience with various messaging systems, such as Kafka or RabbitMQ
Good understanding of Lambda Architecture, along with its advantages and drawbacks
Creation of DAGs for dataengineering
Expert at Python /Scala programming, especially for dataengineering/ ETL purposes

Team Lead / Development Manager

Client of People First Consultants

Agency job

via People First Consultants by Jayaraj E

Chennai

5 - 13 yrs

₹12L - ₹18L / yr

Tech Lead

Engineering Manager

Engineering Management

Engineering Head

Technical Manager

+17 more

Team Lead and Development Manager

Years of Experience : 6 to 12 Years

No of Position: 6

Location : Chennai

Strong programming skills in one or more languages - Java, C++, JavaScript, Objective-C, Swift, Kotlin

Experience with Databases like SQL, Oracle, MongoDB, Cassandra

Experience with in-memory Databases like Redis, Memcache

Exposure to IPC Mechanisms, Messages Queues, Kafka

Exposure to frameworks like Spring, Angular, ReactJS

Good Team Management skills

Strong debugging skills

Strong written and verbal communication skills

Experience in handling client communication

Good understanding of SDLC processes and Agile Methods

Exposure to Banking and Capital Market domain.

Understanding of 2-Tier/3-Tier System Architecture, Infrastructure, Sizing

Understanding of network programming and protocols.

Experience with Mobile Technologies

Experience with Planning and Implementation

Experience with Microsoft Office Tools, JIRA

Knowledge of TCS Bancs, Refinitiv, 63 Moons platforms will be an added advantage.

CTC : As per Industry Standards

Team Lead and Development Manager

Years of Experience : 6 to 12 Years

No of Position: 6

Location : Chennai

Strong programming skills in one or more languages - Java, C++, JavaScript, Objective-C, Swift, Kotlin

Experience with Databases like SQL, Oracle, MongoDB, Cassandra

Experience with in-memory Databases like Redis, Memcache

Exposure to IPC Mechanisms, Messages Queues, Kafka

Exposure to frameworks like Spring, Angular, ReactJS

Good Team Management skills

Strong debugging skills

Strong written and verbal communication skills

Experience in handling client communication

Good understanding of SDLC processes and Agile Methods

Exposure to Banking and Capital Market domain.

Understanding of 2-Tier/3-Tier System Architecture, Infrastructure, Sizing

Understanding of network programming and protocols.

Experience with Mobile Technologies

Experience with Planning and Implementation

Experience with Microsoft Office Tools, JIRA

Knowledge of TCS Bancs, Refinitiv, 63 Moons platforms will be an added advantage.

CTC : As per Industry Standards

Big Data Architect

Agilisium

Agency job

via Recruiting India by Moumita Santra

Chennai

10 - 19 yrs

₹12L - ₹40L / yr

Big Data

Apache Spark

Spark

PySpark

ETL

+1 more

Job Sector: IT, Software

Job Type: Permanent

Location: Chennai

Experience: 10 - 20 Years

Salary: 12 – 40 LPA

Education: Any Graduate

Notice Period: Immediate

Key Skills: Python, Spark, AWS, SQL, PySpark

Contact at triple eight two zero nine four two double seven

Job Description:

Requirements

Minimum 12 years experience
In depth understanding and knowledge on distributed computing with spark.
Deep understanding of Spark Architecture and internals
Proven experience in data ingestion, data integration and data analytics with spark, preferably PySpark.
Expertise in ETL processes, data warehousing and data lakes.
Hands on with python for Big data and analytics.
Hands on in agile scrum model is an added advantage.
Knowledge on CI/CD and orchestration tools is desirable.
AWS S3, Redshift, Lambda knowledge is preferred

Thanks

Job Sector: IT, Software

Job Type: Permanent

Location: Chennai

Experience: 10 - 20 Years

Salary: 12 – 40 LPA

Education: Any Graduate

Notice Period: Immediate

Key Skills: Python, Spark, AWS, SQL, PySpark

Contact at triple eight two zero nine four two double seven

Job Description:

Requirements

Minimum 12 years experience
In depth understanding and knowledge on distributed computing with spark.
Deep understanding of Spark Architecture and internals
Proven experience in data ingestion, data integration and data analytics with spark, preferably PySpark.
Expertise in ETL processes, data warehousing and data lakes.
Hands on with python for Big data and analytics.
Hands on in agile scrum model is an added advantage.
Knowledge on CI/CD and orchestration tools is desirable.
AWS S3, Redshift, Lambda knowledge is preferred

Thanks

Senior consultant

An IT Services Major, hiring for a leading insurance player.

Agency job

via Indventur Partner by Vanshika kaur

Chennai

3 - 5 yrs

₹5L - ₹10L / yr

Big Data

Hadoop

Apache Kafka

Apache Hive

Microsoft Windows Azure

+1 more

Client An IT Services Major, hiring for a leading insurance player.

Position: SENIOR CONSULTANT

Job Description:

Azure admin- senior consultant with HD Insights(Big data)

Skills and Experience

Microsoft Azure Administrator certification
Bigdata project experience in Azure HDInsight Stack. big data processing frameworks such as Spark, Hadoop, Hive, Kafka or Hbase.
Preferred: Insurance or BFSI domain experience
5 to 5 years of experience is required.

Client An IT Services Major, hiring for a leading insurance player.

Position: SENIOR CONSULTANT

Job Description:

Azure admin- senior consultant with HD Insights(Big data)

Skills and Experience

Microsoft Azure Administrator certification
Bigdata project experience in Azure HDInsight Stack. big data processing frameworks such as Spark, Hadoop, Hive, Kafka or Hbase.
Preferred: Insurance or BFSI domain experience
5 to 5 years of experience is required.

Big Data Engineer

at netmedscom

3 recruiters

Posted by Vijay Hemnath

Chennai

2 - 5 yrs

₹6L - ₹25L / yr

Big Data

Hadoop

Apache Hive

Scala

Spark

+12 more

We are looking for an outstanding Big Data Engineer with experience setting up and maintaining Data Warehouse and Data Lakes for an Organization. This role would closely collaborate with the Data Science team and assist the team build and deploy machine learning and deep learning models on big data analytics platforms.

Roles and Responsibilities:

Develop and maintain scalable data pipelines and build out new integrations and processes required for optimal extraction, transformation, and loading of data from a wide variety of data sources using 'Big Data' technologies.
Develop programs in Scala and Python as part of data cleaning and processing.
Assemble large, complex data sets that meet functional / non-functional business requirements and fostering data-driven decision making across the organization.
Responsible to design and develop distributed, high volume, high velocity multi-threaded event processing systems.
Implement processes and systems to validate data, monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
Provide high operational excellence guaranteeing high availability and platform stability.
Closely collaborate with the Data Science team and assist the team build and deploy machine learning and deep learning models on big data analytics platforms.

Skills:

Experience with Big Data pipeline, Big Data analytics, Data warehousing.
Experience with SQL/No-SQL, schema design and dimensional data modeling.
Strong understanding of Hadoop Architecture, HDFS ecosystem and eexperience with Big Data technology stack such as HBase, Hadoop, Hive, MapReduce.
Experience in designing systems that process structured as well as unstructured data at large scale.
Experience in AWS/Spark/Java/Scala/Python development.
Should have Strong skills in PySpark (Python & SPARK). Ability to create, manage and manipulate Spark Dataframes. Expertise in Spark query tuning and performance optimization.
Experience in developing efficient software code/frameworks for multiple use cases leveraging Python and big data technologies.
Prior exposure to streaming data sources such as Kafka.
Should have knowledge on Shell Scripting and Python scripting.
High proficiency in database skills (e.g., Complex SQL), for data preparation, cleaning, and data wrangling/munging, with the ability to write advanced queries and create stored procedures.
Experience with NoSQL databases such as Cassandra / MongoDB.
Solid experience in all phases of Software Development Lifecycle - plan, design, develop, test, release, maintain and support, decommission.
Experience with DevOps tools (GitHub, Travis CI, and JIRA) and methodologies (Lean, Agile, Scrum, Test Driven Development).
Experience building and deploying applications on on-premise and cloud-based infrastructure.
Having a good understanding of machine learning landscape and concepts.

Qualifications and Experience:

Engineering and post graduate candidates, preferably in Computer Science, from premier institutions with proven work experience as a Big Data Engineer or a similar role for 3-5 years.

Certifications:

Good to have at least one of the Certifications listed here:

AZ 900 - Azure Fundamentals

DP 200, DP 201, DP 203, AZ 204 - Data Engineering

AZ 400 - Devops Certification

Roles and Responsibilities:

Develop and maintain scalable data pipelines and build out new integrations and processes required for optimal extraction, transformation, and loading of data from a wide variety of data sources using 'Big Data' technologies.
Develop programs in Scala and Python as part of data cleaning and processing.
Assemble large, complex data sets that meet functional / non-functional business requirements and fostering data-driven decision making across the organization.
Responsible to design and develop distributed, high volume, high velocity multi-threaded event processing systems.
Implement processes and systems to validate data, monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
Provide high operational excellence guaranteeing high availability and platform stability.
Closely collaborate with the Data Science team and assist the team build and deploy machine learning and deep learning models on big data analytics platforms.

Skills:

Experience with Big Data pipeline, Big Data analytics, Data warehousing.
Experience with SQL/No-SQL, schema design and dimensional data modeling.
Strong understanding of Hadoop Architecture, HDFS ecosystem and eexperience with Big Data technology stack such as HBase, Hadoop, Hive, MapReduce.
Experience in designing systems that process structured as well as unstructured data at large scale.
Experience in AWS/Spark/Java/Scala/Python development.
Should have Strong skills in PySpark (Python & SPARK). Ability to create, manage and manipulate Spark Dataframes. Expertise in Spark query tuning and performance optimization.
Experience in developing efficient software code/frameworks for multiple use cases leveraging Python and big data technologies.
Prior exposure to streaming data sources such as Kafka.
Should have knowledge on Shell Scripting and Python scripting.
High proficiency in database skills (e.g., Complex SQL), for data preparation, cleaning, and data wrangling/munging, with the ability to write advanced queries and create stored procedures.
Experience with NoSQL databases such as Cassandra / MongoDB.
Solid experience in all phases of Software Development Lifecycle - plan, design, develop, test, release, maintain and support, decommission.
Experience with DevOps tools (GitHub, Travis CI, and JIRA) and methodologies (Lean, Agile, Scrum, Test Driven Development).
Experience building and deploying applications on on-premise and cloud-based infrastructure.
Having a good understanding of machine learning landscape and concepts.

Qualifications and Experience:

Engineering and post graduate candidates, preferably in Computer Science, from premier institutions with proven work experience as a Big Data Engineer or a similar role for 3-5 years.

Certifications:

Good to have at least one of the Certifications listed here:

AZ 900 - Azure Fundamentals

DP 200, DP 201, DP 203, AZ 204 - Data Engineering

AZ 400 - Devops Certification

Machine Learning Architect - Deployments

at netmedscom

3 recruiters

Posted by Vijay Hemnath

Chennai

5 - 10 yrs

₹10L - ₹30L / yr

Machine Learning (ML)

Software deployment

CI/CD

Cloud Computing

Snow flake schema

+19 more

We are looking for an outstanding ML Architect (Deployments) with expertise in deploying Machine Learning solutions/models into production and scaling them to serve millions of customers. A candidate with an adaptable and productive working style which fits in a fast-moving environment.

Skills:

- 5+ years deploying Machine Learning pipelines in large enterprise production systems.

- Experience developing end to end ML solutions from business hypothesis to deployment / understanding the entirety of the ML development life cycle.
- Expert in modern software development practices; solid experience using source control management (CI/CD).
- Proficient in designing relevant architecture / microservices to fulfil application integration, model monitoring, training / re-training, model management, model deployment, model experimentation/development, alert mechanisms.
- Experience with public cloud platforms (Azure, AWS, GCP).
- Serverless services like lambda, azure functions, and/or cloud functions.
- Orchestration services like data factory, data pipeline, and/or data flow.
- Data science workbench/managed services like azure machine learning, sagemaker, and/or AI platform.
- Data warehouse services like snowflake, redshift, bigquery, azure sql dw, AWS Redshift.
- Distributed computing services like Pyspark, EMR, Databricks.
- Data storage services like cloud storage, S3, blob, S3 Glacier.
- Data visualization tools like Power BI, Tableau, Quicksight, and/or Qlik.
- Proven experience serving up predictive algorithms and analytics through batch and real-time APIs.
- Solid working experience with software engineers, data scientists, product owners, business analysts, project managers, and business stakeholders to design the holistic solution.
- Strong technical acumen around automated testing.
- Extensive background in statistical analysis and modeling (distributions, hypothesis testing, probability theory, etc.)
- Strong hands-on experience with statistical packages and ML libraries (e.g., Python scikit learn, Spark MLlib, etc.)
- Experience in effective data exploration and visualization (e.g., Excel, Power BI, Tableau, Qlik, etc.)
- Experience in developing and debugging in one or more of the languages Java, Python.
- Ability to work in cross functional teams.
- Apply Machine Learning techniques in production including, but not limited to, neuralnets, regression, decision trees, random forests, ensembles, SVM, Bayesian models, K-Means, etc.

Roles and Responsibilities:

Deploying ML models into production, and scaling them to serve millions of customers.

Technical solutioning skills with deep understanding of technical API integrations, AI / Data Science, BigData and public cloud architectures / deployments in a SaaS environment.

Strong stakeholder relationship management skills - able to influence and manage the expectations of senior executives.
Strong networking skills with the ability to build and maintain strong relationships with both business, operations and technology teams internally and externally.

Provide software design and programming support to projects.

Qualifications & Experience:

Engineering and post graduate candidates, preferably in Computer Science, from premier institutions with proven work experience as a Machine Learning Architect (Deployments) or a similar role for 5-7 years.

Skills:

- 5+ years deploying Machine Learning pipelines in large enterprise production systems.

Roles and Responsibilities:

Deploying ML models into production, and scaling them to serve millions of customers.

Technical solutioning skills with deep understanding of technical API integrations, AI / Data Science, BigData and public cloud architectures / deployments in a SaaS environment.

Provide software design and programming support to projects.

Qualifications & Experience:

Data Scientist

at Kaleidofin

3 recruiters

Posted by Poornima B

Chennai, Bengaluru (Bangalore)

2 - 4 yrs

Best in industry

Machine Learning (ML)

Python

SQL

Customer Acquisition

Big Data

+2 more

Responsibility

Partnering with internal business owners (product, marketing, edit, etc.) to understand needs and develop custom analysis to optimize for user engagement and retention
Good understanding of the underlying business and workings of cross functional teams for successful execution
Design and develop analyses based on business requirement needs and challenges.
Leveraging statistical analysis on consumer research and data mining projects, including segmentation, clustering, factor analysis, multivariate regression, predictive modeling, etc.
Providing statistical analysis on custom research projects and consult on A/B testing and other statistical analysis as needed. Other reports and custom analysis as required.
Identify and use appropriate investigative and analytical technologies to interpret and verify results.
Apply and learn a wide variety of tools and languages to achieve results
Use best practices to develop statistical and/ or machine learning techniques to build models that address business needs.

Requirements

2 - 4 years of relevant experience in Data science.
Preferred education: Bachelor's degree in a technical field or equivalent experience.
Experience in advanced analytics, model building, statistical modeling, optimization, and machine learning algorithms.
Machine Learning Algorithms: Crystal clear understanding, coding, implementation, error analysis, model tuning knowledge on Linear Regression, Logistic Regression, SVM, shallow Neural Networks, clustering, Decision Trees, Random forest, XGBoost, Recommender Systems, ARIMA and Anomaly Detection. Feature selection, hyper parameters tuning, model selection and error analysis, boosting and ensemble methods.
Strong with programming languages like Python and data processing using SQL or equivalent and ability to experiment with newer open source tools.
Experience in normalizing data to ensure it is homogeneous and consistently formatted to enable sorting, query and analysis.
Experience designing, developing, implementing and maintaining a database and programs to manage data analysis efforts.
Experience with big data and cloud computing viz. Spark, Hadoop (MapReduce, PIG, HIVE).
Experience in risk and credit score domains preferred.

Responsibility

Partnering with internal business owners (product, marketing, edit, etc.) to understand needs and develop custom analysis to optimize for user engagement and retention
Good understanding of the underlying business and workings of cross functional teams for successful execution
Design and develop analyses based on business requirement needs and challenges.
Leveraging statistical analysis on consumer research and data mining projects, including segmentation, clustering, factor analysis, multivariate regression, predictive modeling, etc.
Providing statistical analysis on custom research projects and consult on A/B testing and other statistical analysis as needed. Other reports and custom analysis as required.
Identify and use appropriate investigative and analytical technologies to interpret and verify results.
Apply and learn a wide variety of tools and languages to achieve results
Use best practices to develop statistical and/ or machine learning techniques to build models that address business needs.

Requirements

2 - 4 years of relevant experience in Data science.
Preferred education: Bachelor's degree in a technical field or equivalent experience.
Experience in advanced analytics, model building, statistical modeling, optimization, and machine learning algorithms.
Machine Learning Algorithms: Crystal clear understanding, coding, implementation, error analysis, model tuning knowledge on Linear Regression, Logistic Regression, SVM, shallow Neural Networks, clustering, Decision Trees, Random forest, XGBoost, Recommender Systems, ARIMA and Anomaly Detection. Feature selection, hyper parameters tuning, model selection and error analysis, boosting and ensemble methods.
Strong with programming languages like Python and data processing using SQL or equivalent and ability to experiment with newer open source tools.
Experience in normalizing data to ensure it is homogeneous and consistently formatted to enable sorting, query and analysis.
Experience designing, developing, implementing and maintaining a database and programs to manage data analysis efforts.
Experience with big data and cloud computing viz. Spark, Hadoop (MapReduce, PIG, HIVE).
Experience in risk and credit score domains preferred.

Data Engineer

at Bungee Tech India

Posted by Abigail David

Remote, NCR (Delhi | Gurgaon | Noida), Chennai

5 - 10 yrs

₹10L - ₹30L / yr

Big Data

Hadoop

Apache Hive

Spark

ETL

+3 more

Company Description

At Bungee Tech, we help retailers and brands meet customers everywhere and, on every occasion, they are in. We believe that accurate, high-quality data matched with compelling market insights empowers retailers and brands to keep their customers at the center of all innovation and value they are delivering.

We provide a clear and complete omnichannel picture of their competitive landscape to retailers and brands. We collect billions of data points every day and multiple times in a day from publicly available sources. Using high-quality extraction, we uncover detailed information on products or services, which we automatically match, and then proactively track for price, promotion, and availability. Plus, anything we do not match helps to identify a new assortment opportunity.

Empowered with this unrivalled intelligence, we unlock compelling analytics and insights that once blended with verified partner data from trusted sources such as Nielsen, paints a complete, consolidated picture of the competitive landscape.

We are looking for a Big Data Engineer who will work on the collecting, storing, processing, and analyzing of huge sets of data. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them.

You will also be responsible for integrating them with the architecture used in the company.

We're working on the future. If you are seeking an environment where you can drive innovation, If you want to apply state-of-the-art software technologies to solve real world problems, If you want the satisfaction of providing visible benefit to end-users in an iterative fast paced environment, this is your opportunity.

Responsibilities

As an experienced member of the team, in this role, you will:

Contribute to evolving the technical direction of analytical Systems and play a critical role their design and development

You will research, design and code, troubleshoot and support. What you create is also what you own.

Develop the next generation of automation tools for monitoring and measuring data quality, with associated user interfaces.

Be able to broaden your technical skills and work in an environment that thrives on creativity, efficient execution, and product innovation.

BASIC QUALIFICATIONS

Bachelor’s degree or higher in an analytical area such as Computer Science, Physics, Mathematics, Statistics, Engineering or similar.
5+ years relevant professional experience in Data Engineering and Business Intelligence
5+ years in with Advanced SQL (analytical functions), ETL, Data Warehousing.
Strong knowledge of data warehousing concepts, including data warehouse technical architectures, infrastructure components, ETL/ ELT and reporting/analytic tools and environments, data structures, data modeling and performance tuning.
Ability to effectively communicate with both business and technical teams.
Excellent coding skills in Java, Python, C++, or equivalent object-oriented programming language
Understanding of relational and non-relational databases and basic SQL
Proficiency with at least one of these scripting languages: Perl / Python / Ruby / shell script

PREFERRED QUALIFICATIONS

Experience with building data pipelines from application databases.
Experience with AWS services - S3, Redshift, Spectrum, EMR, Glue, Athena, ELK etc.
Experience working with Data Lakes.
Experience providing technical leadership and mentor other engineers for the best practices on the data engineering space
Sharp problem solving skills and ability to resolve ambiguous requirements
Experience on working with Big Data
Knowledge and experience on working with Hive and the Hadoop ecosystem
Knowledge of Spark
Experience working with Data Science teams

Company Description

You will also be responsible for integrating them with the architecture used in the company.

Responsibilities

As an experienced member of the team, in this role, you will:

Contribute to evolving the technical direction of analytical Systems and play a critical role their design and development

You will research, design and code, troubleshoot and support. What you create is also what you own.

Develop the next generation of automation tools for monitoring and measuring data quality, with associated user interfaces.

Be able to broaden your technical skills and work in an environment that thrives on creativity, efficient execution, and product innovation.

BASIC QUALIFICATIONS

Bachelor’s degree or higher in an analytical area such as Computer Science, Physics, Mathematics, Statistics, Engineering or similar.
5+ years relevant professional experience in Data Engineering and Business Intelligence
5+ years in with Advanced SQL (analytical functions), ETL, Data Warehousing.
Strong knowledge of data warehousing concepts, including data warehouse technical architectures, infrastructure components, ETL/ ELT and reporting/analytic tools and environments, data structures, data modeling and performance tuning.
Ability to effectively communicate with both business and technical teams.
Excellent coding skills in Java, Python, C++, or equivalent object-oriented programming language
Understanding of relational and non-relational databases and basic SQL
Proficiency with at least one of these scripting languages: Perl / Python / Ruby / shell script

PREFERRED QUALIFICATIONS

Experience with building data pipelines from application databases.
Experience with AWS services - S3, Redshift, Spectrum, EMR, Glue, Athena, ELK etc.
Experience working with Data Lakes.
Experience providing technical leadership and mentor other engineers for the best practices on the data engineering space
Sharp problem solving skills and ability to resolve ambiguous requirements
Experience on working with Big Data
Knowledge and experience on working with Hive and the Hadoop ecosystem
Knowledge of Spark
Experience working with Data Science teams

Data Architect

For a leading manufacturing company

Agency job

via People First Consultants by Jayaraj E

Chennai

5 - 8 yrs

₹6L - ₹7L / yr

Relational Database (RDBMS)

NOSQL Databases

MySQL

MS SQLServer

SQL server

+13 more

Database Architect

5 - 6 Years

Good Knowledge in Relation and Non-Relational Database

To write Complex Queries and Identify problematic queries and provide a Solution

Good Hands on database tools

Experience in Both SQL and NON SQL Database like SQL Server, PostgreSQL, Mango DB, Maria DB. Etc.

Worked on Data Model Preparation & Structuring Database etc.

Database Architect

5 - 6 Years

Good Knowledge in Relation and Non-Relational Database

To write Complex Queries and Identify problematic queries and provide a Solution

Good Hands on database tools

Experience in Both SQL and NON SQL Database like SQL Server, PostgreSQL, Mango DB, Maria DB. Etc.

Worked on Data Model Preparation & Structuring Database etc.

Big Data Engineer

at YourHRfolks

6 recruiters

Posted by Bharat Saxena

Remote, Jaipur, NCR (Delhi | Gurgaon | Noida), Chennai, Bangarmau

5 - 10 yrs

₹15L - ₹30L / yr

Big Data

Hadoop

Spark

Apache Kafka

Amazon Web Services (AWS)

+2 more

Position: Big Data Engineer

What You'll Do

Punchh is seeking to hire Big Data Engineer at either a senior or tech lead level. Reporting to the Director of Big Data, he/she will play a critical role in leading Punchh’s big data innovations. By leveraging prior industrial experience in big data, he/she will help create cutting-edge data and analytics products for Punchh’s business partners.

This role requires close collaborations with data, engineering, and product organizations. His/her job functions include

Work with large data sets and implement sophisticated data pipelines with both structured and structured data.
Collaborate with stakeholders to design scalable solutions.
Manage and optimize our internal data pipeline that supports marketing, customer success and data science to name a few.
A technical leader of Punchh’s big data platform that supports AI and BI products.
Work with infra and operations team to monitor and optimize existing infrastructure
Occasional business travels are required.

What You'll Need

5+ years of experience as a Big Data engineering professional, developing scalable big data solutions.
Advanced degree in computer science, engineering or other related fields.
Demonstrated strength in data modeling, data warehousing and SQL.
Extensive knowledge with cloud technologies, e.g. AWS and Azure.
Excellent software engineering background. High familiarity with software development life cycle. Familiarity with GitHub/Airflow.
Advanced knowledge of big data technologies, such as programming language (Python, Java), relational (Postgres, mysql), NoSQL (Mongodb), Hadoop (EMR) and streaming (Kafka, Spark).
Strong problem solving skills with demonstrated rigor in building and maintaining a complex data pipeline.
Exceptional communication skills and ability to articulate a complex concept with thoughtful, actionable recommendations.

Position: Big Data Engineer

What You'll Do

This role requires close collaborations with data, engineering, and product organizations. His/her job functions include

Work with large data sets and implement sophisticated data pipelines with both structured and structured data.
Collaborate with stakeholders to design scalable solutions.
Manage and optimize our internal data pipeline that supports marketing, customer success and data science to name a few.
A technical leader of Punchh’s big data platform that supports AI and BI products.
Work with infra and operations team to monitor and optimize existing infrastructure
Occasional business travels are required.

What You'll Need

5+ years of experience as a Big Data engineering professional, developing scalable big data solutions.
Advanced degree in computer science, engineering or other related fields.
Demonstrated strength in data modeling, data warehousing and SQL.
Extensive knowledge with cloud technologies, e.g. AWS and Azure.
Excellent software engineering background. High familiarity with software development life cycle. Familiarity with GitHub/Airflow.
Advanced knowledge of big data technologies, such as programming language (Python, Java), relational (Postgres, mysql), NoSQL (Mongodb), Hadoop (EMR) and streaming (Kafka, Spark).
Strong problem solving skills with demonstrated rigor in building and maintaining a complex data pipeline.
Exceptional communication skills and ability to articulate a complex concept with thoughtful, actionable recommendations.

Big Data Developer

at Maveric Systems

3 recruiters

Posted by Rashmi Poovaiah

Bengaluru (Bangalore), Chennai, Pune

4 - 10 yrs

₹8L - ₹15L / yr

Big Data

Hadoop

Spark

Apache Kafka

HiveQL

+2 more

Role Summary/Purpose:

We are looking for a Developer/Senior Developers to be a part of building advanced analytical platform leveraging Big Data technologies and transform the legacy systems. This role is an exciting, fast-paced, constantly changing and challenging work environment, and will play an important role in resolving and influencing high-level decisions.

Requirements:

The candidate must be a self-starter, who can work under general guidelines in a fast-spaced environment.
Overall minimum of 4 to 8 year of software development experience and 2 years in Data Warehousing domain knowledge
Must have 3 years of hands-on working knowledge on Big Data technologies such as Hadoop, Hive, Hbase, Spark, Kafka, Spark Streaming, SCALA etc…
Excellent knowledge in SQL & Linux Shell scripting
Bachelors/Master’s/Engineering Degree from a well-reputed university.
Strong communication, Interpersonal, Learning and organizing skills matched with the ability to manage stress, Time, and People effectively
Proven experience in co-ordination of many dependencies and multiple demanding stakeholders in a complex, large-scale deployment environment
Ability to manage a diverse and challenging stakeholder community
Diverse knowledge and experience of working on Agile Deliveries and Scrum teams.

Responsibilities

Should works as a senior developer/individual contributor based on situations
Should be part of SCRUM discussions and to take requirements
Adhere to SCRUM timeline and deliver accordingly
Participate in a team environment for the design, development and implementation
Should take L3 activities on need basis
Prepare Unit/SIT/UAT testcase and log the results
Co-ordinate SIT and UAT Testing. Take feedbacks and provide necessary remediation/recommendation in time.
Quality delivery and automation should be a top priority
Co-ordinate change and deployment in time
Should create healthy harmony within the team
Owns interaction points with members of core team (e.g.BA team, Testing and business team) and any other relevant stakeholders

Requirements:

The candidate must be a self-starter, who can work under general guidelines in a fast-spaced environment.
Overall minimum of 4 to 8 year of software development experience and 2 years in Data Warehousing domain knowledge
Must have 3 years of hands-on working knowledge on Big Data technologies such as Hadoop, Hive, Hbase, Spark, Kafka, Spark Streaming, SCALA etc…
Excellent knowledge in SQL & Linux Shell scripting
Bachelors/Master’s/Engineering Degree from a well-reputed university.
Strong communication, Interpersonal, Learning and organizing skills matched with the ability to manage stress, Time, and People effectively
Proven experience in co-ordination of many dependencies and multiple demanding stakeholders in a complex, large-scale deployment environment
Ability to manage a diverse and challenging stakeholder community
Diverse knowledge and experience of working on Agile Deliveries and Scrum teams.

Responsibilities

Should works as a senior developer/individual contributor based on situations
Should be part of SCRUM discussions and to take requirements
Adhere to SCRUM timeline and deliver accordingly
Participate in a team environment for the design, development and implementation
Should take L3 activities on need basis
Prepare Unit/SIT/UAT testcase and log the results
Co-ordinate SIT and UAT Testing. Take feedbacks and provide necessary remediation/recommendation in time.
Quality delivery and automation should be a top priority
Co-ordinate change and deployment in time
Should create healthy harmony within the team
Owns interaction points with members of core team (e.g.BA team, Testing and business team) and any other relevant stakeholders

Data Engineer

at Mobile Programming LLC

1 video

34 recruiters

Posted by vandana chauhan

Remote, Chennai

3 - 7 yrs

₹12L - ₹18L / yr

Big Data

Amazon Web Services (AWS)

Hadoop

SQL

Python

+5 more

Position: Data Engineer
Location: Chennai- Guindy Industrial Estate
Duration: Full time role
Company: Mobile Programming (https://www.mobileprogramming.com/" target="_blank">https://www.mobileprogramming.com/)
Client Name: Samsung

We are looking for a Data Engineer to join our growing team of analytics experts. The hire will be
responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing
data flow and collection for cross functional teams. The ideal candidate is an experienced data pipeline
builder and data wrangler who enjoy optimizing data systems and building them from the ground up.
The Data Engineer will support our software developers, database architects, data analysts and data
scientists on data initiatives and will ensure optimal data delivery architecture is consistent throughout
ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple
teams, systems and products.

Responsibilities for Data Engineer
 Create and maintain optimal data pipeline architecture,
 Assemble large, complex data sets that meet functional / non-functional business requirements.
 Identify, design, and implement internal process improvements: automating manual processes,
optimizing data delivery, re-designing infrastructure for greater scalability, etc.
 Build the infrastructure required for optimal extraction, transformation, and loading of data
from a wide variety of data sources using SQL and AWS big data technologies.
 Build analytics tools that utilize the data pipeline to provide actionable insights into customer
acquisition, operational efficiency and other key business performance metrics.
 Work with stakeholders including the Executive, Product, Data and Design teams to assist with
data-related technical issues and support their data infrastructure needs.
 Create data tools for analytics and data scientist team members that assist them in building and
optimizing our product into an innovative industry leader.
 Work with data and analytics experts to strive for greater functionality in our data systems.

Qualifications for Data Engineer
 Experience building and optimizing big data ETL pipelines, architectures and data sets.
 Advanced working SQL knowledge and experience working with relational databases, query
authoring (SQL) as well as working familiarity with a variety of databases.
 Experience performing root cause analysis on internal and external data and processes to
answer specific business questions and identify opportunities for improvement.
 Strong analytic skills related to working with unstructured datasets.
 Build processes supporting data transformation, data structures, metadata, dependency and
workload management.
 A successful history of manipulating, processing and extracting value from large disconnected
datasets.

 Working knowledge of message queuing, stream processing and highly scalable ‘big data’ data
stores.
 Strong project management and organizational skills.
 Experience supporting and working with cross-functional teams in a dynamic environment.

We are looking for a candidate with 3-6 years of experience in a Data Engineer role, who has
attained a Graduate degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field. They should also have experience using the following software/tools:
 Experience with big data tools: Spark, Kafka, HBase, Hive etc.
 Experience with relational SQL and NoSQL databases
 Experience with AWS cloud services: EC2, EMR, RDS, Redshift
 Experience with stream-processing systems: Storm, Spark-Streaming, etc.
 Experience with object-oriented/object function scripting languages: Python, Java, Scala, etc.

Skills: Big Data, AWS, Hive, Spark, Python, SQL

Senior Java Software Engineer

Retail Marketing

Agency job

via Abbioc by Abbiocs HR

Chennai

4 - 9 yrs

₹1L - ₹12L / yr

Java

Data Structures

Algorithms

C++

Apache Kafka

+10 more

Requires a bachelor's degree in area of specialty and experience in the field or in a related area. Familiar with standard concepts, practices, and procedures within a particular field. Relies on experience and judgment to plan and accomplish goals. Performs a variety of tasks. A degree of creativity and latitude is required. Typically reports to a supervisor or manager.

Designs, develops, and implements web-based Java applications to support business requirements. Follows approved life cycle methodologies, creates design documents, and performs program coding and testing. Resolves technical issues through debugging, research, and investigation.

Additional Job Details:

Strong in Java, Spring, Spring Boot, REST and developing MicroServices.

Knowledge or experience , Cassandra preferred

Knowledge or experience on Kafka

Good to have but not must

Good to know:

Reporting tools like Splunk/Grafana

Protobuf

Python/Ruby

Additional Job Details:

Strong in Java, Spring, Spring Boot, REST and developing MicroServices.

Knowledge or experience , Cassandra preferred

Knowledge or experience on Kafka

Good to have but not must

Good to know:

Reporting tools like Splunk/Grafana

Protobuf

Python/Ruby

Lead Data Engineer

at Lymbyc

1 video

2 recruiters

Posted by Venky Thiriveedhi

Bengaluru (Bangalore), Chennai

4 - 8 yrs

₹9L - ₹14L / yr

Apache Spark

Apache Kafka

Druid Database

Big Data

Apache Sqoop

+5 more

Key skill set : Apache NiFi, Kafka Connect (Confluent), Sqoop, Kylo, Spark, Druid, Presto, RESTful services, Lambda / Kappa architectures Responsibilities : - Build a scalable, reliable, operable and performant big data platform for both streaming and batch analytics - Design and implement data aggregation, cleansing and transformation layers Skills : - Around 4+ years of hands-on experience designing and operating large data platforms - Experience in Big data Ingestion, Transformation and stream/batch processing technologies using Apache NiFi, Apache Kafka, Kafka Connect (Confluent), Sqoop, Spark, Storm, Hive etc; - Experience in designing and building streaming data platforms in Lambda, Kappa architectures - Should have working experience in one of NoSQL, OLAP data stores like Druid, Cassandra, Elasticsearch, Pinot etc; - Experience in one of data warehousing tools like RedShift, BigQuery, Azure SQL Data Warehouse - Exposure to other Data Ingestion, Data Lake and querying frameworks like Marmaray, Kylo, Drill, Presto - Experience in designing and consuming microservices - Exposure to security and governance tools like Apache Ranger, Apache Atlas - Any contributions to open source projects a plus - Experience in performance benchmarks will be a plus

Assistant Manager - Analytics - Product Team

at LatentView Analytics

3 recruiters

Posted by Kannikanti madhuri

Chennai

5 - 8 yrs

₹5L - ₹8L / yr

Data Science

Analytics

Data Analytics

Data modeling

Data mining

+7 more

Job Overview :We are looking for an experienced Data Science professional to join our Product team and lead the data analytics team and manage the processes and people responsible for accurate data collection, processing, modelling and analysis. The ideal candidate has a knack for seeing solutions in sprawling data sets and the business mindset to convert insights into strategic opportunities for our clients. The incumbent will work closely with leaders across product, sales, and marketing to support and implement high-quality, data-driven decisions. They will ensure data accuracy and consistent reporting by designing and creating optimal processes and procedures for analytics employees to follow. They will use advanced data modelling, predictive modelling, natural language processing and analytical techniques to interpret key findings.Responsibilities for Analytics Manager :- Build, develop and maintain data models, reporting systems, data automation systems, dashboards and performance metrics support that support key business decisions.- Design and build technical processes to address business issues.- Manage and optimize processes for data intake, validation, mining and engineering as well as modelling, visualization and communication deliverables.- Examine, interpret and report results to stakeholders in leadership, technology, sales, marketing and product teams.- Develop and implement quality controls and standards to ensure quality standards- Anticipate future demands of initiatives related to people, technology, budget and business within your department and design/implement solutions to meet these needs.- Communicate results and business impacts of insight initiatives to stakeholders within and outside of the company.- Lead cross-functional projects using advanced data modelling and analysis techniques to discover insights that will guide strategic decisions and uncover optimization opportunities.Qualifications for Analytics Manager :- Working knowledge of data mining principles: predictive analytics, mapping, collecting data from multiple cloud-based data sources- Strong SQL skills, ability to perform effective querying- Understanding of and experience using analytical concepts and statistical techniques: hypothesis development, designing tests/experiments, analysing data, drawing conclusions, and developing actionable recommendations for business units.- Experience and knowledge of statistical modelling techniques: GLM multiple regression, logistic regression, log-linear regression, variable selection, etc.- Experience working with and creating databases and dashboards using all relevant data to inform decisions.- Strong problem solving, quantitative and analytical abilities.- Strong ability to plan and manage numerous processes, people and projects simultaneously.- Excellent communication, collaboration and delegation skills.- We- re looking for someone with at least 5 years of experience in a position monitoring, managing and drawing insights from data, and someone with at least 3 years of experience leading a team. The right candidate will also be proficient and experienced with the following tools/programs :- Strong programming skills with querying languages: R, Python etc.- Experience with big data tools like Hadoop- Experience with data visualization tools: Tableau, d3.js, etc.- Experience with Excel, Word, and PowerPoint.

System Admin Server Management

at Energyly

2 recruiters

Posted by Dayal Nathan

Chennai

1 - 3 yrs

₹2L - ₹7L / yr

Linux/Unix

System Administration

Docker

Oracle Management Server

Cloud Computing

+4 more

Managing cloud deployments & containers using docker Managing Linux - upgrades, application deployments, backup/restores, system tuning & performance Managing & monitoring web servers, queuing systems, big data stream processing, databases like cassandra etc., for security & performance Unix shell scripting

Big Data Developer

at Intelliswift Software

12 recruiters

Posted by Pratish Mishra

Chennai

4 - 8 yrs

₹8L - ₹17L / yr

Big Data

Spark

Scala

SQL

Greetings from Intelliswift! Intelliswift Software Inc. is a premier software solutions and Services Company headquartered in the Silicon Valley, with offices across the United States, India, and Singapore. The company has a proven track record of delivering results through its global delivery centers and flexible engagement models for over 450 brands ranging from Fortune 100 to growing companies. Intelliswift provides a variety of services including Enterprise Applications, Mobility, Big Data / BI, Staffing Services, and Cloud Solutions. Growing at an outstanding rate, it has been recognized as the second largest private IT Company in the East Bay. Domains: IT, Retail, Pharma, Healthcare, BFSI, and Internet & E-commerce website https://www.intelliswift.com/ Experience: 4-8 Years Job Location: Chennai Job Description: Skills: Spark, Scala, Big data, Hive · Strong Working experience in Spark, Scala, big data, h base and hive. · Should have good working experience in SQL and Spark SQL. · Good to have knowledge or experience in Teradata. · Familiar with General engineering Git, jenkins, sbt, maven.

Big Data Developer

at GeakMinds Technologies Pvt Ltd

3 recruiters

Posted by John Richardson

Chennai

1 - 5 yrs

₹1L - ₹6L / yr

Hadoop

Big Data

HDFS

Apache Sqoop

Apache Flume

+2 more

• Looking for Big Data Engineer with 3+ years of experience. • Hands-on experience with MapReduce-based platforms, like Pig, Spark, Shark. • Hands-on experience with data pipeline tools like Kafka, Storm, Spark Streaming. • Store and query data with Sqoop, Hive, MySQL, HBase, Cassandra, MongoDB, Drill, Phoenix, and Presto. • Hands-on experience in managing Big Data on a cluster with HDFS and MapReduce. • Handle streaming data in real time with Kafka, Flume, Spark Streaming, Flink, and Storm. • Experience with Azure cloud, Cognitive Services, Databricks is preferred.

Technical Architect/CTO

at auzmor

5 recruiters

Posted by Loga B

Chennai

3 - 10 yrs

₹10L - ₹30L / yr

Java

React.js

AngularJS (1.x)

Selenium Web driver

Hadoop

+3 more

Description Auzmor is US HQ’ed, funded SaaS startup focussed on disrupting the HR space. We combine passion, domain expertise and build products with focus on great end user experiences We are looking for Technical Architect to envision, build, launch and scale multiple SaaS products What You Will Do: • Understand the broader strategy, business goals, and engineering priorities of the company and how to incorporate them into your designs of systems, components, or features • Designing applications and architectures for multi-tenant SaaS software • Responsible for the selection and use of frameworks, platforms and design patterns for Cloud based multi-tenant SaaS based application • Collaborate with engineers, QA, product managers, UX designers, partners/vendors, and other architects to build scalable systems, services, and products for our diverse ecosystem of users across apps What you will need • Minimum of 5+ years of Hands on engineering experience in SaaS, Cloud services environments with architecture design and definition experience using Java/JEE, Struts, Spring, JMS & ORM (Hibernate, JPA) or other Server side technologies, frameworks. • Strong understanding of architecture patterns such as multi-tenancy, scalability, and federation, microservices(design, decomposition, and maintenance ) to build cloud-ready systems • Experience with server-side technologies (preferably Java or Go),frontend technologies (HTML/CSS, Native JS, React, Angular, etc.) and testing frameworks and automation (PHPUnit, Codeception, Behat, Selenium, webdriver, etc.) • Passion for quality and engineering excellence at scale What we would love to see • Exposure to Big data -related technologies such as Hadoop, Spark, Cassandra, Mapreduce or NoSQL, and data management, data retrieval , data quality , ETL, data analysis. • Familiarity with containerized deployments and cloud computing platforms (AWS, Azure, GCP)