Apache Flume Jobs in Chennai

11+ Apache Flume Jobs in Chennai | Apache Flume Job openings in Chennai

Apply to 11+ Apache Flume Jobs in Chennai on CutShort.io. Explore the latest Apache Flume Job opportunities across top companies like Google, Amazon & Adobe.

Big Data Developer

at GeakMinds Technologies Pvt Ltd

3 recruiters

Posted by John Richardson

Chennai

1 - 5 yrs

₹1L - ₹6L / yr

Hadoop

Big Data

HDFS

Apache Sqoop

Apache Flume

+2 more

• Looking for Big Data Engineer with 3+ years of experience. • Hands-on experience with MapReduce-based platforms, like Pig, Spark, Shark. • Hands-on experience with data pipeline tools like Kafka, Storm, Spark Streaming. • Store and query data with Sqoop, Hive, MySQL, HBase, Cassandra, MongoDB, Drill, Phoenix, and Presto. • Hands-on experience in managing Big Data on a cluster with HDFS and MapReduce. • Handle streaming data in real time with Kafka, Flume, Spark Streaming, Flink, and Storm. • Experience with Azure cloud, Cognitive Services, Databricks is preferred.

Data Science

at Leading Manufacturing Company

Agency job

via People First Consultants by Jayaraj E

Chennai

3 - 6 yrs

₹3L - ₹8L / yr

Machine Learning (ML)

Data Science

Natural Language Processing (NLP)

Data modeling

Data Analytics

+2 more

Location: Chennai
Education: BE/BTech
Experience: Minimum 3+ years of experience as a Data Scientist/Data Engineer

Domain knowledge: Data cleaning, modelling, analytics, statistics, machine learning, AI

Requirements:

To be part of Digital Manufacturing and Industrie 4.0 projects across client group of companies
Design and develop AI//ML models to be deployed across factories
Knowledge on Hadoop, Apache Spark, MapReduce, Scala, Python programming, SQL and NoSQL databases is required
Should be strong in statistics, data analysis, data modelling, machine learning techniques and Neural Networks
Prior experience in developing AI and ML models is required
Experience with data from the Manufacturing Industry would be a plus

Roles and Responsibilities:

Develop AI and ML models for the Manufacturing Industry with a focus on Energy, Asset Performance Optimization and Logistics
Multitasking, good communication necessary
Entrepreneurial attitude

Additional Information:

Travel: Must be willing to travel on shorter duration within India and abroad

Job Location: Chennai
Reporting to: Team Leader, Energy Management System

Location: Chennai
Education: BE/BTech
Experience: Minimum 3+ years of experience as a Data Scientist/Data Engineer

Domain knowledge: Data cleaning, modelling, analytics, statistics, machine learning, AI

Requirements:

To be part of Digital Manufacturing and Industrie 4.0 projects across client group of companies
Design and develop AI//ML models to be deployed across factories
Knowledge on Hadoop, Apache Spark, MapReduce, Scala, Python programming, SQL and NoSQL databases is required
Should be strong in statistics, data analysis, data modelling, machine learning techniques and Neural Networks
Prior experience in developing AI and ML models is required
Experience with data from the Manufacturing Industry would be a plus

Roles and Responsibilities:

Develop AI and ML models for the Manufacturing Industry with a focus on Energy, Asset Performance Optimization and Logistics
Multitasking, good communication necessary
Entrepreneurial attitude

Additional Information:

Travel: Must be willing to travel on shorter duration within India and abroad

Job Location: Chennai
Reporting to: Team Leader, Energy Management System

Software developer

at Tier 1 MNC

Agency job

via People First Consultants by Jayaraj E

Chennai, Pune, Bengaluru (Bangalore), Noida, Gurugram, Kochi (Cochin), Coimbatore, Hyderabad, Mumbai, Navi Mumbai

3 - 12 yrs

₹3L - ₹15L / yr

Spark

Hadoop

Big Data

Data engineering

PySpark

+1 more

Greetings,
We are hiring for Tier 1 MNC for the software developer with good knowledge in Spark,Hadoop and Scala

Platform Engineer

at Mobile Programming LLC

1 video

34 recruiters

Posted by Sukhdeep Singh

Chennai

4 - 7 yrs

₹13L - ₹15L / yr

Data Analytics

Data Visualization

PowerBI

Tableau

Qlikview

+10 more

Title: Platform Engineer Location: Chennai Work Mode: Hybrid (Remote and Chennai Office) Experience: 4+ years Budget: 16 - 18 LPA

Responsibilities:

Parse data using Python, create dashboards in Tableau.
Utilize Jenkins for Airflow pipeline creation and CI/CD maintenance.
Migrate Datastage jobs to Snowflake, optimize performance.
Work with HDFS, Hive, Kafka, and basic Spark.
Develop Python scripts for data parsing, quality checks, and visualization.
Conduct unit testing and web application testing.
Implement Apache Airflow and handle production migration.
Apply data warehousing techniques for data cleansing and dimension modeling.

Requirements:

4+ years of experience as a Platform Engineer.
Strong Python skills, knowledge of Tableau.
Experience with Jenkins, Snowflake, HDFS, Hive, and Kafka.
Proficient in Unix Shell Scripting and SQL.
Familiarity with ETL tools like DataStage and DMExpress.
Understanding of Apache Airflow.
Strong problem-solving and communication skills.

Note: Only candidates willing to work in Chennai and available for immediate joining will be considered. Budget for this position is 16 - 18 LPA.

Title: Platform Engineer Location: Chennai Work Mode: Hybrid (Remote and Chennai Office) Experience: 4+ years Budget: 16 - 18 LPA

Responsibilities:

Parse data using Python, create dashboards in Tableau.
Utilize Jenkins for Airflow pipeline creation and CI/CD maintenance.
Migrate Datastage jobs to Snowflake, optimize performance.
Work with HDFS, Hive, Kafka, and basic Spark.
Develop Python scripts for data parsing, quality checks, and visualization.
Conduct unit testing and web application testing.
Implement Apache Airflow and handle production migration.
Apply data warehousing techniques for data cleansing and dimension modeling.

Requirements:

4+ years of experience as a Platform Engineer.
Strong Python skills, knowledge of Tableau.
Experience with Jenkins, Snowflake, HDFS, Hive, and Kafka.
Proficient in Unix Shell Scripting and SQL.
Familiarity with ETL tools like DataStage and DMExpress.
Understanding of Apache Airflow.
Strong problem-solving and communication skills.

Note: Only candidates willing to work in Chennai and available for immediate joining will be considered. Budget for this position is 16 - 18 LPA.

Data Engineer

at Ganit Business Solutions

3 recruiters

Posted by Viswanath Subramanian

Chennai, Bengaluru (Bangalore), Mumbai

4 - 6 yrs

₹7L - ₹15L / yr

SQL

Amazon Web Services (AWS)

Data Warehouse (DWH)

Informatica

ETL

+1 more

Responsibilities:

Must be able to write quality code and build secure, highly available systems.
Assemble large, complex datasets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing datadelivery, re-designing infrastructure for greater scalability, etc with the guidance.
Create datatools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
Monitoring performance and advising any necessary infrastructure changes.
Defining dataretention policies.
Implementing the ETL process and optimal data pipeline architecture
Build analytics tools that utilize the datapipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
Create design documents that describe the functionality, capacity, architecture, and process.
Develop, test, and implement datasolutions based on finalized design documents.
Work with dataand analytics experts to strive for greater functionality in our data
Proactively identify potential production issues and recommend and implement solutions

Skillsets:

Good understanding of optimal extraction, transformation, and loading of datafrom a wide variety of data sources using SQL and AWS ‘big data’ technologies.
Proficient understanding of distributed computing principles
Experience in working with batch processing/ real-time systems using various open-source technologies like NoSQL, Spark, Pig, Hive, Apache Airflow.
Implemented complex projects dealing with the considerable datasize (PB).
Optimization techniques (performance, scalability, monitoring, etc.)
Experience with integration of datafrom multiple data sources
Experience with NoSQL databases, such as HBase, Cassandra, MongoDB, etc.,
Knowledge of various ETL techniques and frameworks, such as Flume
Experience with various messaging systems, such as Kafka or RabbitMQ
Good understanding of Lambda Architecture, along with its advantages and drawbacks
Creation of DAGs for dataengineering
Expert at Python /Scala programming, especially for dataengineering/ ETL purposes

Responsibilities:

Must be able to write quality code and build secure, highly available systems.
Assemble large, complex datasets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing datadelivery, re-designing infrastructure for greater scalability, etc with the guidance.
Create datatools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
Monitoring performance and advising any necessary infrastructure changes.
Defining dataretention policies.
Implementing the ETL process and optimal data pipeline architecture
Build analytics tools that utilize the datapipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
Create design documents that describe the functionality, capacity, architecture, and process.
Develop, test, and implement datasolutions based on finalized design documents.
Work with dataand analytics experts to strive for greater functionality in our data
Proactively identify potential production issues and recommend and implement solutions

Skillsets:

Good understanding of optimal extraction, transformation, and loading of datafrom a wide variety of data sources using SQL and AWS ‘big data’ technologies.
Proficient understanding of distributed computing principles
Experience in working with batch processing/ real-time systems using various open-source technologies like NoSQL, Spark, Pig, Hive, Apache Airflow.
Implemented complex projects dealing with the considerable datasize (PB).
Optimization techniques (performance, scalability, monitoring, etc.)
Experience with integration of datafrom multiple data sources
Experience with NoSQL databases, such as HBase, Cassandra, MongoDB, etc.,
Knowledge of various ETL techniques and frameworks, such as Flume
Experience with various messaging systems, such as Kafka or RabbitMQ
Good understanding of Lambda Architecture, along with its advantages and drawbacks
Creation of DAGs for dataengineering
Expert at Python /Scala programming, especially for dataengineering/ ETL purposes

Python + Data scientist

at A leading global information technology and business process

Agency job

via Jobdost by Mamatha A

Chennai

5 - 14 yrs

₹13L - ₹21L / yr

Python

Java

PySpark

Javascript

Hadoop

Python + Data scientist :
• Hands-on and sound knowledge of Python, Pyspark, Java script

• Build data-driven models to understand the characteristics of engineering systems

• Train, tune, validate, and monitor predictive models

• Sound knowledge on Statistics

• Experience in developing data processing tasks using PySpark such as reading,

merging, enrichment, loading of data from external systems to target data destinations

• Working knowledge on Big Data or/and Hadoop environments

• Experience creating CI/CD Pipelines using Jenkins or like tools

• Practiced in eXtreme Programming (XP) disciplines

Python + Data scientist :
• Hands-on and sound knowledge of Python, Pyspark, Java script

• Build data-driven models to understand the characteristics of engineering systems

• Train, tune, validate, and monitor predictive models

• Sound knowledge on Statistics

• Experience in developing data processing tasks using PySpark such as reading,

merging, enrichment, loading of data from external systems to target data destinations

• Working knowledge on Big Data or/and Hadoop environments

• Experience creating CI/CD Pipelines using Jenkins or like tools

• Practiced in eXtreme Programming (XP) disciplines

GCP Developer

at Quess Corp Limited

6 recruiters

Posted by Anjali Singh

Noida, Delhi, Gurugram, Ghaziabad, Faridabad, Bengaluru (Bangalore), Chennai

5 - 8 yrs

₹1L - ₹15L / yr

Google Cloud Platform (GCP)

Python

Big Data

Data processing

Data Visualization

GCP Data Analyst profile must have below skills sets :

Knowledge of programming languages like https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.simplilearn.com%2Ftutorials%2Fsql-tutorial%2Fhow-to-become-sql-developer&;data=05%7C01%7Ca_anjali%40hcl.com%7C4ae720b3f3cc45c3e04608da3346b335%7C189de737c93a4f5a8b686f4ca9941912%7C0%7C0%7C637878675987971859%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=EImfaJAD1KHOyrBQ7FkbaPl1STtfnf4QdQlbjw72%2BmE%3D&reserved=0" target="_blank">SQL, Oracle, R, MATLAB, Java and https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.simplilearn.com%2Fwhy-learn-python-a-guide-to-unlock-your-python-career-article&;data=05%7C01%7Ca_anjali%40hcl.com%7C4ae720b3f3cc45c3e04608da3346b335%7C189de737c93a4f5a8b686f4ca9941912%7C0%7C0%7C637878675987971859%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Z2n1Xy%2F3YN6nQqSweU5T7EfUTa1kPAAjbCMTWxDCh%2FY%3D&reserved=0" target="_blank">Python
Data cleansing, data visualization, data wrangling
Data modeling , data warehouse concepts
Adapt to Big data platform like Hadoop, Spark for stream & batch processing
GCP (Cloud Dataproc, Cloud Dataflow, Cloud Datalab, Cloud Dataprep, BigQuery, Cloud Datastore, Cloud Datafusion, Auto ML etc)

Data Scientist

at TVS Credit Services

2 recruiters

Posted by Vinodhkumar Panneerselvam

Chennai

4 - 10 yrs

₹10L - ₹20L / yr

Data Science

R Programming

Python

Machine Learning (ML)

Hadoop

+3 more

Job Description: Be responsible for scaling our analytics capability across all internal disciplines and guide our strategic direction in regards to analytics Organize and analyze large, diverse data sets across multiple platforms Identify key insights and leverage them to inform and influence product strategy Technical Interactions with vendor or partners in technical capacity for scope/ approach & deliverables. Develops proof of concept to prove or disprove validity of concept. Working with all parts of the business to identify analytical requirements and formalize an approach for reliable, relevant, accurate, efficientreporting on those requirements Designing and implementing advanced statistical testing for customized problem solving Deliver concise verbal and written explanations of analyses to senior management that elevate findings into strategic recommendations Desired Candidate Profile: MTech / BE / BTech / MSc in CS or Stats or Maths, Operation Research, Statistics, Econometrics or in any quantitative field Experience in using Python, R, SAS Experience in working with large data sets and big data systems (SQL, Hadoop, Hive, etc.) Keen aptitude for large-scale data analysis with a passion for identifying key insights from data Expert working knowledge in various machine learning algorithms such XGBoost, SVM Etc. We are looking candidates from the following: Experience in Unsecured Loans & SME Loans analytics (cards, installment loans) - risk based pricing analytics Experience in Differential pricing / selection analytics (retail, airlines / travel etc). Experience in Digital product companies or Digital eCommerce with Product mindset and experience Experience in Fraud / Risk from Banks, NBFC / Fintech / Credit Bureau Experience in Online media with knowledge of media, online ads & sales (agencies) - Knowledge of DMP, DFP, Adobe/Omniture tools, Cloud Experience in Consumer Durable Loans lending companies (Experience in Credit Cards, Personal Loan - optional) Experience in Tractor Loans lending companies (Experience in Farm) Experience in Recovery, Collections analytics Experience in Marketing Analytics with Digital Marketing, Market Mix modelling, Advertising Technology

Machine Learning Architect - Deployments

at netmedscom

3 recruiters

Posted by Vijay Hemnath

Chennai

5 - 10 yrs

₹10L - ₹30L / yr

Machine Learning (ML)

Software deployment

CI/CD

Cloud Computing

Snow flake schema

+19 more

We are looking for an outstanding ML Architect (Deployments) with expertise in deploying Machine Learning solutions/models into production and scaling them to serve millions of customers. A candidate with an adaptable and productive working style which fits in a fast-moving environment.

Skills:

- 5+ years deploying Machine Learning pipelines in large enterprise production systems.

- Experience developing end to end ML solutions from business hypothesis to deployment / understanding the entirety of the ML development life cycle.
- Expert in modern software development practices; solid experience using source control management (CI/CD).
- Proficient in designing relevant architecture / microservices to fulfil application integration, model monitoring, training / re-training, model management, model deployment, model experimentation/development, alert mechanisms.
- Experience with public cloud platforms (Azure, AWS, GCP).
- Serverless services like lambda, azure functions, and/or cloud functions.
- Orchestration services like data factory, data pipeline, and/or data flow.
- Data science workbench/managed services like azure machine learning, sagemaker, and/or AI platform.
- Data warehouse services like snowflake, redshift, bigquery, azure sql dw, AWS Redshift.
- Distributed computing services like Pyspark, EMR, Databricks.
- Data storage services like cloud storage, S3, blob, S3 Glacier.
- Data visualization tools like Power BI, Tableau, Quicksight, and/or Qlik.
- Proven experience serving up predictive algorithms and analytics through batch and real-time APIs.
- Solid working experience with software engineers, data scientists, product owners, business analysts, project managers, and business stakeholders to design the holistic solution.
- Strong technical acumen around automated testing.
- Extensive background in statistical analysis and modeling (distributions, hypothesis testing, probability theory, etc.)
- Strong hands-on experience with statistical packages and ML libraries (e.g., Python scikit learn, Spark MLlib, etc.)
- Experience in effective data exploration and visualization (e.g., Excel, Power BI, Tableau, Qlik, etc.)
- Experience in developing and debugging in one or more of the languages Java, Python.
- Ability to work in cross functional teams.
- Apply Machine Learning techniques in production including, but not limited to, neuralnets, regression, decision trees, random forests, ensembles, SVM, Bayesian models, K-Means, etc.

Roles and Responsibilities:

Deploying ML models into production, and scaling them to serve millions of customers.

Technical solutioning skills with deep understanding of technical API integrations, AI / Data Science, BigData and public cloud architectures / deployments in a SaaS environment.

Strong stakeholder relationship management skills - able to influence and manage the expectations of senior executives.
Strong networking skills with the ability to build and maintain strong relationships with both business, operations and technology teams internally and externally.

Provide software design and programming support to projects.

Qualifications & Experience:

Engineering and post graduate candidates, preferably in Computer Science, from premier institutions with proven work experience as a Machine Learning Architect (Deployments) or a similar role for 5-7 years.

Skills:

- 5+ years deploying Machine Learning pipelines in large enterprise production systems.

Roles and Responsibilities:

Deploying ML models into production, and scaling them to serve millions of customers.

Technical solutioning skills with deep understanding of technical API integrations, AI / Data Science, BigData and public cloud architectures / deployments in a SaaS environment.

Provide software design and programming support to projects.

Qualifications & Experience:

Assistant Manager - Analytics - Product Team

at LatentView Analytics

3 recruiters

Posted by Kannikanti madhuri

Chennai

5 - 8 yrs

₹5L - ₹8L / yr

Data Science

Analytics

Data Analytics

Data modeling

Data mining

+7 more

Job Overview :We are looking for an experienced Data Science professional to join our Product team and lead the data analytics team and manage the processes and people responsible for accurate data collection, processing, modelling and analysis. The ideal candidate has a knack for seeing solutions in sprawling data sets and the business mindset to convert insights into strategic opportunities for our clients. The incumbent will work closely with leaders across product, sales, and marketing to support and implement high-quality, data-driven decisions. They will ensure data accuracy and consistent reporting by designing and creating optimal processes and procedures for analytics employees to follow. They will use advanced data modelling, predictive modelling, natural language processing and analytical techniques to interpret key findings.Responsibilities for Analytics Manager :- Build, develop and maintain data models, reporting systems, data automation systems, dashboards and performance metrics support that support key business decisions.- Design and build technical processes to address business issues.- Manage and optimize processes for data intake, validation, mining and engineering as well as modelling, visualization and communication deliverables.- Examine, interpret and report results to stakeholders in leadership, technology, sales, marketing and product teams.- Develop and implement quality controls and standards to ensure quality standards- Anticipate future demands of initiatives related to people, technology, budget and business within your department and design/implement solutions to meet these needs.- Communicate results and business impacts of insight initiatives to stakeholders within and outside of the company.- Lead cross-functional projects using advanced data modelling and analysis techniques to discover insights that will guide strategic decisions and uncover optimization opportunities.Qualifications for Analytics Manager :- Working knowledge of data mining principles: predictive analytics, mapping, collecting data from multiple cloud-based data sources- Strong SQL skills, ability to perform effective querying- Understanding of and experience using analytical concepts and statistical techniques: hypothesis development, designing tests/experiments, analysing data, drawing conclusions, and developing actionable recommendations for business units.- Experience and knowledge of statistical modelling techniques: GLM multiple regression, logistic regression, log-linear regression, variable selection, etc.- Experience working with and creating databases and dashboards using all relevant data to inform decisions.- Strong problem solving, quantitative and analytical abilities.- Strong ability to plan and manage numerous processes, people and projects simultaneously.- Excellent communication, collaboration and delegation skills.- We- re looking for someone with at least 5 years of experience in a position monitoring, managing and drawing insights from data, and someone with at least 3 years of experience leading a team. The right candidate will also be proficient and experienced with the following tools/programs :- Strong programming skills with querying languages: R, Python etc.- Experience with big data tools like Hadoop- Experience with data visualization tools: Tableau, d3.js, etc.- Experience with Excel, Word, and PowerPoint.

Big Data Developer

at Intelliswift Software

12 recruiters

Posted by Pratish Mishra

Chennai

4 - 8 yrs

₹8L - ₹17L / yr

Big Data

Spark

Scala

SQL

Greetings from Intelliswift! Intelliswift Software Inc. is a premier software solutions and Services Company headquartered in the Silicon Valley, with offices across the United States, India, and Singapore. The company has a proven track record of delivering results through its global delivery centers and flexible engagement models for over 450 brands ranging from Fortune 100 to growing companies. Intelliswift provides a variety of services including Enterprise Applications, Mobility, Big Data / BI, Staffing Services, and Cloud Solutions. Growing at an outstanding rate, it has been recognized as the second largest private IT Company in the East Bay. Domains: IT, Retail, Pharma, Healthcare, BFSI, and Internet & E-commerce website https://www.intelliswift.com/ Experience: 4-8 Years Job Location: Chennai Job Description: Skills: Spark, Scala, Big data, Hive · Strong Working experience in Spark, Scala, big data, h base and hive. · Should have good working experience in SQL and Spark SQL. · Good to have knowledge or experience in Teradata. · Familiar with General engineering Git, jenkins, sbt, maven.

Get to hear about interesting companies hiring right now

Follow Cutshort

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs

Get to hear about interesting companies hiring right now

Follow Cutshort