EMC GreenPlum Jobs in Pune

Apply to 11+ EMC GreenPlum Jobs in Pune on CutShort.io. Explore the latest EMC GreenPlum Job opportunities across top companies like Google, Amazon & Adobe.

Data Engineer

at consulting & implementation services in the area of Oil & Gas, Mining and Manufacturing Industry

Agency job

via Jobdost by Sathish Kumar

Ahmedabad, Hyderabad, Pune, Delhi

5 - 7 yrs

₹18L - ₹25L / yr

AWS Lambda

AWS Simple Notification Service (SNS)

AWS Simple Queuing Service (SQS)

Python

PySpark

+9 more

Data Engineer

Required skill set: AWS GLUE, AWS LAMBDA, AWS SNS/SQS, AWS ATHENA, SPARK, SNOWFLAKE, PYTHON

Mandatory Requirements 

Experience in AWS Glue
Experience in Apache Parquet 
Proficient in AWS S3 and data lake 
Knowledge of Snowflake
Understanding of file-based ingestion best practices.
Scripting languages: Python and PySpark

CORE RESPONSIBILITIES

Create and manage cloud resources in AWS 
Ingest data from different data sources that expose data using different technologies, such as RDBMS, REST HTTP APIs, flat files, streams, and time-series data based on various proprietary systems. Implement data ingestion and processing with the help of Big Data technologies
Process and transform data using various technologies such as Spark and Cloud Services. You will need to understand your part of the business logic and implement it using the language supported by the base data platform
Develop automated data quality checks to make sure the right data enters the platform and to verify the results of the calculations
Develop an infrastructure to collect, transform, combine and publish/distribute customer data.
Define process improvement opportunities to optimize data collection, insights and displays.
Ensure data and results are accessible, scalable, efficient, accurate, complete and flexible 
Identify and interpret trends and patterns from complex data sets 
Construct a framework utilizing data visualization tools and techniques to present consolidated analytical and actionable results to relevant stakeholders. 
Key participant in regular Scrum ceremonies with the agile teams  
Proficient at developing queries, writing reports and presenting findings 
Mentor junior members and bring best industry practices

QUALIFICATIONS

5-7+ years’ experience as a data engineer in consumer finance or an equivalent industry (consumer loans, collections, servicing, optional product, and insurance sales)
Strong background in math, statistics, computer science, data science or a related discipline
Advanced knowledge of one of the following languages: Java, Scala, Python, C#
Production experience with: HDFS, YARN, Hive, Spark, Kafka, Oozie / Airflow, Amazon Web Services (AWS), Docker / Kubernetes, Snowflake
Proficient with
Data mining/programming tools (e.g. SAS, SQL, R, Python)
Database technologies (e.g. PostgreSQL, Redshift, Snowflake, and Greenplum)
Data visualization (e.g. Tableau, Looker, MicroStrategy)
Comfortable learning about and deploying new technologies and tools. 
Organizational skills and the ability to handle multiple projects and priorities simultaneously and meet established deadlines. 
Good written and oral communication skills and ability to present results to non-technical audiences 
Knowledge of business intelligence and analytical tools, technologies and techniques.

Familiarity and experience in the following is a plus: 

AWS certification
Spark Streaming 
Kafka Streaming / Kafka Connect 
ELK Stack 
Cassandra / MongoDB 
CI/CD: Jenkins, GitLab, Jira, Confluence and other related tools

Kafka Developer

at iLink Systems

1 video

1 recruiter

Posted by Ganesh Sooriyamoorthu

Chennai, Pune, Noida, Bengaluru (Bangalore)

5 - 15 yrs

₹10L - ₹15L / yr

Apache Kafka

Big Data

Java

Spark

Hadoop

+1 more

KSQL
Data Engineering spectrum (Java/Spark)
Spark Scala / Kafka Streaming
Confluent Kafka components
Basic understanding of Hadoop

Big Data developer

at one of the world's leading multinational investment banks

Agency job

via HiyaMee by Lithin Raj

Pune

5 - 9 yrs

₹5L - ₹15L / yr

PySpark

Data engineering

Big Data

Hadoop

Spark

+2 more

This role is for a developer with strong core application or system programming skills in Scala and Java, and good exposure to concepts and/or technology across the broader spectrum. Enterprise Risk Technology covers a variety of existing systems and green-field projects.
Full-stack Hadoop development experience with Scala development
Full-stack Java development experience covering Core Java (including JDK 1.8) and a good understanding of design patterns.
Requirements:-
• Strong hands-on development in Java technologies.
• Strong hands-on development in Hadoop technologies like Spark, Scala and experience on Avro.
• Participation in product feature design and documentation
• Requirement break-up, ownership and implementation.
• Product BAU deliveries and Level 3 production defects fixes.
Qualifications & Experience
• Degree holder in numerate subject
• Hands on Experience on Hadoop, Spark, Scala, Impala, Avro and messaging like Kafka
• Experience across a core compiled language – Java
• Proficiency in Java-related frameworks like Spring, Hibernate, JPA
• Hands-on experience in JDK 1.8 and a strong skill set covering Collections and Multithreading, with experience working on distributed applications.
• Strong hands-on development track record with end-to-end development cycle involvement
• Good exposure to computational concepts
• Good communication and interpersonal skills
• Working knowledge of risk and derivatives pricing (optional)
• Proficiency in SQL (PL/SQL), data modelling.
• Understanding of Hadoop architecture and the Scala programming language is good to have.

Data Engineer

at GradMener Technology Pvt. Ltd.

Posted by Soni Jagwani

Pune, Chennai

5 - 9 yrs

₹15L - ₹20L / yr

Scala

PySpark

Spark

SQL Azure

Hadoop

+4 more

5+ years of experience in a Data Engineering role on cloud environment

Must have good experience in Scala/PySpark (preferably in a Databricks environment)

Extensive experience with Transact-SQL.
Experience in Databricks/Spark.

Strong experience in data warehouse projects
Expertise in database development projects with ETL processes.
Manage and maintain data engineering pipelines

Develop batch processing, streaming and integration solutions
Experienced in building and operationalizing large-scale enterprise data solutions and applications

Using one or more of Azure data and analytics services in combination with custom solutions
Azure Data Lake, Azure SQL DW (Synapse), and SQL Database products or equivalent products from other cloud services providers

In-depth understanding of data management (e.g. permissions, security, and monitoring).
Cloud repositories, e.g. Azure, GitHub, Git
Experience in an agile environment (Prefer Azure DevOps).

Good to have

Manage source data access security
Automate Azure Data Factory pipelines
Continuous Integration/Continuous deployment (CICD) pipelines, Source Repositories
Experience in implementing and maintaining CICD pipelines
Power BI understanding, Delta Lakehouse architecture
Knowledge of software development best practices.
Excellent analytical and organization skills.
Effective working in a team as well as working independently.
Strong written and verbal communication skills.
Expertise in database development projects and ETL processes.

Big Data Engineer

at Clairvoyant India Private Limited

5 recruiters

Posted by Taruna Roy

Remote, Pune

3 - 8 yrs

₹4L - ₹15L / yr

Big Data

Hadoop

Java

Spark

Hibernate (Java)

+5 more

Job Title/Designation:
Mid / Senior Big Data Engineer
Job Description:
Role: Big Data Engineer
Number of open positions: 5
Location: Pune
At Clairvoyant, we're building a thriving big data practice to help enterprises enable and accelerate the adoption of big data and cloud services. In the big data space, we lead and serve as innovators, troubleshooters, and enablers. The big data practice at Clairvoyant focuses on solving our customers' business problems by delivering products designed with best-in-class engineering practices and a commitment to keep the total cost of ownership to a minimum.
Must Have:

4-10 years of experience in software development.
At least 2 years of relevant work experience on large scale Data applications.
Strong coding experience in Java is mandatory
Good aptitude, strong problem solving abilities, and analytical skills, ability to take ownership as appropriate
Should be able to do coding, debugging, performance tuning and deploying the apps to Prod.
Should have good working experience on
o Hadoop ecosystem (HDFS, Hive, Yarn, File formats like Avro/Parquet)
o Kafka
o J2EE Frameworks (Spring/Hibernate/REST)
o Spark Streaming or any other streaming technology.
Ability to work on the sprint stories to completion along with Unit test case coverage.
Experience working in Agile Methodology
Excellent communication and coordination skills
Knowledgeable (and preferred hands on) - UNIX environments, different continuous integration tools.
Must be able to integrate quickly into the team and work independently towards team goals

Role & Responsibilities:

Take the complete responsibility of the sprint stories' execution
Be accountable for the delivery of the tasks in the defined timelines with good quality.
Follow the processes for project execution and delivery.
Follow agile methodology
Work with the team lead closely and contribute to the smooth delivery of the project.
Understand/define the architecture and discuss the pros-cons of the same with the team
Involve in the brainstorming sessions and suggest improvements in the architecture/design.
Work with other team leads to get the architecture/design reviewed.
Work with the clients and counter-parts (in US) of the project.
Keep all the stakeholders updated about the project/task status/risks/issues if there are any.

Education: BE/B.Tech from reputed institute.
Experience: 4 to 9 years
Keywords: java, scala, spark, software development, hadoop, hive
Locations: Pune

Data Engineer

at EASEBUZZ

1 recruiter

Posted by Amala Baby

Pune

2 - 4 yrs

₹2L - ₹20L / yr

Spotfire

Qlikview

Tableau

PowerBI

Data Visualization

+12 more

Company Profile:

Easebuzz is a payment solutions company (a fintech organisation) that enables online merchants to accept, process and disburse payments through developer-friendly APIs. We are focusing on building plug-and-play products, including the payment infrastructure, to solve complete business problems. It is a place where all the action related to payments, lending, subscriptions and eKYC is happening at the same time.

We have been consistently profitable and are constantly developing new innovative products; as a result, we have grown 4x over the past year alone. We are well capitalised and closed a fundraise of $4M in March 2021 from prominent VC firms and angel investors. The company is based out of Pune and has a total strength of 180 employees. Easebuzz’s corporate culture is tied to the vision of building a workplace which breeds open communication and minimal bureaucracy. An equal opportunity employer, we welcome and encourage diversity in the workplace. One thing you can be sure of is that you will be surrounded by colleagues who are committed to helping each other grow.

Easebuzz Pvt. Ltd. has its presence in Pune, Bangalore, Gurugram.

Salary: As per company standards.

Designation: Data Engineering

Location: Pune

Experience with ETL, Data Modeling, and Data Architecture

Design, build and operationalize large scale enterprise data solutions and applications using one or more of AWS data and analytics services in combination with 3rd parties - Spark, EMR, DynamoDB, Redshift, Kinesis, Lambda, Glue.

Experience with AWS cloud data lake for development of real-time or near real-time use cases

Experience with messaging systems such as Kafka/Kinesis for real time data ingestion and processing

Build data pipeline frameworks to automate high-volume and real-time data delivery

Create prototypes and proof-of-concepts for iterative development.

Experience with NoSQL databases, such as DynamoDB, MongoDB etc

Create and maintain optimal data pipeline architecture.

Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies.

Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics.

Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.

Keep our data separated and secure across national boundaries through multiple data centers and AWS regions.

Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.

Evangelize a very high standard of quality, reliability and performance for data models and algorithms that can be streamlined into the engineering and sciences workflow

Build and enhance data pipeline architecture by designing and implementing data ingestion solutions.

Employment Type

Full-time

Bigdata Lead Architecture

at DataMetica

1 video

7 recruiters

Posted by Nikita Aher

Pune, Hyderabad

7 - 12 yrs

₹12L - ₹33L / yr

Big Data

Hadoop

Spark

Apache Spark

Apache Hive

+3 more

Job description

Role : Lead Architecture (Spark, Scala, Big Data/Hadoop, Java)

Primary Location : India-Pune, Hyderabad

Experience : 7 - 12 Years

Management Level: 7

Joining Time: Immediate Joiners are preferred

Attend requirements gathering workshops, estimation discussions, design meetings and status review meetings
Experience of Solution Design and Solution Architecture for the data engineer model to build and implement Big Data Projects on-premises and on cloud.
Align architecture with business requirements and stabilize the developed solution
Ability to build prototypes to demonstrate the technical feasibility of your vision
Professional experience facilitating and leading solution design, architecture and delivery planning activities for data intensive and high throughput platforms and applications
Ability to benchmark systems, analyze system bottlenecks and propose solutions to eliminate them
Able to help programmers and project managers in the design, planning and governance of implementing projects of any kind.
Develop, construct, test and maintain architectures and run Sprints for development and rollout of functionalities
Data analysis and code development experience, ideally in Big Data, Spark, Hive, Hadoop, Java, Python, PySpark
Execute projects of various types, i.e. design, development, implementation and migration of functional analytics models/business logic across architecture approaches
Work closely with Business Analysts to understand the core business problems and deliver efficient IT solutions for the product
Deploy sophisticated analytics programs using any cloud application.

Perks and Benefits we Provide!

Working with Highly Technical and Passionate, mission-driven people
Subsidized Meals & Snacks
Flexible Schedule
Approachable leadership
Access to various learning tools and programs
Pet Friendly
Certification Reimbursement Policy
Check out more about us on our website below!

www.datametica.com

Data Analytics Trainer

at Edubridge Learning

6 recruiters

Posted by Hemal Thakker

Mumbai, Pune, Hyderabad, Gurugram

2 - 6 yrs

₹4L - ₹7L / yr

Data Analytics

Python

R Programming

SAS

Machine Learning (ML)

+1 more

JOB DESCRIPTION

2 to 6 years of experience in imparting technical training/ mentoring
Must have very strong concepts of Data Analytics
Must have hands-on and training experience on Python, Advanced Python, R programming, SAS and machine learning
Must have good knowledge of SQL and Advanced SQL
Should have basic knowledge of Statistics
Should be good with operating systems (GNU/Linux) and network fundamentals
Must have knowledge on MS office (Excel/ Word/ PowerPoint)
Self-Motivated and passionate about technology
Excellent analytical and logical skills and team player
Must have exceptional Communication Skills/ Presentation Skills
Good aptitude skills are preferred

Responsibilities:

Ability to quickly learn any new technology and impart the same to other employees
Ability to resolve all technical queries of students
Conduct training sessions and drive the placement driven quality in the training
Must be able to work independently without the supervision of a senior person
Participate in reviews/ meetings

Qualification:

UG: Any Graduate in IT/Computer Science, B.Tech/B.E. – IT/ Computers
PG: MCA/MS/MSC – Computer Science
Any Graduate/ Post graduate, provided they are certified in similar courses

ABOUT EDUBRIDGE

EduBridge is an Equal Opportunity employer and we believe in building a meritorious culture where everyone is recognized for their skills and contribution.

Launched in 2009 EduBridge Learning is a workforce development and skilling organization with 50+ training academies in 18 States pan India. The organization has been providing skilled manpower to corporates for over 10 years and is a leader in its space. We have trained over a lakh semi urban & economically underprivileged youth on relevant life skills and industry-specific skills and provided placements in over 500 companies. Our latest product E-ON is committed to complementing our training delivery with an Online training platform, enabling the students to learn anywhere and anytime.

To know more about EduBridge please visit: http://www.edubridgeindia.com/

You can also visit us on Facebook (https://www.facebook.com/Edubridgelearning/) and LinkedIn (https://www.linkedin.com/company/edubridgelearning/) for our latest initiatives and products

Data Engineer For Python

at A2Tech Consultants

3 recruiters

Posted by Dhaval B

Pune

4 - 12 yrs

₹6L - ₹15L / yr

Data engineering

Data Engineer

ETL

Spark

Apache Kafka

+5 more

We are looking for a smart candidate with:

Strong Python Coding skills and OOP skills
Should have worked on Big Data product Architecture
Should have worked with any one of the SQL-based databases like MySQL, PostgreSQL and any one of the NoSQL-based databases such as Cassandra, Elasticsearch etc.
Hands on experience on frameworks like Spark RDD, DataFrame, Dataset
Experience on development of ETL for data product
Candidate should have working knowledge on performance optimization, optimal resource utilization, Parallelism and tuning of spark jobs
Working knowledge on file formats: CSV, JSON, XML, PARQUET, ORC, AVRO
Good to have working knowledge with any one of the Analytical Databases like Druid, MongoDB, Apache Hive etc.
Experience handling real-time data feeds (good to have working knowledge of Apache Kafka or a similar tool)

Key Skills:

Python and Scala (Optional), Spark / PySpark, Parallel programming

Machine Learning Engineers

at Ignite Solutions

6 recruiters

Posted by Juzar Malubhoy

Pune

3 - 7 yrs

₹7L - ₹15L / yr

Machine Learning (ML)

Python

Data Science

We are looking for a Machine Learning Engineer with 3+ years of experience with a background in Statistics and hands-on experience in the Python ecosystem, using sound Software Engineering practices.

Skills & Knowledge:

- Formal knowledge of fundamentals of probability & statistics along with the ability to apply basic statistical analysis methods like hypothesis testing, t-tests, ANOVA etc.
- Hands-on knowledge of data formats, data extraction, loading, wrangling, transformation, pre-processing and analysis.
- Thorough understanding of data-modeling and machine-learning concepts
- Complete understanding and ability to apply, implement and adapt standard implementations of machine learning algorithms
- Good understanding and ability to apply and adapt Neural Networks and Deep Learning, including common high-level Deep Learning architectures like CNNs and RNNs
- Fundamentals of computer science & programming, especially Data structures (like multi-dimensional arrays, trees, and graphs) and Algorithms (like searching, sorting, and dynamic programming)
- Fundamentals of software engineering and system design, such as requirements analysis, REST APIs, database queries, system and library calls, version control, etc.

Languages and Libraries:

- Hands-on experience with Python and Python Libraries for data analysis and machine learning, especially Scikit-learn, TensorFlow, Pandas, NumPy, Statsmodels, and SciPy.
- Experience with R and its ecosystem is a plus
- Knowledge of other open source machine learning and data modeling frameworks like Spark MLlib, H2O, etc. is a plus

Data Scientist

at Saama Technologies

6 recruiters

Posted by Sandeep Chaudhary

Pune

4 - 8 yrs

₹1L - ₹16L / yr

Data Science

Python

Machine Learning (ML)

Natural Language Processing (NLP)

Big Data

+2 more

Description

Must have direct hands-on experience of 4 years building complex Data Science solutions
Must have fundamental knowledge of Inferential Statistics
Should have worked on Predictive Modelling, using Python / R
Experience should include the following:
File I/O, Data Harmonization, Data Exploration
Machine Learning Techniques (Supervised, Unsupervised)
Multi-Dimensional Array Processing
Deep Learning
NLP, Image Processing
Prior experience in Healthcare Domain is a plus
Experience using Big Data is a plus
Should have excellent analytical and problem-solving ability
Should be able to grasp new concepts quickly
Should be well familiar with Agile Project Management Methodology
Should have excellent written and verbal communication skills
Should be a team player with an open mind

Get to hear about interesting companies hiring right now

Follow Cutshort

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.
