Data Engineer - AWS

at A global business process management company

Agency job
Gurugram, Pune, Mumbai, Bengaluru (Bangalore), Chennai, Nashik
4 - 12 yrs
₹12L - ₹15L / yr
Full time
Skills
Data engineering
Data modeling
data pipeline
Data integration
Data Warehouse (DWH)
Data engineer
AWS RDS
Glue
AWS CloudFormation
Amazon Web Services (AWS)
DevOps
AWS Lambda
Python
Django
Data Pipeline
Step functions
RDS

 

 

Designation – Deputy Manager - TS


Job Description

  1. A total of 8 to 9 years of development experience in Data Engineering (B1/BII role).
  2. A minimum of 4 to 5 years in AWS data integrations, with very good data modelling skills.
  3. Should be highly proficient in end-to-end AWS data solution design, which includes not only strong data ingestion and integration skills (both data at rest and data in motion) but also complete DevOps knowledge.
  4. Should have experience delivering at least 4 Data Warehouse or Data Lake solutions on AWS.
  5. Should have very strong experience with Glue, Lambda, Data Pipeline, Step Functions, RDS, CloudFormation, etc.
  6. Strong Python skills.
  7. Should be an expert in cloud design principles, performance tuning, and cost modelling. AWS certifications will be an added advantage.
  8. Should be a team player with excellent communication skills, able to manage their work independently with minimal or no supervision.
  9. A Life Science & Healthcare domain background will be a plus.

Qualifications

BE/BTech/ME/MTech

 


Similar jobs

a reputed firm providing world-class consulting Company
Agency job
via Jobdost by Saida Jabbar
Ahmedabad, Hyderabad, Pune, Delhi
5 - 8 yrs
₹25L - ₹30L / yr
Snowflake schema
Amazon Web Services (AWS)
AWS Lambda
ETL
Informatica

Data Engineer 

 

Mandatory Requirements 

  • Expertise in ETL and Snowflake
  • Experience in AWS ETL using AWS Glue and AWS Lambda
  • Proficient in blob storage and data lakes
  • Understanding of file-based ingestion best practices

CORE RESPONSIBILITIES

  • Ingest data from different data sources that expose data using different technologies, such as RDBMS, REST HTTP APIs, flat files, streams, and time series data based on various proprietary systems. Implement data ingestion and processing with the help of Big Data technologies.
  • Process and transform data using various technologies such as Spark and cloud services. You will need to understand your part of the business logic and implement it using the language supported by the base data platform.
  • Develop automated data quality checks to make sure the right data enters the platform and to verify the results of the calculations.
  • Develop an infrastructure to collect, transform, combine and publish/distribute customer data.
  • Define process improvement opportunities to optimize data collection, insights and displays.
  • Ensure data and results are accessible, scalable, efficient, accurate, complete and flexible 
  • Identify and interpret trends and patterns from complex data sets 
  • Construct a framework utilizing data visualization tools and techniques to present consolidated analytical and actionable results to relevant stakeholders. 
  • Be a key participant in regular Scrum ceremonies with the agile teams
  • Proficient at developing queries, writing reports and presenting findings 
  • Mentor junior members and bring best industry practices 

 

QUALIFICATIONS

  • 5-7+ years' experience as a data engineer in the consumer finance, manufacturing, or oil and gas industry
  • Strong background in math, statistics, computer science, data science or related discipline
  • Advanced knowledge of at least one of the following languages: Python, R, C#
  • Production experience with: HDFS, YARN, Hive, Spark, Kafka, Azure, Docker / Kubernetes, SQL Server, Synapse, Snowflake, AWS
  • Proficient with
    • Data mining/programming tools (e.g. SAS, SQL, R, Python)
    • Database technologies (e.g. MongoDB, PostgreSQL, Redshift, Snowflake, and Greenplum)
    • Data visualization (e.g. Tableau, PowerBI, QlikSense)
  • Comfortable learning about and deploying new technologies and tools. 
  • Organizational skills and the ability to handle multiple projects and priorities simultaneously and meet established deadlines. 
  • Good written and oral communication skills and ability to present results to non-technical audiences 
  • Knowledge of business intelligence and analytical tools, technologies and techniques.

 

Posted by Shridhar Nayak
Bengaluru (Bangalore)
2 - 6 yrs
Best in industry
Machine Learning (ML)
Data Science
Natural Language Processing (NLP)
Computer Vision
Neural networks
We are hiring at InViz AI ( https://www.inviz.ai/ )
InViz is a Bangalore-based agile/lean consulting and product company helping enterprises build their systems in domains such as search, machine-based intelligence, cloud migrations, and distributed platforms for B2B and B2C needs. We use state-of-the-art technologies in Computer Vision, Natural Language Processing, Text Mining, and ML to extract information and concepts from data in different formats (text, images, videos) and make them easily discoverable through simple, human-friendly touchpoints. We build services and data platforms to solve complex problems with simplicity.

 

Experience: 2 or more years

Responsibilities (but not limited to): 

  • Create data staging, transformation layers
  • Prepare model-ready-data 
  • Create consumption layer of data/models by exposing them as service 
  • Maintain/Monitor and ensure scalability 

Preferred Skills (but not limited to): 

  • Strong background in handling data, writing efficient SQL and Python scripts, optimizing queries and loops, designing dataflow jobs, identifying and optimizing bottlenecks in code, data structures, and design
  • Strong background in deploying ML/data as a service by writing APIs, including monitoring, error handling, load balancing, access, and authentication
  • Conversant with API development tools (such as GCP Apigee, FastAPI, Spring Boot)
  • Understanding of Apache Airflow, Spark Streaming, SparkML
  • Familiarity with JavaScript, jQuery UI, and UX design, keeping in mind optimized load balancing and other front-end aspects
at TIGI HR Solution Pvt. Ltd.
Posted by Dhara Raval
Ahmedabad
5 - 9 yrs
₹12L - ₹15L / yr
Data Science
Keras
Python
Java
TensorFlow

Position: Data Scientist

Experience: 5+ Years

 
Required Skillset:
 
• 5+ Years of hands-on development experience with AI/ML technologies
• Programming experience in Python, R, or Java
• Extensive data modeling and data architecture skills
• Experience working with modern frameworks like Keras, TensorFlow, PyTorch, and MXNet
• Experience with container services and registries to serve ML models in the cloud
• Experience with the Amazon Web Services platform and associated machine learning services (Polly, Transcribe, Lex, Rekognition, Comprehend, Translate, etc.)
• Experience with open-source application development stacks
• Use Terraform, Ansible, Jenkins, AWS CDK (or similar) to set up automated CI/CD pipelines
 
Desired Skill Set:
 
• Advanced math skills (linear algebra, Bayesian statistics, group theory)
• Theoretical understanding of model architectures of various object classification, object detection models, recommender systems, NLP, text, and voice processing models, 3D models
• Knowledge of Hadoop or other distributed computing systems
• Understanding of performance and accuracy metrics for different classes of neural networks. Familiarity with industry-standard models and datasets and neural network tuning is a plus.
at Kaplan
Posted by Akshata Ranka
Bengaluru (Bangalore)
7 - 10 yrs
₹15L - ₹20L / yr
Statistical Analysis
Data mining
Data Visualization
Data Science
R Programming

Senior Data Scientist-Job Description

The Senior Data Scientist is a creative problem solver who uses statistical and mathematical principles and modelling skills to uncover new insights that will significantly and meaningfully impact business decisions and actions. She/he applies data science expertise to identify, define, and execute state-of-the-art techniques for academic opportunities and business objectives in collaboration with other Analytics team members. The Senior Data Scientist will execute analyses and outputs spanning test design and measurement, predictive analytics, multivariate analysis, data/text mining, pattern recognition, artificial intelligence, and machine learning.

 

Key Responsibilities:

  • Perform the full range of data science activities, including test design and measurement, predictive/advanced analytics, data mining, and analytic dashboards.
  • Extract, manipulate, analyse & interpret data from various corporate data sources developing advanced analytic solutions, deriving key observations, findings, insights, and formulating actionable recommendations.
  • Generate clearly understood and intuitive data science / advanced analytics outputs.
  • Provide thought leadership and recommendations on business process improvement, analytic solutions to complex problems.
  • Participate in best practice sharing and communication platform for advancement of the data science discipline.
  • Coach and collaborate with other data scientists and data analysts.
  • Present impact, insights, outcomes & recommendations to key business partners and stakeholders.
  • Comply with established Service Level Agreements to ensure timely, high quality deliverables with value-add recommendations, clearly articulated key findings and observations.

Qualification:

  • Bachelor's Degree (B.A./B.S.) or Master’s Degree (M.A./M.S.) in Computer Science, Statistics, Mathematics, Machine Learning, Physics, or similar degree
  • 5+ years of experience in data science in a digitally advanced industry focusing on strategic initiatives, marketing and/or operations.
  • Advanced knowledge of best-in-class analytic software tools and languages: Python, SQL, R, SAS, Tableau, Excel, PowerPoint.
  • Expertise in statistical methods, statistical analysis, data visualization, and data mining techniques.
  • Experience in test design, Design of Experiments, A/B testing, and measurement science.
  • Strong influencing skills to drive a robust testing agenda and data-driven decision making for process improvements.
  • Strong critical-thinking skills to track down complex data and engineering issues, evaluate different algorithmic approaches, and analyse data to solve problems.
  • Experience in partnering with IT, marketing operations & business operations to deploy predictive analytic solutions.
  • Ability to translate and communicate complex analytical, statistical, and mathematical concepts to non-technical audiences.
  • Strong written and verbal communication skills, as well as presentation skills.

 

at AdElement
Posted by Sachin Bhatevara
Pune
2 - 7 yrs
₹4L - ₹15L / yr
Python
MySQL
Athena
Data Visualization
Data Analytics
AdElement is an online advertising startup based in Pune. We do AI driven ad personalization for video and display ads. Audiences are targeted algorithmically across biddable sources of ad inventory through real time bidding. We are looking to grow our teams to meet the rapidly expanding market opportunity.

Job Description

  • Use statistical methods to analyze data and generate useful business reports and insights
  • Analyze publisher and demand-side data, provide the operations team with actionable insights to improve monetisation, and implement the resulting strategies
  • Provide support for ad hoc data requests from the Operations teams and Management
  • Use third-party APIs, web scraping, and CSV report processing to build dashboards in Google Data Studio
  • Provide support for Analytics Processes monitoring and troubleshooting
  • Support in creating reports, dashboards and models
  • Independently determine the appropriate approach for new assignments
Required Skills
  • Inquisitive and having great problem-solving skills
  • Ability to own projects and work independently once given a direction
  • Experience working directly with business users to build reports, dashboards, models and solving business questions with data
  • Tools expertise: relational databases (SQL is a must), along with Python
  • Familiarity with AWS Athena and Redshift is a plus

Experience

  • 2-7 years

Education

  • UG - B.Tech/B.E.; PG - M.Tech/ MSc, Computer Science, Statistics, Maths, Data Science/ Data Analytics
Analytics Consulting Company | REMOTE
Agency job
via Unnati by Veena Salian
Remote, Bengaluru (Bangalore)
2 - 4 yrs
₹18L - ₹20L / yr
Data Science
R Programming
Python
MongoDB
SQL
Do you want your software skills to contribute meaningfully to finding technology-driven solutions for various businesses while growing your career? Then read on.
 
Our client provides data-based process optimization and analytics solutions to businesses worldwide. Their innovative algorithms and customized IT solutions address complex problems in every field or industry, through non-standard tools backed by extensive research. They serve startups as well as large, medium and small enterprises, a majority of their clients being industry leaders.
 
With registered offices in India, USA and UAE, their projects support various sectors and functions like logistics, IT, Retail, Ecommerce, Healthcare industry among others, across Asia, America and Europe. The founder holds a Master’s degree from IIT and a PhD in Operations Research from USA, with rich experience in Optimization and Analytics for various industries. His team of top scientists and pedagogy experts are focusing on innovative revenue generation ideas with minimum operational costs.
 
As a Data Scientist, you will apply expertise in machine-learning, data mining and statistical methods to design, prototype, and build the next-generation analytics engines and services.
 
What you will do:
  • Conducting advanced statistical analysis to provide actionable insights, identify trends, and measure performance
  • Performing data exploration, cleaning, preparation and feature engineering; in addition to executing tasks such as building a POC, validation/ AB testing
  • Collaborating with data engineers & architects to implement and deploy scalable solutions
  • Communicating results to diverse audiences with effective writing and visualizations
  • Identifying and executing high-impact projects, triaging external requests, and ensuring timely completion so that the results are useful
  • Providing thought leadership by researching best practices, conducting experiments, and collaborating with industry leaders

 

 

What you need to have:
  • 2-4 years of experience in machine learning algorithms, predictive analytics, and demand forecasting in real-world projects
  • Strong statistical background in descriptive and inferential statistics, regression, and forecasting techniques
  • Strong programming background in Python (including packages like TensorFlow), R, D3.js, Tableau, Spark, SQL, MongoDB
  • Exposure to optimization and meta-heuristic algorithms and related applications is preferred
  • Background in a highly quantitative field like Data Science, Computer Science, Statistics, Applied Mathematics, Operations Research, Industrial Engineering, or similar fields
  • Should have 2-4 years of experience in data science algorithm design and implementation, and data analysis in different applied problems
  • DS mandatory skills: Python, R, SQL, deep learning, predictive analysis, applied statistics
at RS Consultants
Posted by Rahul Inamdar
Pune
4 - 6 yrs
₹18L - ₹30L / yr
Python
Amazon Web Services (AWS)
Data Science
Machine Learning (ML)
Java

Data Scientist - Product Development

Employment Type: Full Time, Permanent

Experience: 3-5 Years as a Full Time Data Scientist

Job Description:

We are looking for an exceptional Data Scientist who is passionate about data and motivated to build large-scale machine learning solutions that make our data products shine. This person will contribute to data analytics for insight discovery and to the development of machine learning pipelines that support modeling of terabytes (TB) of daily data for various use cases.

 

Location: Pune (currently remote during the pandemic; relocation required afterwards)

About the Organization: A funded product development company headquartered in Singapore, with offices in Australia, the United States, Germany, the United Kingdom, and India. You will gain work experience in a global environment.

Qualifications:

 

Candidate Profile:

  • 3+ years relevant working experience
  • Master / Bachelor’s in computer science or engineering
  • Working knowledge of Python, Spark / Pyspark, SQL
  • Experience working with large-scale data
  • Experience in data manipulation, analytics, visualization, model building, model deployment
  • Proficiency in various ML algorithms for supervised and unsupervised learning
  • Experience working in an Agile/Lean model
  • Exposure to building large-scale ML models using one or more modern tools and libraries such as AWS SageMaker, Spark MLlib, TensorFlow, PyTorch, Keras, GCP ML Stack
  • Exposure to MLOps tools such as MLflow, Airflow
  • Exposure to modern Big Data tech such as Cassandra/Scylla, Snowflake, Kafka, Ceph, Hadoop
  • Exposure to IaaS platforms such as AWS, GCP, Azure
  • Experience with Java and Golang is a plus
  • Experience with BI toolkits such as Superset, Tableau, QuickSight, etc. is a plus

 

Note: Looking for someone who can join immediately or within a month, has experience with product development companies, and has dealt with streaming data. Experience working in a product development team is desirable. AWS experience is a must. Strong experience in Python and its related libraries is required.

Integral Ad Science
Agency job
via VIPSA TALENT SOLUTIONS by Prashma S R
Pune
5 - 8 yrs
₹9L - ₹25L / yr
Java
Hadoop
Apache Spark
Scala
Python
  • 6+ years of recent hands-on Java development
  • Developing data pipelines in AWS or Google Cloud
  • Java, Python, JavaScript programming languages
  • Great understanding of designing for performance, scalability, and reliability of data-intensive applications
  • Hadoop MapReduce, Spark, Pig. Understanding of database fundamentals and advanced SQL knowledge.
  • In-depth understanding of object oriented programming concepts and design patterns
  • Ability to communicate clearly to technical and non-technical audiences, verbally and in writing
  • Understanding of full software development life cycle, agile development and continuous integration
  • Experience in Agile methodologies including Scrum and Kanban
at Simplilearn Solutions
Posted by Aniket Manhar Nanjee
Bengaluru (Bangalore)
2 - 5 yrs
₹6L - ₹10L / yr
Data Science
R Programming
Python
Scala
Tableau
Simplilearn.com is the world’s largest professional certifications company and an Onalytica Top 20 influential brand. With a library of 400+ courses, we've helped 500,000+ professionals advance their careers, delivering $5 billion in pay raises. Simplilearn has over 6500 employees worldwide and our customers include Fortune 1000 companies, top universities, leading agencies and hundreds of thousands of working professionals. We are growing over 200% year on year and having fun doing it.

Description

We are looking for candidates with strong technical skills and a proven track record in building predictive solutions for enterprises. This is a very challenging role and provides an opportunity to work on developing insights-based Ed-Tech software products used by a large set of customers across the globe. It provides an exciting opportunity to work across various advanced analytics and data science problem statements using cutting-edge modern technologies, collaborating with product, marketing and sales teams.

Responsibilities

  • Work on enterprise-level advanced reporting requirements and data analysis.
  • Solve various data science problems: customer engagement, dynamic pricing, lead scoring, NPS improvement, optimization, chatbots, etc.
  • Work on data engineering problems utilizing our tech stack: S3 data lake, Spark, Redshift, Presto, Druid, Airflow, etc.
  • Collect relevant data from source systems, or use crawling and parsing infrastructure to put together data sets.
  • Craft, conduct and analyse A/B experiments to evaluate machine learning models/algorithms.
  • Communicate findings and take algorithms/models to production with ownership.

Desired Skills

  • BE/BTech/MSc/MS in Computer Science or related technical field.
  • 2-5 years of experience in an advanced analytics discipline with solid data engineering and visualization skills.
  • Strong SQL skills and BI skills using Tableau, and the ability to perform various complex analytics on data.
  • Ability to propose hypotheses and design experiments in the context of specific problems using statistics and ML algorithms.
  • Good overlap with modern data processing frameworks such as AWS Lambda and Spark, using Scala or Python.
  • Dedication and diligence in understanding the application domain, collecting/cleaning data and conducting various A/B experiments.
  • A Bachelor's degree in Statistics or prior experience with Ed-Tech is a plus.
at Wise Source
Posted by Wise HR
Remote, Guindy
0 - 2 yrs
₹1L - ₹1.5L / yr
Artificial Intelligence (AI)
Machine Learning (ML)
Internship
Java
Python
Looking for internship candidates.

Designation: Intern/Trainee
Technology: .NET/Java/Python/AI/ML
Duration: 2-3 months
Job Location: Online internship
Joining: Immediately
Job Type: Internship

Job Description

  • MCA/M.Tech/B.Tech/BE candidates who need a 2-6 month internship project.
  • Should be available to join us immediately.
  • Should be flexible to work on any skills/technologies.
  • Ready to work long hours.
  • Must possess excellent analytical and logical skills.
  • Internship experience is provided by experts.
  • An internship certificate will be provided at the end of training.
  • The requirement is strictly for an internship and not a permanent job.
  • A stipend will be provided based only on performance.