Similar jobs
Founded 2015  •  Product  •  500-1000 employees  •  Raised funding
Big Data
Datawarehousing
Scala
Machine Learning (ML)
Deep Learning
SQL
Data modeling
Hadoop
Spark
Apache Hive
PySpark
Python
Amazon Web Services (AWS)
Java
Cassandra
DevOps
HDFS
Chennai
2 - 5 yrs
₹6L - ₹25L / yr

We are looking for an outstanding Big Data Engineer with experience setting up and maintaining data warehouses and data lakes for an organization. This role collaborates closely with the Data Science team and helps the team build and deploy machine learning and deep learning models on big data analytics platforms.

Roles and Responsibilities:

  • Develop and maintain scalable data pipelines and build out new integrations and processes required for optimal extraction, transformation, and loading of data from a wide variety of data sources using 'Big Data' technologies.
  • Develop programs in Scala and Python as part of data cleaning and processing.
  • Assemble large, complex data sets that meet functional / non-functional business requirements and foster data-driven decision making across the organization.
  • Design and develop distributed, high-volume, high-velocity, multi-threaded event processing systems.
  • Implement processes and systems to validate data and monitor data quality, ensuring production data is always accurate and available for the key stakeholders and business processes that depend on it.
  • Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Maintain operational excellence, ensuring high availability and platform stability.
  • Collaborate closely with the Data Science team and help the team build and deploy machine learning and deep learning models on big data analytics platforms.

Skills:

  • Experience with Big Data pipelines, Big Data analytics, and data warehousing.
  • Experience with SQL/NoSQL, schema design and dimensional data modeling.
  • Strong understanding of Hadoop architecture and the HDFS ecosystem, and experience with the Big Data technology stack, such as HBase, Hadoop, Hive, and MapReduce.
  • Experience in designing systems that process structured as well as unstructured data at large scale.
  • Experience in AWS/Spark/Java/Scala/Python development.
  • Strong skills in PySpark (Python and Spark). Ability to create, manage and manipulate Spark DataFrames. Expertise in Spark query tuning and performance optimization.
  • Experience in developing efficient software code/frameworks for multiple use cases leveraging Python and big data technologies.
  • Prior exposure to streaming data sources such as Kafka.
  • Knowledge of shell scripting and Python scripting.
  • High proficiency in database skills (e.g., Complex SQL), for data preparation, cleaning, and data wrangling/munging, with the ability to write advanced queries and create stored procedures.
  • Experience with NoSQL databases such as Cassandra / MongoDB.
  • Solid experience in all phases of Software Development Lifecycle - plan, design, develop, test, release, maintain and support, decommission.
  • Experience with DevOps tools (GitHub, Travis CI, and JIRA) and methodologies (Lean, Agile, Scrum, Test Driven Development).
  • Experience building and deploying applications on on-premise and cloud-based infrastructure.
  • Good understanding of the machine learning landscape and concepts.

 

Qualifications and Experience:

Engineering graduates or postgraduates, preferably in Computer Science, from premier institutions, with 3-5 years of proven work experience as a Big Data Engineer or in a similar role.

Certifications:

Good to have at least one of the Certifications listed here:

    AZ 900 - Azure Fundamentals

    DP 200, DP 201, DP 203, AZ 204 - Data Engineering

    AZ 400 - DevOps Certification

Job posted by
Vijay Hemnath
Founded 2018  •  Product  •  1000-5000 employees  •  Profitable
PowerBI
Python
SQL
Remote only
5 - 10 yrs
₹15L - ₹35L / yr
Thrasio is the consumer goods company reimagining omnichannel commerce and consumer products, and boasts an innovation engine that brings high-quality products to market across digital marketplaces, channels, and retailers globally.
 
With the experience of evaluating more than 6,000 Amazon companies, acquiring over 130 top-rated brands, and managing the scale of 22,000 products, Thrasio is the largest acquirer of Amazon FBA brands. Since our founding in 2018, the team has grown to more than 1,000 people globally, and most of that growth has occurred during the COVID-19 pandemic. Hiring people who share a passion for their craft in the eCommerce space is the reason we’re projected to grow more than 10x in the next few years. This growth is supported by investors whose portfolios include Facebook, Google, Jet.com, StitchFix, and Lululemon. We do our best work when we’re surrounded by people who are insatiably curious, agile, and who thrive in collaborative, check-your-ego-at-the-door working environments. Sound like you? We’d love to chat.
 
The Team:
Analytics ensures that Thrasio has the data and insights required to facilitate data-driven decisions that help to manage our existing brands, identify new opportunities and ways to foster continued growth required to reach our goal of becoming the leading consumer products company.
 
The Role:
We are looking for a self-driven, curious, and analytical individual to join our Data Analytics team as a Senior Data Analyst. You will work with cross-functional teams to help solve complex business problems by leveraging data, building scalable processes, and creating compelling visualizations.

Key Responsibilities Include:

    • Partner with our business stakeholders, product and tech teams to define complex and ambiguous problems and solve them with data. 
    • Develop innovative analytical methodologies that answer strategic business questions across the organization and help us become more data-driven.
    • Relentlessly challenge the status quo in our processes to drive efficiency, speed, quality, and cost savings.
    • Work closely with the business to develop project roadmaps, including key activities, stakeholder engagement, and milestones.
    • Contribute innovative and strategic ideas to help shape a brand new team, and continue to drive impact as we scale.

Minimum Qualifications:

    • 5+ years of work experience in SQL/Python and data visualization/BI Tools to communicate business insights using data.
    • Strong Data Modeling and Power BI experience.
    • Intellectually curious, high energy, and strong work ethic; passionate about working with, normalizing, and synthesizing large amounts of data into actionable insights.
    • Process-oriented with strong organizational and communication skills.
    • Ability to identify and succinctly summarize roadblocks and constraints, propose potential solutions, and drive towards resolution.

Nice to have, but not required:

    • Bachelor’s degree in a quantitative field (Mathematics, Economics, Statistics, Quantitative, Physics, Engineering, etc.).
    • Experience managing junior analysts, providing feedback and mentorship to help grow the team.
THRASIO IS PROUD TO BE AN EQUAL OPPORTUNITY EMPLOYER AND CONSIDERS ALL QUALIFIED APPLICANTS FOR EMPLOYMENT WITHOUT REGARD TO RACE, COLOR, RELIGION, SEX, GENDER, SEXUAL ORIENTATION, GENDER IDENTITY, ANCESTRY, AGE, OR NATIONAL ORIGIN. FURTHER, QUALIFIED APPLICANTS WILL NOT BE DISCRIMINATED AGAINST ON THE BASIS OF DISABILITY, PROTECTED CLASSES, OR PROTECTED VETERAN STATUS. THRASIO PARTICIPATES IN E-VERIFY.
Job posted by
Virender Singh
at 5 years old AI Startup
Data Science
Python
Natural Language Processing (NLP)
Deep Learning
Machine Learning (ML)
Pune
2 - 6 yrs
₹12L - ₹18L / yr
  •  3+ years of experience in Machine Learning
  • Bachelors/Masters in Computer Engineering/Science.
  • Bachelors/Masters in Engineering/Mathematics/Statistics with sound knowledge of programming and computer concepts.
  • 10th and 12th academics: 70% and above.

Skills:
 - Strong Python/programming skills
 - Good conceptual understanding of Machine Learning/Deep Learning/Natural Language Processing
 - Strong verbal and written communication skills.
 - Should be able to manage a team, meet project deadlines and interface with clients.
 - Should be able to work across different domains, quickly ramp up on business processes and flows, and translate business problems into data solutions

Job posted by
Ramya D
at Global content marketplace
Agency job
via Qrata
Machine Learning (ML)
Natural Language Processing (NLP)
Python
Mumbai
4 - 8 yrs
₹20L - ₹30L / yr

We are building a global content marketplace that brings companies and content creators together to scale up content creation processes across 50+ content verticals and 150+ industries. Over the past 2.5 years, we’ve worked with companies like India Today, Amazon India, Adobe, Swiggy, Dunzo, Businessworld, Paisabazaar, IndiGo Airlines, Apollo Hospitals, Infoedge, Times Group, Digit, BookMyShow, UpGrad, Yulu, YourStory, and 350+ other brands.
Our mission is to become the world’s largest content creation and distribution platform for all kinds of content creators and brands.

 

Our Team

 

We are a 25+ member company and are scaling up rapidly in both team size and ambition.

If we were to define the kind of people and the culture we have, it would be -

a) Individuals with an Extreme Sense of Passion About Work

b) Individuals with Strong Customer and Creator Obsession

c) Individuals with Extraordinary Hustle, Perseverance & Ambition

We are on the lookout for individuals who are always open to going the extra mile and thrive in a fast-paced environment. We are strong believers in building a great, enduring company that can outlast its builders and create a massive impact on the lives of our employees, creators, and customers alike.

 

Our Investors

 

We are fortunate to be backed by some of the industry’s most prolific angel investors - Kunal Bahl and Rohit Bansal (Snapdeal founders); YourStory Media (Shradha Sharma); Dr. Saurabh Srivastava, Co-founder of IAN and NASSCOM; Slideshare co-founder Amit Ranjan; Indifi's Co-founder and CEO Alok Mittal; Sidharth Rao, Chairman of Dentsu Webchutney; Ritesh Malik, Co-founder and CEO of Innov8; Sanjay Tripathy, former CMO, HDFC Life, and CEO of Agilio Labs; Manan Maheshwari, Co-founder of WYSH; and Hemanshu Jain, Co-founder of Diabeto.
Backed by Lightspeed Venture Partners



Job Responsibilities:
● Design, develop, test, deploy, maintain and improve ML models
● Implement novel learning algorithms and recommendation engines
● Apply Data Science concepts to solve routine problems of target users
● Translate business analysis needs into well-defined machine learning problems, and select appropriate models and algorithms
● Create an architecture for, implement, maintain and monitor data source pipelines that can be used across different types of data sources
● Monitor performance of the architecture and conduct optimization
● Produce clean, efficient code based on specifications
● Verify and deploy programs and systems
● Troubleshoot, debug and upgrade existing applications
● Guide junior engineers for productive contribution to the development
The ideal candidate must have:

ML and NLP Engineer
● 4 or more years of experience in ML Engineering
● Proven experience in NLP
● Familiarity with generative language models such as GPT-3
● Ability to write robust code in Python
● Familiarity with ML frameworks and libraries
● Hands-on experience with AWS services like SageMaker and Personalize
● Exposure to state-of-the-art techniques in ML and NLP
● Understanding of data structures, data modeling, and software architecture
● Outstanding analytical and problem-solving skills
● Team player with the ability to work cooperatively with other engineers.
● Ability to make quick decisions in high-pressure environments with limited information.
Job posted by
Mrunal Kokate
Founded 2012  •  Products & Services  •  20-100 employees  •  Profitable
Amazon Web Services (AWS)
PySpark
Spark
SQL
Apache Spark
Java
Python
R Language
AWS Glue
Hyderabad
1 - 5 yrs
₹2L - ₹7L / yr

About Us: Helical IT, based out of Hyderabad, is a software company that specializes in Open Source Data Warehousing & Business Intelligence, servicing clients in various domains like Manufacturing, HR, Energy, Insurance, Social Media Analytics, E-commerce, Travel, etc.

 

Job Description:

  • Hands-on experience with AWS and AWS Glue (mandatory)
  • Demonstrated strength in data modeling, ETL development, and data warehousing
  • Hands-on experience using big data technologies (Hadoop, Hive, HBase, Spark, etc.); Apache Spark is mandatory
  • Hands-on experience using Spark, SQL
  • Hands-on experience with at least one programming language: Scala, Python, R, or Java
  • Strong database knowledge
  • Proven success in communicating with users, other technical teams, and senior management to collect requirements and describe data modeling decisions and data engineering strategy
  • Understanding of Agile development

 

Nice to Have:

  • Experience using business intelligence reporting tools
  • Experience with AWS QuickSight
  • Database understanding: Postgres, SQL Server, Cassandra, S3, Hadoop
  • Performance tuning of Spark jobs
  • Knowledge of any BI tool, such as Tableau, Jasper, Pentaho, or Helical Insight

 

Skills and Qualification:

 

  • BE / B.Tech / MS degree in Computer Science, Engineering or a related subject.
  • 2+ years of experience.
  • Ability to work independently.
  • Good written and oral communication skills
Job posted by
Monica Patidar
at leading pharmacy provider
Agency job
Data Science
R Programming
Python
Algorithms
Predictive modelling
Noida, NCR (Delhi | Gurgaon | Noida)
- yrs
₹18L - ₹24L / yr
Job Description:

• Help build a Data Science team that will research, design, implement, and deploy a full-stack, scalable data analytics vision and machine learning solutions to address various business issues.
• Model complex algorithms, discover insights and identify business opportunities through the use of algorithmic, statistical, visualization, and mining techniques.
• Translate business requirements into quick prototypes and enable the development of big data capabilities that drive business outcomes.
• Take responsibility for data governance and for defining data collection and collation guidelines.
• Must be able to advise, guide and train junior data engineers in their jobs.

Must Have:

• 4+ years of experience in a leadership role as a Data Scientist
• Preferably from the retail, manufacturing, or healthcare industry (not mandatory)
• Willing to work from scratch and build up a team of Data Scientists
• Open to taking on challenges with end-to-end ownership
• Confident, with excellent communication skills and good decision-making ability
Job posted by
Jyotsna Econolytics
Founded 2007  •  Products & Services  •  100-1000 employees  •  Profitable
Data Science
R Programming
Python
Deep Learning
Neural networks
OpenCV
Machine Learning (ML)
Image Processing
Remote, Bengaluru (Bangalore)
3 - 10 yrs
₹5L - ₹24L / yr
  • Adept at machine learning techniques and algorithms.
  • Feature selection, dimensionality reduction, building and optimizing classifiers using machine learning techniques
  • Data mining using state-of-the-art methods
  • Doing ad-hoc analysis and presenting results
  • Proficiency in using query languages such as N1QL, SQL
  • Experience with data visualization tools, such as D3.js, GGplot, Plotly, PyPlot, etc.
  • Creating automated anomaly detection systems and constantly tracking their performance
  • Strong Python skills are a must.
  • Strong data analysis and mining skills are a must
  • Deep Learning, Neural Network, CNN, Image Processing (Must)
  • Building analytic systems - data collection, cleansing and integration
  • Experience with NoSQL databases, such as Couchbase, MongoDB, Cassandra, HBase

Job posted by
Nikita Sadarangani
Founded 1969  •  Products & Services  •  100-1000 employees  •  Profitable
Data Science
Python
Spark
Scala
Remote, Kochi (Cochin)
1 - 5 yrs
₹4L - ₹10L / yr
Job Description Summary
Skill sets in Job Profile
1) Machine learning development using Python or Scala Spark
2) Knowledge of multiple ML algorithms like Random Forest, XGBoost, RNN, CNN, Transform learning, etc.
3) Awareness of typical challenges in machine learning implementation and the respective applications

Good to have
1) Stack development or DevOps team experience
2) Cloud services (AWS, Cloudera), SaaS, PaaS
3) Big data tools and frameworks
4) SQL experience

Job posted by
Sony Shetty
Founded 1969  •  Products & Services  •  100-1000 employees  •  Profitable
Data Warehouse (DWH)
Amazon Web Services (AWS)
SQL
MDM
Business Intelligence (BI)
Python
Pune
3 - 6 yrs
₹5L - ₹15L / yr
Consultants will have the opportunity to:
- Build a team with skills in ETL, reporting, MDM and ad-hoc analytics support
- Build technical solutions using the latest open-source and cloud-based technologies
- Work closely with the offshore senior consultant, the onshore team and the client's business and IT teams to gather project requirements
- Assist overall project execution from India, starting from project planning and team formation through system design and development, testing, UAT and deployment
- Build demos and POCs in support of business development for new and existing clients
- Prepare project documents and PowerPoint presentations for client communication
- Conduct training sessions to train associates and help shape their growth
Job posted by
Nishigandha Wagh
Founded 2017  •  Products & Services  •  20-100 employees  •  Profitable
Data Science
R Programming
Python
TensorFlow
Hyderabad
- yrs
₹1L - ₹1L / yr
Freshers with skills in Big Data, Data Science, and Computer Vision.
Job posted by
Jasmine Shaik