We at CarDekho create technology products used daily by millions of people. We’re seeking an experienced Data scientist to deliver that insight to us on a daily basis. Our ideal team member will have the mathematical and statistical expertise - As you mine, interpret, and clean our data, we will rely on you to ask questions, connect the dots, and uncover opportunities that lie hidden within - all with the ultimate goal of realizing the data’s full potential.
Responsibilities:-
- 3+ years of industrial experience in predictive modeling and analysis, predictive software development.
- Experience in mentoring junior team members, and guiding them on machine learning and data modeling applications.
- Strong communication and data presentation skills. Experience implementing ML algorithms such as Logistic Regression, Naive Bayes, Bayesian Network, Decision Tree, Neural Network, SVM, Random Forest, convex optimization, transfer learning.
- Hands-on experience on a minimum of 2 projects that involves either ML or Deep Learning(Theano OR Keras OR Tensorflow).
- Excellent organizational and analytical skills. Exposure to REST concepts.
- Expert knowledge developing and debugging in Node.js or Python.
- Contribute to the production solutions development, testing, and deployment
Requirement:-
- BTech/MTech/Ph.D. in Computer Science or degree in statistics, applied mathematics, from Tier 1 Institutes
- 4+ years experience in data science Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
- Proficiency with data mining, mathematics, and statistical analysis Advanced pattern recognition and predictive modeling experience
About CarDekho
Similar jobs
- 3+ years experience in practical implementation and deployment of ML based systems preferred.
- BE/B Tech or M Tech (preferred) in CS/Engineering with strong mathematical/statistical background
- Strong mathematical and analytical skills, especially statistical and ML techniques, with familiarity with different supervised and unsupervised learning algorithms
- Implementation experiences and deep knowledge of Classification, Time Series Analysis, Pattern Recognition, Reinforcement Learning, Deep Learning, Dynamic Programming and Optimisation
- Experience in working on modeling graph structures related to spatiotemporal systems
- Programming skills in Python
- Experience in developing and deploying on cloud (AWS or Google or Azure)
- Good verbal and written communication skills
- Familiarity with well-known ML frameworks such as Pandas, Keras, TensorFlow
2+ years of Analytics with predominant experience in SQL, SAS, Statistics, R , Python, Visualization
Experienced in writing complex SQL select queries (window functions & CTE’s) with advanced SQL experience
Should be an individual contributor for initial few months based on project movement team will be aligned
Strong in querying logic and data interpretation
Solid communication and articulation skills
Able to handle stakeholders independently with less interventions of reporting manager
Develop strategies to solve problems in logical yet creative ways
Create custom reports and presentations accompanied by strong data visualization and storytelling
• Engage directly with the client to understand marketing objectives
• Develop custom performance reporting and analyses across multiple channels(Search, Display, Social,
Email etc)
• Architect solutions & provide recommendations that drive results on client campaigns & objectives
• Support A/B testing build work using customer experience optimization tools
• Collaborate closely with other teams to formulate industry-best-practice analytic solutions and
directly contribute to a variety of validation activities such as model design, execution and
assessment; data review; and campaign/marketing performance evaluation.
• Responsible for assisting in defining a comprehensive measurement framework, developing effective
reporting and dashboards with an objective to evaluate marketing performance and provide
recommendations for enhancement
• Manage the day to day core insights/trends of our digital marketing programs to help guide
optimization efforts
• Work with various internal and externalstakeholders to develop project plan, manage the day-to-day
tasks and meet project deadlines
Qualifications:
• 2 - 5 years of industry experience required
• Bachelors or Masters degree in Mathematics, Statistics, Economics, Finance, or Engineering required
• Strong experience with SQL and Python required
• Demonstrated proficiency in multiple digital marketing channels (Paid Search or SEM, Organic Search
or SEO, Paid Social, Earned Social, Display, Email) required
• Media Platforms and 3rd Party Tools experience (Google Adwords, DoubleClick, MediaMath, Bing Ads,
ExactTarget, Hitwise, BrightEdge, Facebook, etc.) useful
• Strong experience in Web Analytics tools (Adobe, Omniture, WebTrends, Google Analytics, AdWords,
AdCenter, DoubleClick, MediaMath, Exact Target, etc.) preferred
• Understanding of relational databases and familiarity with data processing
• Excellent written and oral presentation skills
• Strong problem solving and consulting skills
· Build data products and processes alongside the core engineering and technology team.
· Collaborate with senior data scientists to curate, wrangle, and prepare data for use in their advanced analytical models
· Integrate data from a variety of sources, assuring that they adhere to data quality and accessibility standards
· Modify and improve data engineering processes to handle ever larger, more complex, and more types of data sources and pipelines
· Use Hadoop architecture and HDFS commands to design and optimize data queries at scale
· Evaluate and experiment with novel data engineering tools and advises information technology leads and partners about new capabilities to determine optimal solutions for particular technical problems or designated use cases .
About Us:
We are a VC-funded startup solving one of the biggest transportation problems India faces. Most passengers in India travel long distance by IRCTC trains. At time of booking, approx 1 out of every 2 passengers end up with a Waitlisted or RAC ticket. This creates a lot of anxiety for passengers, as Railway only announces only 4 hour before departure if they have a confirmed seat. We solve this problem through our Waitlist & RAC Protection. Protection can be bought against each IRCTC ticket at time of booking. If train ticket is not confirmed, we fly the passenger to the destination. Our team consists of 3 Founders from IIT, IIM and ISB.
Functional Experience:
- Computer Science or IT Engineering background with solid understanding of basics of Data Structures and Algorithms
- 2+ years of data science experience working with large datasets
- Expertise in Python packages like pandas, numPy, sklearn, matplotlib, seaborn, keras and tensorflow
- Expertise in Big Data technologies like Hadoop, Cassandra and PostgreSQL
- Expertise in Cloud computing on AWS with EC2, AutoML, Lambda and RDS
- Good knowledge of Machine Learning and Statistical time series analysis (optional)
- Unparalleled logical ability making you the go to guy for all things related to data
- You love coding like a hobby and are up for a challenge!
Cultural:
- Assume a strong sense of ownership of analytics : Design, develop & deploy
- Collaborate with senior management, operations & business team
- Ensure Quality & sustainability of the architecture
- Motivation to join an early stage startup should go beyond compensation
• Help build a Data Science team which will be engaged in researching, designing,
implementing, and deploying full-stack scalable data analytics vision and machine learning
solutions to challenge various business issues.
• Modelling complex algorithms, discovering insights and identifying business
opportunities through the use of algorithmic, statistical, visualization, and mining techniques
• Translates business requirements into quick prototypes and enable the
development of big data capabilities driving business outcomes
• Responsible for data governance and defining data collection and collation
guidelines.
• Must be able to advice, guide and train other junior data engineers in their job.
Must Have:
• 4+ experience in a leadership role as a Data Scientist
• Preferably from retail, Manufacturing, Healthcare industry(not mandatory)
• Willing to work from scratch and build up a team of Data Scientists
• Open for taking up the challenges with end to end ownership
• Confident with excellent communication skills along with a good decision maker
Senior Computer Vision Developer
This position is not for freshers. We are looking for candidates with AI/ML/CV experience of at least 4 year in the industry.
● Working hand in hand with application developers and data scientists to help build softwares that scales in terms of performance and stability Skills ● 3+ years of experience managing large scale data infrastructure and building data pipelines/ data products. ● Proficient in - Any data engineering technologies and proficient in AWS data engineering technologies is plus. ● Language - python, scala or go ● Experience in working with real time streaming systems Experience in handling millions of events per day Experience in developing and deploying data models on Cloud ● Bachelors/Masters in Computer Science or equivalent experience Ability to learn and use skills in new technologies
along with metrics to track their progress
Managing available resources such as hardware, data, and personnel so that deadlines
are met
Analysing the ML algorithms that could be used to solve a given problem and ranking
them by their success probability
Exploring and visualizing data to gain an understanding of it, then identifying
differences in data distribution that could affect performance when deploying the model
in the real world
Verifying data quality, and/or ensuring it via data cleaning
Supervising the data acquisition process if more data is needed
Defining validation strategies
Defining the pre-processing or feature engineering to be done on a given dataset
Defining data augmentation pipelines
Training models and tuning their hyper parameters
Analysing the errors of the model and designing strategies to overcome them
Deploying models to production