Job Description
We are looking for a data scientist that will help us to discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver even better products. Your primary focus will be in applying data mining techniques, doing statistical analysis, and building high quality prediction systems integrated with our products.
Responsibilities
- Selecting features, building and optimizing classifiers using machine learning techniques
- Data mining using state-of-the-art methods
- Extending company’s data with third party sources of information when needed
- Enhancing data collection procedures to include information that is relevant for building analytic systems
- Processing, cleansing, and verifying the integrity of data used for analysis
- Doing ad-hoc analysis and presenting results in a clear manner
- Creating automated anomaly detection systems and constant tracking of its performance
Skills and Qualifications
- Excellent understanding of machine learning techniques and algorithms, such as Linear regression, SVM, Decision Forests, LSTM, CNN etc.
- Experience with Deep Learning preferred.
- Experience with common data science toolkits, such as R, NumPy, MatLab, etc. Excellence in at least one of these is highly desirable
- Great communication skills
- Proficiency in using query languages such as SQL, Hive, Pig
- Good applied statistics skills, such as statistical testing, regression, etc.
- Good scripting and programming skills
- Data-oriented personality
About One Labs
Building products in Artificial Intelligence and E-Commerce
Similar jobs
About Clutterbot
Many families feel overwhelmed trying to tidy up clutter while balancing work and other responsibilities. We’re developing a safe household robot that drives around the house, picks up toys and clothing off the floor, and organizes them into containers.
You’ll be joining a small close-knit team who’s excited about building a new kind of home robot. We're opening a new 4,500 sqft R&D space in HSR Layout, Bengaluru.
About the Role
We are looking for a senior machine learning or computer vision engineer to lead our efforts in the research and development of segmentation & detection deep learning models. The ideal candidate will be experienced in computer vision, applied machine learning, model hyperparameter tuning, model evaluation, and model pruning & quantization.
You’ll be part of our machine learning team and collaborate closely with others as we build solutions for autonomous robot behavior.
Skills & Qualifications
- Bachelor's degree in computer science, software engineering, mechatronics, or a related field
- 3+ years of relevant career experience either in industry or in an academic setting
- Experience training and deploying ML computer vision models such as semantic segmentation, object detection, and image classification
- Experience in ML model hyperparameter tuning and model evaluation.
- Experience in ML model optimization like pruning, quantization, etc.
- Strong C++ and Python programming skills
- Strong mathematics and data science skills, understanding of statistics, neural networks, and backpropagation
- Ability to stay abreast of current and emerging technologies
- Capable of solving complex problems with little supervision in a timely manner
- Knowledge of various dataset formats like COCO, VOC, CITYSCAPE, KITTY, etc
- Knowledge Data transformers for various deep learning frameworks specifically for PyTorch, Tensorflow, and Keras
- Understanding of architecture and implementation of the latest research works in computer vision eg: Object Detection (Yolov5 architecture), segmentations architectures, Depth estimation, Hydranets, etc
- Someone who has already worked on image datasets at a huge scale is more desirable
- Knowledge of Bigdata pipelines like pyspark or hadoop will be plus
- The ideal candidate will be life long learner and has the tenacity to understand new research and implement the same.
Benefits:
- Competitive compensation
- Health insurance
- Team Outing
- Company-sponsored devices.
- Flexible work culture
- As we expand, many, many, more to come!!
- 4+ years of experience Solid understanding of Python, Java and general software development skills (source code management, debugging, testing, deployment etc.).
- Experience in working with Solr and ElasticSearch Experience with NLP technologies & the handling of unstructured text Detailed understanding of text pre-processing and normalisation techniques such as tokenisation, lemmatisation, stemming, POS tagging etc.
- Prior experience in implementation of traditional ML solutions - classification, regression or clustering problem Expertise in text-analytics - Sentiment Analysis, Entity Extraction, Language modelling - and associated sequence learning models ( RNN, LSTM, GRU).
- Comfortable working with deep-learning libraries (eg. PyTorch)
- Candidate can even be a fresher with 1 or 2 years of experience IIIT, IIIT, Bits Pilani, top 5 local colleges are preferred colleges and universities.
- A Masters candidate in machine learning.
- Can source candidates from Mu Sigma and Manthan.
Bigdata JD :
Data Engineer – SQL, RDBMS, pySpark/Scala, Python, Hive, Hadoop, Unix
Data engineering services required:
- Builddataproducts and processes alongside the core engineering and technology team
- Collaborate with seniordatascientists to curate, wrangle, and prepare data for use in their advanced analytical models
- Integratedatafrom a variety of sources, assuring that they adhere to data quality and accessibility standards
- Modify and improvedataengineering processes to handle ever larger, more complex, and more types of data sources and pipelines
- Use Hadoop architecture and HDFS commands to design and optimizedataqueries at scale
- Evaluate and experiment with noveldataengineering tools and advises information technology leads and partners about new capabilities to determine optimal solutions for particular technical problems or designated use cases
Big data engineering skills:
- Demonstrated ability to perform the engineering necessary to acquire, ingest, cleanse, integrate, and structure massive volumes ofdatafrom multiple sources and systems into enterprise analytics platforms
- Proven ability to design and optimize queries to build scalable, modular, efficientdatapipelines
- Ability to work across structured, semi-structured, and unstructureddata, extracting information and identifying linkages across disparatedata sets
- Proven experience delivering production-readydataengineering solutions, including requirements definition, architecture selection, prototype development, debugging, unit-testing, deployment, support, and maintenance
- Ability to operate with a variety ofdataengineering tools and technologies; vendor agnostic candidates preferred
Domain and industry knowledge:
- Strong collaboration and communication skills to work within and across technology teams and business units
- Demonstrates the curiosity, interpersonal abilities, and organizational skills necessary to serve as a consulting partner, includes the ability to uncover, understand, and assess the needs of various business stakeholders
- Experience with problem discovery, solution design, and insight delivery that involves frequent interaction, education, engagement, and evangelism with senior executives
- Ideal candidate will have extensive experience with the creation and delivery of advanced analytics solutions for healthcare payers or insurance companies, including anomaly detection, provider optimization, studies of sources of fraud, waste, and abuse, and analysis of clinical and economic outcomes of treatment and wellness programs involving medical or pharmacy claimsdata, electronic medical recorddata, or other health data
- Experience with healthcare providers, pharma, or life sciences is a plus
What are we looking for:
- Strong experience in MySQL and writing advanced queries
- Strong experience in Bash and Python
- Familiarity with ElasticSearch, Redis, Java, NodeJS, ClickHouse, S3
- Exposure to cloud services such as AWS, Azure, or GCP
- 2+ years of experience in the production support
- Strong experience in log management and performance monitoring like ELK, Prometheus + Grafana, logging services on various cloud platforms
- Strong understanding of Linux OSes like Ubuntu, CentOS / Redhat Linux
- Interest in learning new languages / framework as needed
- Good written and oral communications skills
- A growth mindset and passionate about building things from the ground up, and most importantly, you should be fun to work with
As a product solutions engineer, you will:
- Analyze recorded runtime issues, diagnose and do occasional code fixes of low to medium complexity
- Work with developers to find and correct more complex issues
- Address urgent issues quickly, work within and measure against customer SLAs
- Using shell and python scripts, and use scripting to actively automate manual / repetitive activities
- Build anomaly detectors wherever applicable
- Pass articulated feedback from customers to the development and product team
- Maintain ongoing record of the operation of problem analysis and resolution in a on call monitoring system
- Offer technical support needed in development
Data Scientist
applied research.
● Understand, apply and extend state-of-the-art NLP research to better serve our customers.
● Work closely with engineering, product, and customers to scientifically frame the business problems and come up with the underlying AI models.
● Design, implement, test, deploy, and maintain innovative data and machine learning solutions to accelerate our business.
● Think creatively to identify new opportunities and contribute to high-quality publications or patents.
Desired Qualifications and Experience
● At Least 1 year of professional experience.
● Bachelors in Computer Science or related fields from the top colleges.
● Extensive knowledge and practical experience in one or more of the following areas: machine learning, deep learning, NLP, recommendation systems, information retrieval.
● Experience applying ML to solve complex business problems from scratch.
● Experience with Python and a deep learning framework like Pytorch/Tensorflow.
● Awareness of the state of the art research in the NLP community.
● Excellent verbal and written communication and presentation skills.
- Measure the sales effectiveness efforts using data science/app/digital nudges.
- Should be able to work on the clickstream data
- Should be well versed and willing to work hands-on various Machine Learning techniques
Skills
- Ability to lead a team of 5-6 members.
- Ability to work with large data sets and present conclusions to key stakeholders.
- Develop a clear understanding of the client’s business issue to inform the best approach to the problem.
- Root-cause analysis
- Define data requirements for creating a model and understand the business problem
- Clean, aggregate, analyze, interpret data and carry out quality analysis of it
- Set up data for predictive/prescriptive analysis
- Development of AI/ML models or statistical/econometric models.
- Working along with the team members
- Looking for insight and creating a presentation to demonstrate these insights
- Supporting development and maintenance of proprietary marketing techniques and other knowledge development projects.
------------------------
Solve problems in speech and NLP domain using advanced Deep learning and Machine Learning techniques. Few examples of the problems are -
* Limited resource Speaker Diarization on mono-channel recordings in noisy environment.
* Speech Enhancement to improve accuracy of downstream speech analytics tasks.
* Automated Speech Recognition for accent heavy audio with a noisy background.
* Speech analytic tasks, which include: emotions, empathy, keyword extraction.
* Text analytic tasks, which include: topic modeling, entity and intent extraction, opinion mining, text classification, and sentiment detection on multilingual data.
A typical day at work
-----------------------------
You will work closely with the product team to own a business problem. You will then model the business problem into a Machine Learning problem. Next you will do literature review to identify approaches to solve the problem. Test these approaches, identify the best approach, add your own insights to improve the performance and ship that to production!
What should you know?
---------------------------------
* Solid understanding of Classical Machine Learning and Deep Learning concepts and algorithms.
* Experience with literature review either in academia or industry.
* Proficiency in at least one programming language such as Python, C, C++, Java, etc.
* Proficiency in Machine Learning tools such as TensorFlow, Keras, Caffe, Torch/PyTorch or Theano.
* Advanced degree in Computer Science, Electrical Engineering, Machine Learning, Mathematics, Statistics, Physics, or Computational Linguistics
Why DeepAffects?
--------------------------
* You’ll learn insanely fast here.
* Esops and competitive compensation.
* Opportunity and encouragement for publishing research at top conferences, paid trips to attend workshop and conferences where you have published.
* Independent work, flexible timings and sense of ownership of your work.
* Mentorship from distinguished researchers and professors.
Data Analyst
at Dotball Interactive Private Limited