Data Steward

at Infogain

icon
NCR (Delhi | Gurgaon | Noida), Bengaluru (Bangalore), Mumbai, Pune
icon
7 - 8 yrs
icon
₹15L - ₹16L / yr
icon
Full time
Skills
Data steward
MDM
Tamr
Reltio
Data engineering
Python
ETL
SQL
Windows Azure
sas
dm studio
profisee
  1. Data Steward :

Data Steward will collaborate and work closely within the group software engineering and business division. Data Steward has overall accountability for the group's / Divisions overall data and reporting posture by responsibly managing data assets, data lineage, and data access, supporting sound data analysis. This role requires focus on data strategy, execution, and support for projects, programs, application enhancements, and production data fixes. Makes well-thought-out decisions on complex or ambiguous data issues and establishes the data stewardship and information management strategy and direction for the group. Effectively communicates to individuals at various levels of the technical and business communities. This individual will become part of the corporate Data Quality and Data management/entity resolution team supporting various systems across the board.

 

Primary Responsibilities:

 

  • Responsible for data quality and data accuracy across all group/division delivery initiatives.
  • Responsible for data analysis, data profiling, data modeling, and data mapping capabilities.
  • Responsible for reviewing and governing data queries and DML.
  • Accountable for the assessment, delivery, quality, accuracy, and tracking of any production data fixes.
  • Accountable for the performance, quality, and alignment to requirements for all data query design and development.
  • Responsible for defining standards and best practices for data analysis, modeling, and queries.
  • Responsible for understanding end-to-end data flows and identifying data dependencies in support of delivery, release, and change management.
  • Responsible for the development and maintenance of an enterprise data dictionary that is aligned to data assets and the business glossary for the group responsible for the definition and maintenance of the group's data landscape including overlays with the technology landscape, end-to-end data flow/transformations, and data lineage.
  • Responsible for rationalizing the group's reporting posture through the definition and maintenance of a reporting strategy and roadmap.
  • Partners with the data governance team to ensure data solutions adhere to the organization’s data principles and guidelines.
  • Owns group's data assets including reports, data warehouse, etc.
  • Understand customer business use cases and be able to translate them to technical specifications and vision on how to implement a solution.
  • Accountable for defining the performance tuning needs for all group data assets and managing the implementation of those requirements within the context of group initiatives as well as steady-state production.
  • Partners with others in test data management and masking strategies and the creation of a reusable test data repository.
  • Responsible for solving data-related issues and communicating resolutions with other solution domains.
  • Actively and consistently support all efforts to simplify and enhance the Clinical Trial Predication use cases.
  • Apply knowledge in analytic and statistical algorithms to help customers explore methods to improve their business.
  • Contribute toward analytical research projects through all stages including concept formulation, determination of appropriate statistical methodology, data manipulation, research evaluation, and final research report.
  • Visualize and report data findings creatively in a variety of visual formats that appropriately provide insight to the stakeholders.
  • Achieve defined project goals within customer deadlines; proactively communicate status and escalate issues as needed.

 

Additional Responsibilities:

 

  • Strong understanding of the Software Development Life Cycle (SDLC) with Agile Methodologies
  • Knowledge and understanding of industry-standard/best practices requirements gathering methodologies.
  • Knowledge and understanding of Information Technology systems and software development.
  • Experience with data modeling and test data management tools.
  • Experience in the data integration project • Good problem solving & decision-making skills.
  • Good communication skills within the team, site, and with the customer

 

Knowledge, Skills and Abilities

 

  • Technical expertise in data architecture principles and design aspects of various DBMS and reporting concepts.
  • Solid understanding of key DBMS platforms like SQL Server, Azure SQL
  • Results-oriented, diligent, and works with a sense of urgency. Assertive, responsible for his/her own work (self-directed), have a strong affinity for defining work in deliverables, and be willing to commit to deadlines.
  • Experience in MDM tools like MS DQ, SAS DM Studio, Tamr, Profisee, Reltio etc.
  • Experience in Report and Dashboard development
  • Statistical and Machine Learning models
  • Python (sklearn, numpy, pandas, genism)
  • Nice to Have:
  • 1yr of ETL experience
  • Natural Language Processing
  • Neural networks and Deep learning
  • xperience in keras,tensorflow,spacy, nltk, LightGBM python library

 

Interaction :  Frequently interacts with subordinate supervisors.

Education : Bachelor’s degree, preferably in Computer Science, B.E or other quantitative field related to the area of assignment. Professional certification related to the area of assignment may be required

Experience :  7 years of Pharmaceutical /Biotech/life sciences experience, 5 years of Clinical Trials experience and knowledge, Excellent Documentation, Communication, and Presentation Skills including PowerPoint

 

About Infogain

Founded
Type
Size
employees
Stage
View full company details
Why apply to jobs via Cutshort
Personalized job matches
Stop wasting time. Get matched with jobs that meet your skills, aspirations and preferences.
Verified hiring teams
See actual hiring teams, find common social connections or connect with them directly. No 3rd party agencies here.
Move faster with AI
We use AI to get you faster responses, recommendations and unmatched user experience.
2101133
Matches delivered
3712187
Network size
15000
Companies hiring

Similar jobs

Data Engineer

at Indium Software

Founded 1999  •  Services  •  100-1000 employees  •  Profitable
SQL
Python
Data Analytics
Data Visualization
PowerBI
Tableau
Qlikview
Spotfire
Scala
Spark
icon
Remote only
icon
1 - 7 yrs
icon
₹7L - ₹15L / yr

What we ask

  • 2+ years of Data Engineering Experience - Design, develop, deliver and maintain data infrastructures.
  • SQL Specialist – Strong knowledge and Seasoned experience with SQL Queries (strong in outer joins, aggregations, unions, window functions & CTE’s)
  • Languages: Python
  • Good communicator, shows initiative, works well with stakeholders.
  • Experience working closely with Data Analysts and provide the data they need and guide them on the issues.
  • Solid ETL experience and Hadoop/Hive/Pyspark/Presto/ SparkSQL  
  • Solid communication and articulation skills
  • Able to handle stakeholders independently with less interventions of reporting manager.
  • Develop strategies to solve problems in logical yet creative ways.
  • Create custom reports and presentations accompanied by strong data visualization and storytelling
Job posted by
Swaathipriya P

Analytics

at ProGrad

Founded 2018  •  Services  •  20-100 employees  •  Profitable
Python
Java
Tableau
SQL
PowerBI
icon
Chennai
icon
1 - 4 yrs
icon
₹3L - ₹8L / yr
Company Name: LatentView Analytics

Job Summary :


Independently handle the delivery of analytics assignments by mentoring a team of 3 - 10 people and delivering to exceed client expectations

Responsibilities :

- Co-ordinate with onsite company consultants to ensure high quality, on-time delivery

- Take responsibility for technical skill-building within the organization (training, process definition, research of new tools and techniques etc.)

- Take part in organizational development activities to take company to the next level

Qualification, Skills & Prior Work Experience :

- Great analytical skills, detail-oriented approach

- Sound knowledge in MS Office tools like Excel, Power Point and data visualization tools like Tableau, PowerBI or such tools

- Strong experience in SQL, Python, SAS, SPSS, Statistica, R, MATLAB or such tools would be preferable

- Ability to adapt and thrive in the fast-paced environment that young companies operate in

- Priority for people with analytics work experience

- Programming skills- Java/Python/SQL/OOPS based programming knowledge

Job Location : Chennai, Work from Home will be provided until COVID situation improves

Note :

- Minimum one year experience needed

- Only 2019, 2020 and 2020 passed outs applicable

- Only above 70% aggregate throughout studies is applicable

- POST GRADUATION is must
Job posted by
Heruba C

Data Scientist

at Impetus Technologies

Founded 2005  •  Products & Services  •  1000-5000 employees  •  Profitable
Data Science
Pricing Strategy
Python
Predictive analytics
Pricing models
Machine Learning (ML)
icon
Bengaluru (Bangalore)
icon
4 - 8 yrs
icon
₹20L - ₹35L / yr
Looking for Data Scientist with strong expertise in Classical Machine Learning algorithms and strong expertise in SQL and Python.
Experience in Pricing models will be definite plus
Job posted by
Gangadhar T.M
Data Analytics
SQL server
SQL
Data Analyst
icon
Bengaluru (Bangalore)
icon
3 - 8 yrs
icon
₹15L - ₹18L / yr

1. Ability to work independently and to set priorities while managing several projects simultaneously; strong attention to detail is essential.
2.Collaborates with Business Systems Analysts and/or directly with key business users to ensure business requirements and report specifications are documented accurately and completely.
3.Develop data field mapping documentation.
4. Document data sources and processing flow.
5. Ability to design, refine and enhance existing reports from source systems or data warehouse.
6.Ability to analyze and optimize data including data deduplication required for reports.
7. Analysis and rationalization of reports.
8. Support QA and UAT teams in defining test scenarios and clarifying requirements.
9. Effectively communicate results of the data analysis to internal and external customers to support decision making.
10.Follows established SDLC, change control, release management and incident management processes.
11.Perform source data analysis and assessment.
12. Perform data profiling to capture business and technical rules.
13. Track and help to remediate issues and defects due to data quality exceptions.


Job posted by
Harpreet kour

Machine Learning Engineer

at IDfy

Founded 2011  •  Products & Services  •  100-1000 employees  •  Raised funding
Machine Learning (ML)
Python
TensorFlow
PyTorch
Scikit-Learn
elixir
icon
Mumbai, Pune
icon
1 - 3 yrs
icon
₹6L - ₹14L / yr
About the team
● The machine learning team is a self-contained team of 9 people responsible for building models and services that support key workflows for IDfy.
● Our models are gating criteria for these workflows and as such are expected to perform accurately and quickly. We use a mix of conventional and hand-crafted deep learning models.
● The team comes from diverse backgrounds and experiences. We have ex-bankers, startup founders, IIT-ians, and more.
● We work directly with business and product teams to craft solutions for our customers. We know that we are, and function as a platform and not a services company.

● Be working on all aspects of a production machine learning system. You will be acquiring data, training and building models, deploying models, building API services for exposing these models, maintaining them in production, and more.
● Work on performance tuning of models
● From time to time work on support and debugging of these production systems
● Work on researching the latest technology in the areas of our interest and applying it to build newer products and enhancement of the existing platform.
● Building workflows for training and production systems
● Contribute to documentation

About you

● You are an early-career machine learning engineer (or data scientist). Our ideal candidate is
someone with 1-3 years of experience in data science.

Must Haves

● You have a good understanding of Python and Scikit-learn, Tensorflow, or Pytorch. Our systems are built with these tools/language and we expect a strong base in these.
● You are proficient at exploratory analysis and know which model to use in most scenarios
● You should have worked on framing and solving problems with the application of machine learning or deep learning models.
● You have some experience in building and delivering complete or part AI solutions
● You appreciate that the role of the Machine Learning engineer is not only modeling, but also building product solutions and you strive towards this.
● Enthusiasm and drive to learn and assimilate the state of art research. A lot of what we are building will require innovative approaches using newly researched models and applications.

Good to Have

● Knowledge of and experience in computer vision. While a large part of our work revolves around computer
vision, we believe this is something you can learn on the job.
● We build our own services, hence we would want you to have some knowledge of writing APIs.
● Our stack also includes languages like Ruby, Go, and Elixir. We would love it if you know any of these or take an interest in functional programming.
● Knowledge of and experience in ML Ops and tooling would be a welcome addition. We use Docker and Kubernetes for deploying our services.
Job posted by
Rati from

Data Scientist

at upGrad

Founded 2015  •  Product  •  100-500 employees  •  Raised funding
Data Science
R Programming
Python
SQL
Natural Language Processing (NLP)
Machine Learning (ML)
Tableau
icon
Bengaluru (Bangalore), Mumbai
icon
4 - 6 yrs
icon
₹10L - ₹21L / yr

About Us

upGrad is an online education platform building the careers of tomorrow by offering the most industry-relevant programs in an immersive learning experience. Our mission is to create a new digital-first learning experience to deliver tangible career impact to individuals at scale. upGrad currently offers programs in Data Science, Machine Learning, Product Management, Digital Marketing, and Entrepreneurship, etc. upGrad is looking for people passionate about management and education to help design learning programs for working professionals to stay sharp and stay relevant and help build the careers of tomorrow.

  • upGrad was awarded the Best Tech for Education by IAMAI for 2018-19

  • upGrad was also ranked as one of the LinkedIn Top Startups 2018: The 25 most sought-

    after startups in India

  • upGrad was earlier selected as one of the top ten most innovative companies in India

    by FastCompany.

  • We were also covered by the Financial Times along with other disruptors in Ed-Tech

  • upGrad is the official education partner for Government of India - Startup India

    program

  • Our program with IIIT B has been ranked #1 program in the country in the domain of Artificial Intelligence and Machine Learning

     

    Role Summary

    Are you excited by the challenge and the opportunity of applying data-science and data- analytics techniques to the fast developing education technology domain? Do you look forward to, the sense of ownership and achievement that comes with innovating and creating data products from scratch and pushing it live into Production systems? Do you want to work with a team of highly motivated members who are on a mission to empower individuals through education?
    If this is you, come join us and become a part of the upGrad technology team. At upGrad the technology team enables all the facets of the business - whether it’s bringing efficiency to ourmarketing and sales initiatives, to enhancing our student learning experience, to empowering our content, delivery and student success teams, to aiding our student’s for their desired careeroutcomes. We play the part of bringing together data & tech to solve these business problems and opportunities at hand.
    We are looking for an highly skilled, experienced and passionate data-scientist who can come on-board and help create the next generation of data-powered education tech product. The ideal candidate would be someone who has worked in a Data Science role before wherein he/she is comfortable working with unknowns, evaluating the data and the feasibility of applying scientific techniques to business problems and products, and have a track record of developing and deploying data-science models into live applications. Someone with a strong math, stats, data-science background, comfortable handling data (structured+unstructured) as well as strong engineering know-how to implement/support such data products in Production environment.
    Ours is a highly iterative and fast-paced environment, hence being flexible, communicating well and attention-to-detail are very important too. The ideal candidate should be passionate about the customer impact and comfortable working with multiple stakeholders across the company.


    Roles & Responsibilities

      • 3+ years of experience in analytics, data science, machine learning or comparable role
      • Bachelor's degree in Computer Science, Data Science/Data Analytics, Math/Statistics or related discipline 
      • Experience in building and deploying Machine Learning models in Production systems
      • Strong analytical skills: ability to make sense out of a variety of data and its relation/applicability to the business problem or opportunity at hand
      • Strong programming skills: comfortable with Python - pandas, numpy, scipy, matplotlib; Databases - SQL and noSQL
      • Strong communication skills: ability to both formulate/understand the business problem at hand as well as ability to discuss with non data-science background stakeholders 
      • Comfortable dealing with ambiguity and competing objectives

       

      Skills Required

      • Experience in Text Analytics, Natural Language Processing

      • Advanced degree in Data Science/Data Analytics or Math/Statistics

      • Comfortable with data-visualization tools and techniques

      • Knowledge of AWS and Data Warehousing

      • Passion for building data-products for Production systems - a strong desire to impact

        the product through data-science technique

Job posted by
Priyanka Muralidharan

Team Lead- Data Delivery

at Service company, helps businesses harness the power of data

Agency job
via Jobdost
Python
Ruby
Ruby on Rails (ROR)
Data Structures
Algorithms
DOM
XPath
Selenium
Automated testing
icon
Remote only
icon
4 - 8 yrs
icon
₹10L - ₹20L / yr

About the Company:

 It is a Data as a Service company that helps businesses harness the power of data. Our technology fuels some of the most interesting big data projects of the word. We are a small bunch of people working towards shaping the imminent data-driven future by solving some of its fundamental and toughest challenges. 

 

 

Role: We are looking for an experienced team lead to drive data acquisition projects end to end. In this role, you will be working in the web scraping team with data engineers, helping them solve complex web problems and mentor them along the way. You’ll be adept at delivering large-scale web crawling projects, breaking down barriers for your team and planning at a higher level, and getting into the detail to make things happen when needed.  

 

Responsibilities  

  •  Interface with clients and sales team to translate functional requirements into technical requirements 
  •  Plan and estimate tasks with your team, in collaboration with the delivery managers 
  •  Engineer complex data acquisition projects 
  •  Guide and mentor your team of engineers 
  •  Anticipate issues that might arise and proactively consider those into design 
  •  Perform code reviews and suggest design changes 

 

 

Prerequisites 

  •  Between 5-8 years of relevant experience 
  • Fluent programming skills and well-versed with scripting languages like Python or Ruby 
  • Solid foundation in data structures and algorithms 
  • Excellent tech troubleshooting skills 
  • Good understanding of web data landscape 
  • Prior exposure to DOM, XPATH and hands on experience with selenium/automated testing is a plus 

 

Skills and competencies 

  • Prior experience with team handling and people management is mandatory 
  • Work independently with little to no supervision 
  • Extremely high attention to detail  
  •  Ability to juggle between multiple projects  
Job posted by
Ankitha Vyas

Data Scientist

at One Labs

Founded 2015  •  Product  •  20-100 employees  •  Raised funding
Data Science
Deep Learning
Python
Keras
TensorFlow
Machine Learning (ML)
icon
NCR (Delhi | Gurgaon | Noida)
icon
1 - 3 yrs
icon
₹3L - ₹6L / yr

Job Description


We are looking for a data scientist that will help us to discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver even better products. Your primary focus will be in applying data mining techniques, doing statistical analysis, and building high quality prediction systems integrated with our products. 

Responsibilities

  • Selecting features, building and optimizing classifiers using machine learning techniques
  • Data mining using state-of-the-art methods
  • Extending company’s data with third party sources of information when needed
  • Enhancing data collection procedures to include information that is relevant for building analytic systems
  • Processing, cleansing, and verifying the integrity of data used for analysis
  • Doing ad-hoc analysis and presenting results in a clear manner
  • Creating automated anomaly detection systems and constant tracking of its performance

Skills and Qualifications

  • Excellent understanding of machine learning techniques and algorithms, such as Linear regression, SVM, Decision Forests, LSTM, CNN etc.
  • Experience with Deep Learning preferred.
  • Experience with common data science toolkits, such as R, NumPy, MatLab, etc. Excellence in at least one of these is highly desirable
  • Great communication skills
  • Proficiency in using query languages such as SQL, Hive, Pig 
  • Good applied statistics skills, such as statistical testing, regression, etc.
  • Good scripting and programming skills 
  • Data-oriented personality
Job posted by
Rahul Gupta

Lead Data Scientist

at Spotmentor Technologies

Founded 2018  •  Product  •  20-100 employees  •  Raised funding
Python
Machine Learning (ML)
Natural Language Processing (NLP)
NOSQL Databases
icon
NCR (Delhi | Gurgaon | Noida)
icon
2 - 5 yrs
icon
₹20L - ₹30L / yr
Spotmentor is focussed on using the Intelligence-age tools and technologies like AI and Text analytics to create HR technology products which go beyond compliance and ERPs to give HR the power to become strategic, improve business results and increase the competitiveness. The HR and People departments have long sought to become strategic partners with businesses. We are focussed on taking this concept out of the board room meetings and making it a reality and you can be a part of this journey. At the end of it, you would be able to claim that there was an inflection point in History, which changed how business was transacted and you made that happen. Our first product is a Learning and Skill development platform which helps the organisations to acquire capabilities critical for them by helping employees attain their best potential through learning opportunities. Spotmentor was started by 4 IIT Kharagpur alumni with experiences in creating Technology products and Management consulting. We are looking for a Data Scientist who will help discover the information hidden in vast amounts of data, and help us make smarter decisions that benefit the employees of our customer organisations. Your primary focus will be on applying data mining techniques, doing statistical analysis, and building high quality prediction systems using structured and unstructured data. Technical Responsibilities: - Selecting features, building and optimizing classifiers using machine learning techniques - Data mining using state-of-the-art methods - Extending the existing data sets with third party sources of information - Processing, cleansing, and verifying the integrity of data used for analysis - Build recommendation systems - Automate scoring of documents using machine learning techniques Salary: This is a founding team member role with a salary of 20 Lacs to 30 Lacs per year and a meaningful ESOP component. Location: Gurgaon We believe in making Spotmentor the best place for the pursuit of excellence and diversity of opinions is an important tool to achieve that. Although as a startup our primary objective is growth, Spotmentor is focussed on creating a diverse and inclusive workplace where everyone can attain their best potential and we welcome female, minority and specially abled candidates to apply.
Job posted by
Deepak Singh

Python Machine Learning Developer

at SpotDraft

Founded 2017  •  Product  •  0-20 employees  •  Raised funding
Python
TensorFlow
caffee
icon
Noida, NCR (Delhi | Gurgaon | Noida)
icon
3 - 7 yrs
icon
₹3L - ₹24L / yr
We are building the AI core for a Legal Workflow solution. You will be expected to build and train models to extract relevant information from contracts and other legal documents. Required Skills/Experience: - Python - Basics of Deep Learning - Experience with one ML framework (like TensorFlow, Keras, Caffee) Preferred Skills/Expereince: - Exposure to ML concepts like LSTM, RNN and Conv Nets - Experience with NLP and Stanford POS tagger
Job posted by
Madhav Bhagat
Did not find a job you were looking for?
icon
Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.
Get to hear about interesting companies hiring right now
iconFollow Cutshort
Want to apply to this role at Infogain?
Why apply via Cutshort?
Connect with actual hiring teams and get their fast response. No spam.
Learn more
Get to hear about interesting companies hiring right now
iconFollow Cutshort