We are looking for a Spark developer who knows how to fully exploit the potential of our Spark cluster. You will clean, transform, and analyze vast amounts of raw data from various systems using Spark to provide ready-to-use data to our feature developers and business analysts. This involves both ad-hoc requests as well as data pipelines that are embedded in our production
Requirements:
The candidate should be well-versed in the Scala programming language Should have experience in Spark Architecture and Spark Internals Exp in AWS is preferable Should have experience in the full life cycle of at least one big data application
About Innova Solutions
Similar jobs
Team:- We are a team of 9 data scientists working on Video Analytics Projects, Data Analytics projects for internal AI requirements of Reliance Industries as well for the external business. At a time, we make progress on multiple projects(atleast 4) in Video Analytics or Data Analytics.
Job Description
We are looking for an experienced engineer to join our data science team, who will help us design, develop, and deploy machine learning models in production. You will develop robust models, prepare their deployment into production in a controlled manner, while providing appropriate means to monitor their performance and stability after deployment.
What You’ll Do will include (But not limited to):
- Preparing datasets needed to train and validate our machine learning models
- Anticipate and build solutions for problems that interrupt availability, performance, and stability in our systems, services, and products at scale.
- Defining and implementing metrics to evaluate the performance of the models, both for computing performance (such as CPU & memory usage) and for ML performance (such as precision, recall, and F1)
- Supporting the deployment of machine learning models on our infrastructure, including containerization, instrumentation, and versioning
- Supporting the whole lifecycle of our machine learning models, including gathering data for retraining, A/B testing, and redeployments
- Developing, testing, and evaluating tools for machine learning models deployment, monitoring, retraining.
- Working closely within a distributed team to analyze and apply innovative solutions over billions of documents
- Supporting solutions ranging from rule-bases, classical ML techniques to the latest deep learning systems.
- Partnering with cross-functional team members to bring large scale data engineering solutions to production
- Communicating your approach and results to a wider audience through presentations
Your Qualifications:
- Demonstrated success with machine learning in a SaaS or Cloud environment, with hands–on knowledge of model creation and deployments in production at scale
- Good knowledge of traditional machine learning methods and neural networks
- Experience with practical machine learning modeling, especially on time-series forecasting, analysis, and causal inference.
- Experience with data mining algorithms and statistical modeling techniques for anomaly detection in time series such as clustering, classification, ARIMA, and decision trees is preferred.
- Ability to implement data import, cleansing and transformation functions at scale
- Fluency in Docker, Kubernetes
- Working knowledge of relational and dimensional data models with appropriate visualization techniques such as PCA.
- Solid English skills to effectively communicate with other team members
Due to the nature of the role, it would be nice if you have also:
- Experience with large datasets and distributed computing, especially with the Google Cloud Platform
- Fluency in at least one deep learning framework: PyTorch, TensorFlow / Keras
- Experience with No–SQL and Graph databases
- Experience working in a Colab, Jupyter, or Python notebook environment
- Some experience with monitoring, analysis, and alerting tools like New Relic, Prometheus, and the ELK stack
- Knowledge of Java, Scala or Go-Lang programming languages
- Familiarity with KubeFlow
- Experience with transformers, for example the Hugging Face libraries
- Experience with OpenCV
About Egnyte
In a content critical age, Egnyte fuels business growth by enabling content-rich business processes, while also providing organizations with visibility and control over their content assets. Egnyte’s cloud-native content services platform leverages the industry’s leading content intelligence engine to deliver a simple, secure, and vendor-neutral foundation for managing enterprise content across business applications and storage repositories. More than 16,000 customers trust Egnyte to enhance employee productivity, automate data management, and reduce file-sharing cost and complexity. Investors include Google Ventures, Kleiner Perkins, Caufield & Byers, and Goldman Sachs. For more information, visit www.egnyte.com
#LI-Remote
About us
- Innoviti (http://www.innoviti.com/">innoviti.com) is India’s largest provider of payment solutions to Enterprise merchants.
- Processing $10 Bn of payments from 2000 cities. We today have 76% of Enterprise market share.
- Enterprises such as Reliance, Landmark Group, Shoppers Stop, Pantaloons, Hamleys, Van Heusen, Louise Phillipe, Madura Garments and several other hundreds of Enterprises are our customers.
- Our vision is to use technology to unlock the hidden value in payments, helping large and small businesses constantly find new and unique ways to grow faster with lesser efforts.
- We believe payment transactions are more than money moving pipes. Whenever a payment happens, it is not only the merchant who makes money, but also the brand whose product was sold and the bank whose payment instrument was used. A merchant, brand, and bank talk to the same consumer, however there is no easy way for them to talk to each other. Our technology makes this collaboration happen, and makes it happen at the point of payment. We help them share customers and share marketing budgets, in turn targeting customers better than possible otherwise. By making it happen at the point of payment we ensure for each party that their marketing efforts are translating into sales.
- Today our technology has helped large businesses grow faster with lesser efforts. They have been able to offer EMI, BNPL, Cashbacks, Loyalty Point redemption and other offers with merchants, brands and banks participating in it. Every day new use cases are designed and stitched using our technology platform.
- Our next target are small merchants. For them, the problems are far larger and more complex. They have no means to access the marketing budgets or customers of large brands or banks. They cannot even cross the regional offices of these businesses. This is where we come in. We are bringing the power of the technology created for large merchants to these businesses. Helping bring the power of large brands and banks with whom we already partner to small merchants.
- Innoviti’s first target category in this segment is small merchants selling electronic goods – mobiles, durables, laptops etc. They are struggling the most as their customers find attractive offers online through partner brands and banks, which they can’t match in their stores. These businesses offer the advantage of touch, feel and explanation in local language. What they lack is the technology to discover, access customers and attract them with superior offers.
- GENIE, our smart marketing platform helps them do that. It is a mobile app that integrates with bank and brands on one end and Google and Facebook on the other. It not only helps them discover and access customers searching for these products online, but also attract them to their stores with offers better than online.
- The app was launched in July 2021 and has shown a 100% month-on-month growth, with more than 23% of the merchant’s monthly sales now happening through it.
- In the next 5 years Innoviti wants to scale up its GTV from $10Bn to $30Bn, multiplying its revenue 10X.
- We want to be recognized in this space as the company that transformed the payments industry by showing ways of extracting value from payment transactions better than anyone else. We want to be setting the path for the next generation of payment solutions, that will be followed by others.
- Towards this we need to hire aggressive talented individuals, who firmly believe that no problem is unsolvable. That there is always a way to solve the toughest of problems. Those who get excited by solving such tough problems at scale, and seeing the impact of that around them in the market. Those who feel fulfilled when they see that it is making a difference to the lives of people around them.
To drive this roadmap the company is looking for talent who are ready to do what no payments platform has done before and what every payment platform will do thereafter.
Would you like to join this journey?
Job description:
Designation: Database Administrator
Location: Bangalore
Responsibilities
- Provision MySQL instances, both in clustered and non-clustered configurations
- Ensure performance, security, and availability of databases
- Prepare documentations and specifications
- Handle common database procedures, such as upgrade, backup, recovery, migration, etc.
- Profile server resource usage, optimize and tweak as necessary
- Collaborate with other team members and stakeholders
Skills and Qualification:
- Graduate degree in Computer Science, Computer Engineering, or related technical discipline from Tier-1 Institutes
- 4-6 years of proven experience in Database Administration
- Decent experience with recent versions of MySQL
- Understanding of MySQL’s underlying storage engines, such as InnoDB and MyISAM
- Experience with replication configuration in MySQL
- Knowledge of de-facto standards and best practices in MySQL
- Proficient in writing and optimizing SQL statements
- Knowledge of MySQL features, such as its event scheduler
- Ability to plan resource requirements from high level specifications
- Familiarity with other SQL/NoSQL databases such as PostgreSQL, MongoDB, etc.
- Knowledge of limitations in MySQL and their workarounds in contrast to other popular relational databases
Personality:
The ideal person for this role would be someone who loves the challenge of an entrepreneurial environment, who has high ownership to be available 24x7 for our customers, who is able to deal with complexity and rapid change and who has large dreams to be part of interesting journey.
The person would be self-driven, results-oriented with a positive outlook and impeccable integrity. He/she would have a track record of delivering results consistently in uncertain environments. Excellent communication skills with an ability to manage crucial conversations with senior stakeholders.
Retail industry demands “I’ll do it now instead of tomorrow” attitude. Please be prepared for interesting journey if you want to grow fast with no age barriers.
● Proficient in Python and using packages like NLTK, Numpy, Pandas
● Should have worked on deep learning frameworks (like Tensorflow, Keras, PyTorch, etc)
● Hands-on experience in Natural Language Processing, Sequence, and RNN Based models
● Mathematical intuition of ML and DL algorithms
● Should be able to perform thorough model evaluation by creating hypotheses on the basis of statistical
analyses
● Should be comfortable in going through open-source code and reading research papers.
Roles and Responsibilities
- Managing available resources such as hardware, data, and personnel so that deadlines are met.
- Analyzing the ML and Deep Learning algorithms that could be used to solve a given problem and ranking them by their success probabilities
- Exploring data to gain an understanding of it, then identifying differences in data distribution that could affect performance when deploying the model in the real world
- Defining validation framework and establish a process to ensure acceptable data quality criteria are met
- Supervising the data acquisition and partnership roadmaps to create stronger product for our customers.
- Defining feature engineering process to ensure usage of meaningful features given the business constraints which may vary by market
- Device self-learning strategies through analysis of errors from the models
- Understand business issues and context, devise a framework for solving unstructured problems and articulate clear and actionable solutions underpinned by analytics.
- Manage multiple projects simultaneously while demonstrating business leadership to collaborate & coordinate with different functions to deliver the solutions in a timely, efficient and effective manner.
- Manage project resources optimally to deliver projects on time; drive innovation using residual resources to create strong solution pipeline; provide direction, coaching & training, feedbacks to project team members to enhance performance, support development and encourage value aligned behaviour of the project team members; Provide inputs for periodic performance appraisal of project team members.
Preferred Technical & Professional expertise
- Undergraduate Degree in Computer Science / Engineering / Mathematics / Statistics / economics or other quantitative fields
- At least 2+ years of experience of managing Data Science projects with specializations in Machine Learning
- In-depth knowledge of cloud analytics tools.
- Able to drive Python Code optimization; ability review codes and provide inputs to improve the quality of codes
- Ability to evaluate hardware selection for running ML models for optimal performance
- Up to date with Python libraries and versions for machine learning; Extensive hands-on experience with Regressors; Experience working with data pipelines.
- Deep knowledge of math, probability, statistics and algorithms; Working knowledge of Supervised Learning, Adversarial Learning and Unsupervised learning
- Deep analytical thinking with excellent problem-solving abilities
- Strong verbal and written communication skills with a proven ability to work with all levels of management; effective interpersonal and influencing skills.
- Ability to manage a project team through effectively allocation of tasks, anticipating risks and setting realistic timelines for managing the expectations of key stakeholders
- Strong organizational skills and an ability to balance and handle multiple concurrent tasks and/or issues simultaneously.
- Ensure that the project team understand and abide by compliance framework for policies, data, systems etc. as per group, region and local standards
lesser concentration on enforcing how to do a particular task, we believe in giving people the opportunity to think out of the box and come up with their own innovative solution to problem solving.
You will primarily be developing, managing and executing handling multiple prospect campaigns as part of Prospect Marketing Journey to ensure best conversion rates and retention rates. Below are the roles, responsibilities and skillsets we are looking for and if you feel these resonate with you, please get in touch with us by applying to this role.
Roles and Responsibilities:
• You'd be responsible for development and maintenance of applications with technologies involving Enterprise Java and Distributed technologies.
• You'd collaborate with developers, product manager, business analysts and business users in conceptualizing, estimating and developing new software applications and enhancements.
• You'd Assist in the definition, development, and documentation of software’s objectives, business requirements, deliverables, and specifications in collaboration with multiple cross-functional teams.
• Assist in the design and implementation process for new products, research and create POC for possible solutions.
Skillset:
• Bachelors or Masters Degree in a technology related field preferred.
• Overall experience of 2-3 years on the Big Data Technologies.
• Hands on experience with Spark (Java/ Scala)
• Hands on experience with Hive, Shell Scripting
• Knowledge on Hbase, Elastic Search
• Development experience In Java/ Python is preferred
• Familiar with profiling, code coverage, logging, common IDE’s and other
development tools.
• Demonstrated verbal and written communication skills, and ability to interface with Business, Analytics and IT organizations.
• Ability to work effectively in short-cycle, team oriented environment, managing multiple priorities and tasks.
• Ability to identify non-obvious solutions to complex problems
This person MUST have:
- B.E Computer Science or equivalent
- 5 years experience with the Django framework
- Experience with building APIs (REST or GraphQL)
- Strong Troubleshooting and debugging skills
- React.js knowledge would be an added bonus
- Understanding on how to use a database like Postgres (prefered choice), SQLite, MongoDB, MySQL.
- Sound knowledge of object-oriented design and analysis.
- A strong passion for writing simple, clean and efficient code.
- Proficient understanding of code versioning tools Git.
- Strong communication skills.
Experience:
- Min 5 year experience
- Startup experience is a must.
Location:
- Remote developer
Timings:
- 40 hours a week but with 4 hours a day overlapping with client timezone. Typically clients are in California PST Timezone.
Position:
- Full time/Direct
- We have great benefits such as PF, medical insurance, 12 annual company holidays, 12 PTO leaves per year, annual increments, Diwali bonus, spot bonuses and other incentives etc.
- We dont believe in locking in people with large notice periods. You will stay here because you love the company. We have only a 15 days notice period.
Aikon Labs Pvt Ltd is a start-up focused on Realizing Ideas. One such idea is iEngage.io , our Intelligent Engagement Platform. We leverage Augmented Intelligence, a combination of machine-driven insights & human understanding, to serve a timely response to every interaction from the people you care about.
Get in touch If you are interested.
Do you have a passion to be a part of an innovative startup? Here’s an opportunity for you - become an active member of our core platform development team.
Main Duties
● Quickly research the latest innovations in Machine Learning, especially with respect to
Natural Language Understanding & implement them if useful
● Train models to provide different insights, mainly from text but also other media such as Audio and Video
● Validate the models trained. Fine-tune & optimise as necessary
● Deploy validated models, wrapped in a Flask server as a REST API or containerize in docker containers
● Build preprocessing pipelines for the models that are bieng served as a REST API
● Periodically, test & validate models in use. Update where necessary
Role & Relationships
We consider ourselves a team & you will be a valuable part of it. You could be reporting to a Senior member or directly to our Founder, CEO
Educational Qualifications
We don’t discriminate. As long as you have the required skill set & the right attitude
Experience
Upto two years of experience, preferably working on ML. Freshers are welcome too!
Skills
Good
● Strong understanding of Java / Python
● Clarity on concepts of Data Science
● A strong grounding in core Machine Learning
● Ability to wrangle & manipulate data into a processable form
● Knowledge of web technologies like Web server (Flask, Django etc), REST API's
Even better
● Experience with deep learning
● Experience with frameworks like Scikit-Learn, Tensorflow, Pytorch, Keras
Competencies
● Knowledge of NLP libraries such as NLTK, spacy, gensim.
● Knowledge of NLP models such as Wod2vec, Glove, ELMO, Fasttext
● An aptitude to solve problems & learn something new
● Highly self-motivated
● Analytical frame of mind
● Ability to work in fast-paced, dynamic environment
Location
Pune
Remuneration
Once we meet, we shall make an offer depending on how good a fit you are & the experience you already have