Cutshort logo
IT solutions specialized in Apps Lifecycle management. (MG1) logo
MLOps Lead Engineer
IT solutions specialized in Apps Lifecycle management. (MG1)
MLOps Lead Engineer
IT solutions specialized in Apps Lifecycle management. (MG1)'s logo

MLOps Lead Engineer

at IT solutions specialized in Apps Lifecycle management. (MG1)

Agency job
5 - 6 yrs
₹10L - ₹12L / yr
Bengaluru (Bangalore)
Skills
Mlops
skill iconKubernetes
skill iconDocker
Ansible
PySpark
skill iconJenkins
skill iconPython
MLOps Workflow Automation
  • Automate and maintain ML and Data pipelines at scale
  • Collaborate with Data Scientists and Data Engineers on feature development teams to containerize and build out deployment pipelines for new modules
  • Maintain and expand our on-prem deployments with spark clusters
  • Design, build and optimize applications containerization and orchestration with Docker and Kubernetes and AWS or Azure
Skills:
  • 5 years of IT experience in data-driven or AI technology products
  • Understanding of ML Model Deployment and Lifecycle
  • Extensive experience in Apache airflow for MLOps workflow automation
  • Experience is building and automating data pipelines
  • Experience in working on Spark Cluster architecture
  • Extensive experience with Unix/Linux environments
  • Experience with standard concepts and technologies used in CI/CD build, deployment pipelines using Jenkins
  • Strong experience in Python and PySpark and building required automation (using standard technologies such as Docker, Jenkins, and Ansible).
  • Experience with Kubernetes or Docker Swarm
  • Working technical knowledge of current systems software, protocols, and standards, including firewalls, Active Directory, etc.
  • Basic knowledge of Multi-tier architectures: load balancers, caching, web servers, application servers, and databases.
  • Experience with various virtualization technologies and multi-tenant, private and hybrid cloud environments.
  • Hands-on software and hardware troubleshooting experience.
  • Experience documenting and maintaining configuration and process information.
  • Basic Knowledge of machine learning frameworks: Tensorflow, Caffe/Caffe2, Pytorch
Read more
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Shubham Vishwakarma's profile image

Shubham Vishwakarma

Full Stack Developer - Averlon
I had an amazing experience. It was a delight getting interviewed via Cutshort. The entire end to end process was amazing. I would like to mention Reshika, she was just amazing wrt guiding me through the process. Thank you team.
Companies hiring on Cutshort
companies logos

Similar jobs

Gruve
Reshika Mendiratta
Posted by Reshika Mendiratta
Bengaluru (Bangalore), Pune
8yrs+
Upto ₹50L / yr (Varies
)
DevOps
CI/CD
skill iconGit
skill iconKubernetes
Ansible
+7 more

About the Company:

Gruve is an innovative Software Services startup dedicated to empowering Enterprise Customers in managing their Data Life Cycle. We specialize in Cyber Security, Customer Experience, Infrastructure, and advanced technologies such as Machine Learning and Artificial Intelligence. Our mission is to assist our customers in their business strategies utilizing their data to make more intelligent decisions. As an well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks.

 

Why Gruve:

At Gruve, we foster a culture of innovation, collaboration, and continuous learning. We are committed to building a diverse and inclusive workplace where everyone can thrive and contribute their best work. If you’re passionate about technology and eager to make an impact, we’d love to hear from you.

Gruve is an equal opportunity employer. We welcome applicants from all backgrounds and thank all who apply; however, only those selected for an interview will be contacted.

 

Position summary:

We are seeking a Staff Engineer – DevOps with 8-12 years of experience in designing, implementing, and optimizing CI/CD pipelines, cloud infrastructure, and automation frameworks. The ideal candidate will have expertise in Kubernetes, Terraform, CI/CD, Security, Observability, and Cloud Platforms (AWS, Azure, GCP). You will play a key role in scaling and securing our infrastructure, improving developer productivity, and ensuring high availability and performance. 

Key Roles & Responsibilities:

  • Design, implement, and maintain CI/CD pipelines using tools like Jenkins, GitLab CI/CD, ArgoCD, and Tekton.
  • Deploy and manage Kubernetes clusters (EKS, AKS, GKE) and containerized workloads.
  • Automate infrastructure provisioning using Terraform, Ansible, Pulumi, or CloudFormation.
  • Implement observability and monitoring solutions using Prometheus, Grafana, ELK, OpenTelemetry, or Datadog.
  • Ensure security best practices in DevOps, including IAM, secrets management, container security, and vulnerability scanning.
  • Optimize cloud infrastructure (AWS, Azure, GCP) for performance, cost efficiency, and scalability.
  • Develop and manage GitOps workflows and infrastructure-as-code (IaC) automation.
  • Implement zero-downtime deployment strategies, including blue-green deployments, canary releases, and feature flags.
  • Work closely with development teams to optimize build pipelines, reduce deployment time, and improve system reliability. 


Basic Qualifications:

  • A bachelor’s or master’s degree in computer science, electronics engineering or a related field
  • 8-12 years of experience in DevOps, Site Reliability Engineering (SRE), or Infrastructure Automation.
  • Strong expertise in CI/CD pipelines, version control (Git), and release automation.
  •  Hands-on experience with Kubernetes (EKS, AKS, GKE) and container orchestration.
  • Proficiency in Terraform, Ansible for infrastructure automation.
  • Experience with AWS, Azure, or GCP services (EC2, S3, IAM, VPC, Lambda, API Gateway, etc.).
  • Expertise in monitoring/logging tools such as Prometheus, Grafana, ELK, OpenTelemetry, or Datadog.
  • Strong scripting and automation skills in Python, Bash, or Go.


Preferred Qualifications  

  • Experience in FinOps Cloud Cost Optimization) and Kubernetes cluster scaling.
  • Exposure to serverless architectures and event-driven workflows.
  • Contributions to open-source DevOps projects. 
Read more
Series B product based company
Series B product based company
Agency job
via Qrata by Blessy Fernandes
Mumbai, Navi Mumbai
1 - 3 yrs
₹5L - ₹8L / yr
Linux/Unix
Microservices
skill iconPython
skill iconAmazon Web Services (AWS)
Amazon EC2
+12 more

Roles & Responsibilities:

  • Bachelor’s degree in Computer Science, Information Technology or a related field


  • Experience in designing and maintaining high volume and scalable micro-services architecture on cloud infrastructure


  • Knowledge in Linux/Unix Administration and Python/Shell Scripting


  • Experience working with cloud platforms like AWS (EC2, ELB, S3, Auto-scaling, VPC, Lambda), GCP, Azure


  • Knowledge in deployment automation, Continuous Integration and Continuous Deployment (Jenkins, Maven, Puppet, Chef, GitLab) and monitoring tools like Zabbix, Cloud Watch Monitoring, Nagios Knowledge of Java Virtual Machines, Apache Tomcat, Nginx, Apache Kafka, Microservices architecture, Caching mechanisms


  • Experience in enterprise application development, maintenance and operations


  • Knowledge of best practices and IT operations in an always-up, always-available service


  • Excellent written and oral communication skills, judgment and decision-making skills
Read more
Mumbai
3 - 5 yrs
₹5L - ₹10L / yr
skill iconKubernetes
DevOps
skill iconJenkins
Ansible

Main tasks

  • Supervision of the CI/CD process for the automated builds and deployments of web services and web applications as well as desktop tool in the cloud and container environment
  • Responsibility of the operations part of a DevOps organization especially for development at LS telcom in the environment of container technology and orchestration, e.g. with Kubernetes
  • Installation, operation and monitoring of web applications in cloud data centers for the purpose of development of the test as well as for the operation of an own productive cloud as LS service
  • Implementation of installations of the LS system solution especially in the container context
  • Introduction, maintenance and improvement of installation solutions for LS development in the desktop and server environment as well as in the cloud and with on-premise Kubernetes
  • Maintenance of the system installation documentation and implementation of trainings

Execution of internal software tests and support of involved teams and stakeholders

  • Hands on Experience with Azure DevOps.

Qualification profile

  • Bachelor’s or master’s degree in communications engineering, electrical engineering, physics or comparable qualification
  • Experience in software
  • Installation and administration of Linux and Windows systems including network and firewalling aspects
  • Experience with build and deployment automation with tools like Jenkins, Gradle, Argo or similar as well as system scripting (Bash, Power-Shell, etc.)
  • Interest in operation and monitoring of applications in virtualized and containerized environments in cloud and on-premise
  • Server environments, especially application, web-and database servers
  • Knowledge in VMware/K3D/Rancer is an advantage
  • Good spoken and written knowledge of English
Read more
Infra360 Solutions Pvt Ltd
at Infra360 Solutions Pvt Ltd
2 candid answers
Rahul Tripathi
Posted by Rahul Tripathi
Gurugram
1 - 4 yrs
₹5L - ₹12L / yr
Microsoft Windows Azure
Windows Azure
skill iconKubernetes
Terraform
skill iconDocker
+4 more

Infra360 Solutions is a services company specializing in Cloud, DevSecOps, Security, and Observability solutions. We help technology companies adapt DevOps culture in their organization by focusing on long-term DevOps roadmap. We focus on identifying technical and cultural issues in the journey of successfully implementing the DevOps practices in the organization and work with respective teams to fix issues to increase overall productivity. We also do training sessions for the developers and make them realize the importance of DevOps. We provide these services - DevOps, DevSecOps, FinOps, Cost Optimizations, CI/CD, Observability, Cloud Security, Containerization, Cloud Migration, Site Reliability, Performance Optimizations, SIEM and SecOps, Serverless automation, Well-Architected Review, MLOps, Governance, Risk & Compliance. We do assessments of technology architecture, security, governance, compliance, and DevOps maturity model for any technology company and help them optimize their cloud cost, streamline their technology architecture, and set up processes to improve the availability and reliability of their website and applications. We set up tools for monitoring, logging, and observability. We focus on bringing the DevOps culture to the organization to improve its efficiency and delivery.


Job Description


Our Mission 


Our mission is to help customers achieve their business objectives by providing innovative, best-in-class consulting, IT solutions and services and to make it a joy for all stakeholders to work with us. We function as a full stakeholder in business, offering a consulting-led approach with an integrated portfolio of technology-led solutions that encompass the entire Enterprise value chain.

Our Customer-centric Engagement Model defines how we engage with you, offering specialized services and solutions that meet the distinct needs of your business.


Our Culture 


Culture forms the core of our foundation and our effort towards creating an engaging workplace has resulted in Infra360 Solution Pvt Ltd.


Our Tech-Stack:

  • Azure DevOps, Azure Kubernetes Service, Docker, Active Directory (Microsoft Entra)
  • Azure IAM and managed identity, Virtual network, VM Scale Set, App Service, Cosmos
  • Azure, MySQL Scripting (PowerShell, Python, Bash), 
  • Azure Security, Security Documentation, Security Compliance, 
  • AKS, Blob Storage, Azure functions, Virtual Machines, Azure SQL
  • AWS - IAM, EC2, EKS, Lambda, ECS, Route53, Cloud formation, Cloud front, S3
  • GCP - GKE, Compute Engine, App Engine, SCC
  • Kubernetes, Linux, Docker & Microservices Architecture
  • Terraform & Terragrunt
  • Jenkins & Argocd
  • Ansible, Vault, Vagrant, SaltStack
  • CloudFront, Apache, Nginx, Varnish, Akamai
  • Mysql, Aurora, Postgres, AWS RedShift, MongoDB
  • ElasticSearch, Redis, Aerospike, Memcache, Solr
  • ELK, Fluentd, Elastic APM & Prometheus Grafana Stack
  • Java (Spring/Hibernate/JPA/REST), Nodejs, Ruby, Rails, Erlang, Python


What does this role hold for you…??


  • Infrastructure as a code (IaC)
  • CI/CD and configuration management
  • Managing Azure Active Directory (Entra)
  • Keeping the cost of the infrastructure to the minimum
  • Doing RCA of production issues and providing resolution
  • Setting up failover, DR, backups, logging, monitoring, and alerting
  • Containerizing different applications on the Kubernetes platform
  • Capacity planning of different environments infrastructure
  • Ensuring zero outages of critical services
  • Database administration of SQL and NoSQL databases
  • Setting up the right set of security measures



Requirements

Apply if you have… 


  • A graduation/post-graduation degree in Computer Science and related fields
  • 2-4 years of strong DevOps experience in Azure with the Linux environment.
  • Strong interest in working in our tech stack
  • Excellent communication skills
  • Worked with minimal supervision and love to work as a self-starter
  • Hands-on experience with at least one of the scripting languages - Bash, Python, Go etc
  • Experience with version control systems like Git
  • Understanding of Azure cloud computing services and cloud computing delivery models (IaaS, PaaS, and SaaS)
  • Strong scripting or programming skills for automating tasks (PowerShell/Bash)
  • Knowledge and experience with CI/CD tools: Azure DevOps, Jenkins, Gitlab etc.
  • Knowledge and experience in IaC at least one (ARM Templates/ Terraform)
  • Strong experience with managing the Production Systems day in and day out
  • Experience in finding issues in different layers of architecture in a production environment and fixing them
  • Experience in automation tools like Ansible/SaltStack and Jenkins
  • Experience in Docker/Kubernetes platform and managing OpenStack (desirable)
  • Experience with Hashicorp tools i.e. Vault, Vagrant, Terraform, Consul, VirtualBox etc. (desirable)
  • Experience in Monitoring tools like Prometheus/Grafana/Elastic APM.
  • Experience in logging tools Like ELK/Loki.
  • Experience in using Microsoft Azure Cloud services

If you are passionate about infrastructure, and cloud technologies, and want to contribute to innovative projects, we encourage you to apply. Infra360 offers a dynamic work environment and opportunities for professional growth. 


Interview Process


Application Screening=>Test/Assessment=>2 Rounds of Tech Interview=>CEO Round=>Final Discussion





Read more
Conviva
at Conviva
1 recruiter
Deepa S
Posted by Deepa S
Bengaluru (Bangalore)
4 - 8 yrs
₹25L - ₹28L / yr
DevOps
skill iconKubernetes
skill iconDocker
skill iconAmazon Web Services (AWS)
Google Cloud Platform (GCP)
+9 more
  • 5+ years of experience in DevOps including automated system configuration, application deployment, and infrastructure-as-code. 
  • Advanced Linux system administration abilities. 
  • Real-world experience managing large-scale AWS or GCP environments. Multi-account management a plus. 
  • Experience with managing production environments on AWS or GCP. 
  • Solid understanding CI/CD pipelines using GitHub, CircleCI/Jenkins, JFrog Artifactory/Nexus. 
  • Experience on any configuration management tools like Ansible, Puppet or Chef is a must. 
  • Experience in any one of the scripting languages: Shell, Python, etc. 
  • Experience in containerization using Docker and orchestration using Kubernetes/EKS/GKE is a must. 
  • Solid understanding of SSL and DNS. 
  • Experience on deploying and running any open-source monitoring/graphing solution like Prometheus, Grafana, etc. 
  • Basic understanding of networking concepts.
  • Always adhere to security best practices.
  • Knowledge on Bigdata (Hadoop/Druid) systems administration will be a plus.  
  • Knowledge on managing and running DBs (MySQL/MariaDB/Postgres) will be an added advantage. 

What you get to do 

  • Work with development teams to build and maintain cloud environments to specifications developed closely with multiple teams. Support and automate the deployment of applications into those environments 
  • Diagnose and resolve occurring, latent and systemic reliability issues across entire stack: hardware, software, application and network. Work closely with development teams to troubleshoot and resolve application and service issues 
  • Continuously improve Conviva SaaS services and infrastructure for availability, performance and security 
  • Implement security best practices – primarily patching of operating systems and applications 
  • Automate everything. Build proactive monitoring and alerting tools. Provide standards, documentation, and coaching to developers.
  • Participate in 12x7 on-call rotations 
  • Work with third party service/support providers for installations, support related calls, problem resolutions etc. 



Read more
Concentric AI
at Concentric AI
7 candid answers
1 product
Gopal Agarwal
Posted by Gopal Agarwal
Pune
3 - 10 yrs
₹4L - ₹50L / yr
skill iconDocker
skill iconKubernetes
DevOps
skill iconPython
skill iconJenkins
+9 more
• 3-10 yrs of industry experience
• Energetic self-starter, fast learner, with a desire to work in a startup environment
• Experience working with Public Clouds like AWS
• Operating and Monitoring cloud infrastructure on AWS
• Primary focus on building, implementing and managing operational support
• Design, Develop and Troubleshoot Automation scripts (Configuration/Infrastructure as code or others) for Managing Infrastructure
• Expert at one of the scripting languages – Python, shell, etc
• Experience with Nginx/HAProxy, ELK Stack, Ansible, Terraform, Prometheus-Grafana stack, etc
• Handling load monitoring, capacity planning, services monitoring
• Proven experience With CICD Pipelines and Handling Database Upgrade Related Issues
• Good Understanding and experience in working with Containerized environments like Kubernetes and Datastores like Cassandra, Elasticsearch, MongoDB, etc
Read more
HCL
HCL
Agency job
via Saiva System by Sunny Kumar
Bengaluru (Bangalore)
5 - 8 yrs
₹3L - ₹15L / yr
skill iconDocker
skill iconKubernetes
DevOps
skill iconJenkins
Ansible
+8 more

Client: Sony Corporation

Position: 3

Exp: 5-8Years

DevOps Engineer

Location: Bangalore

Budget: 16.5 LPA Max

 

Gerrit ,Jenkins, Rabbit MQ AWS Linux
Python ,Ansible, Tomcat ,Postgresql
Grafana ,Groovy,HTML,Shell,Apache,Git , ELK

Read more
Second generation of the Internet
Second generation of the Internet
Agency job
via MNR Solutions by Neeraj Shukla
Remote, Trivandrum
3 - 7 yrs
₹6L - ₹12L / yr
DevOps
skill iconKubernetes
CI/CD
skill iconJavascript
skill iconDocker

Role – Devops

Experience 3 – 6 Years

 

Roles & Responsibilities –

  • 3-6 years of experience in deploying and managing highly scalable fault resilient systems
  • Strong experience in container orchestration and server automation tools such as Kubernetes, Google Container Engine, Docker Swarm, Ansible, Terraform  
  • Strong experience with Linux-based infrastructures, Linux/Unix administration, AWS, Google Cloud, Azure
  • Strong experience with databases such as  MySQL, Hadoop, Elasticsearch, Redis, Cassandra, and MongoDB.
  • Knowledge of scripting languages such as Java, JavaScript, Python, PHP, Groovy, Bash.
  • Experience in configuring CI/CD pipelines using Jenkins, GitLab CI, Travis.
  • Proficient in technologies such as Docker, Kafka, Raft and Vagrant
  • Experience in implementing queueing services such as RabbitMQ, Beanstalkd, Amazon SQS and knowledge in ElasticStack is a plus.
Read more
They provide both wholesale and retail funding. PM1
They provide both wholesale and retail funding. PM1
Agency job
via Multi Recruit by Sapna Deb
Mumbai
7 - 10 yrs
₹15L - ₹20L / yr
DevOps
skill iconJenkins
Devsecops
skill iconDocker
skill iconKubernetes
+10 more
  • 7-10 years experience with secure SDLC/DevSecOps practices such as automating security processes within CI/CD pipeline.
  • At least 4 yrs. experience designing, and securing Data Lake & Web applications deployed to AWS, Azure, Scripting/Automation skills on Python, Shell, YAML, JSON
  • At least 4 years of hands-on experience with software development lifecycle, Agile project management (e.g. Jira, Confluence), source code management (e.g. Git), build automation (e.g. Jenkins), code linting and code quality (e.g. SonarQube), test automation (e.g. Selenium)
  • Hand-on & Solid understanding of Amazon Web Services & Azure-based Infra & applications
  • Experience writing cloud formation templates, Jenkins, Kubernetes, Docker, and microservice application architecture and deployment.
  • Strong know-how on VA/PT integration in CI/CD pipeline.
  • Experience in handling financial solutions & customer-facing applications

Roles

  • Accelerate enterprise cloud adoption while enabling rapid and stable delivery of capabilities using continuous integration and continuous deployment principles, methodologies, and technologies
  • Manage & deliver diverse cloud [AWS, Azure, GCP] DevSecOps journeys
  • Identify, prototype, engineer, and deploy emerging software engineering methodologies and tools
  • Maximize automation and enhance DevSecOps pipelines and other tasks
  • Define and promote enterprise software engineering and DevSecOps standards, practices, and behaviors
  • Operate and support a suite of enterprise DevSecOps services
  • Implement security automation to decrease the loop between the development and deployment processes.
  • Support project teams to adopt & integrate the DevSecOps environment
  • Managing application vulnerabilities, Data security, encryption, tokenization, access management, Secure SDLC, SAST/DAST
  • Coordinate with development and operations teams for practical automation solutions and custom flows.
  • Own DevSecOps initiatives by providing objective, practical and relevant ideas, insights, and advice.
  • Act as Release gatekeeper with an understanding of OWASP top 10 lists of vulnerabilities, NIST SP-800-xx, NVD, CVSS scoring, etc concepts
  • Build workflows to ensure a successful DevSecOps journey for various enterprise applications. 
  • Understand the strategic direction to reach business goals across multiple projects & teams
  • Collaborate with development teams to understand project deliverables and promote DevSecOps culture
  • Formulate & deploy cloud automation strategies and tools

Skills

  • Knowledge of the DevSecOps culture and principles.
  • An understanding of cloud technologies & components
  • A flair for programming languages such as Shell, Python, Java Scripts,
  • Strong teamwork and communication skills.
  • Knowledge of threat modeling and risk assessment techniques.
  • Up-to-date knowledge of cybersecurity threats, current best practices, and the latest software.
  • An understanding of programs such as Puppet, Chef, ThreatModeler, Checkmarx, Immunio, and Aqua.
  • Strong know-how of Kubernetes, Docker, AWS, Azure-based deployments
  • On the job learning for new programming languages, automation tools, deployment architectures

 

Read more
Cross tower India Trading pvt ltd
Fauzia Khan
Posted by Fauzia Khan
Gurugram
3 - 5 yrs
₹8L - ₹12L / yr
DevOps
skill iconDocker
skill iconKubernetes
CI/CD
Distributed Systems
+7 more

Responsibilities

- Building and maintenance of resilient and scalable production infrastructure 

Improvement of monitoring systems

- Improvement of monitoring systems

- Creation and support of development automation processes (CI / CD)

- Participation in infrastructure development 

- Detection of problems in architecture and proposing of solutions for solving them 

- Creation of tasks for system improvements for system scalability, performance and monitoring

- Analysis of product requirements in the aspect of devops

- Incident analysis and fixing

Skills and Experience

- Understanding of the distributed systems principles 

- Understanding of principles for building a resistant network infrastructure 

- Experience of Ubuntu Linux administration (Debian-like will be a plus)

- Strong knowledge of Bash

- Experience of working with LXC-containers 

- Understanding and experience with infrastructure as a code approach 

- Experience of development idempotent Ansible roles

- Experience of working with git

Preferred experience

 Experience with relational databases (PostgeSQL), ability to create simple SQL queries 

- Experience with monitoring and metric collect systems (Prometheus, Grafana, Zabbix)

- Understanding of dynamic routing (OSPF) 

- Knowledge and experience of working with network equipment Cisco

- Experience of working with Cisco NX-OS

- Experience of working with IPsec, VXLAN, Open vSwitch

- Knowledge of principles of multicast protocols IGMP, PIM

- Experience of setting multicast on Cisco equipment

- Experience administering Atlassian products

Read more
Why apply to jobs via Cutshort
people_solving_puzzle
Personalized job matches
Stop wasting time. Get matched with jobs that meet your skills, aspirations and preferences.
people_verifying_people
Verified hiring teams
See actual hiring teams, find common social connections or connect with them directly. No 3rd party agencies here.
ai_chip
Move faster with AI
We use AI to get you faster responses, recommendations and unmatched user experience.
21,01,133
Matches delivered
37,12,187
Network size
15,000
Companies hiring
Did not find a job you were looking for?
icon
Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.
companies logo
companies logo
companies logo
companies logo
companies logo
Get to hear about interesting companies hiring right now
Company logo
Company logo
Company logo
Company logo
Company logo
Linkedin iconFollow Cutshort
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Shubham Vishwakarma's profile image

Shubham Vishwakarma

Full Stack Developer - Averlon
I had an amazing experience. It was a delight getting interviewed via Cutshort. The entire end to end process was amazing. I would like to mention Reshika, she was just amazing wrt guiding me through the process. Thank you team.
Companies hiring on Cutshort
companies logos