DevOps Engineer

at TestMu AI (Formerly LambdaTest)

Posted by Himanshi Tomer

1 - 5 yrs

₹6L - ₹25L / yr

Noida

Skills

DevOps

Kubernetes

Docker

Amazon Web Services (AWS)

Microsoft Azure

Google Cloud Platform (GCP)

Linux/Unix

PostgreSQL

MySQL

Amazon EC2

Amazon S3

AWS Lambda

CI/CD

Git

DevOps Engineer (Cloud & Infrastructure)

📍 Noida | 🕐 Full-Time | 🧭 Experience: 2–4 years

About TestMu AI

TestMu AI (formerly LambdaTest) is an AI-native platform designed to move software testing beyond simple automation into the era of agentic intelligence. It provides end-to-end AI agents that manage the entire Quality Engineering lifecycle.

Full-Stack AI Agents: Autonomously plan, author, execute, and analyze tests across the SDLC.
Comprehensive Coverage: Supports web, mobile, and enterprise applications.
Real-World Testing: Scale execution across real devices, browsers, and custom environments.

About the Role

This isn't a role for someone who just wants to "maintain" systems. As a DevOps Engineer at TestMu AI, you are the architect of the automated highways that power our AI agents. You will step into a fast-paced environment where you bridge the gap between cloud-native automation and core infrastructure.

You will manage complex CI/CD pipelines, troubleshoot deep-seated Linux issues, and ensure our hybrid-cloud environment (AWS/Azure) is as resilient as the code it runs.

Key Responsibilities: The Pillars of Growth

A. DevOps & Automation (50% Focus)

Platform Orchestration: Lead the migration to modular, self-healing Terraform and Helm templates.
Agentic CI/CD: Architect GitHub Actions workflows that treat AI agents as first-class citizens, automating environment promotion and risk scoring.
Kubernetes Mastery: Manage Docker and K8s clusters at an advanced level to support scalable production workloads.
Predictive Observability: Use Prometheus, Grafana, and ELK to move from reactive alerts to autonomous anomaly detection.

B. Networking & Data Center Mastery (30% Focus)

Hybrid Networking: Design and troubleshoot VPCs and subnets in Azure/AWS/GCP, paired with physical VLANs and switches in our data centers.
Bare-Metal Lifecycle: Automate hardware provisioning, RAID setup, and firmware updates for our real-device cloud.
Remote Admin: Master out-of-band management (iDRAC, iLO, IPMI) to ensure 100% remote operational capability.
Core Protocols: Own the lifecycle of DNS, DHCP, Load Balancing, and IPAM across distributed environments.

C. Development & Scripting (20% Focus)

Backend Integration: Debug and optimize Python or Go code, and understand how application logic interacts with system-level resources.
Advanced Scripting: Write idempotent Bash/Python scripts to automate complex, multi-server operational tasks.
Agentic Tooling: Support the integration of LLM-based developer tools into DevOps workflows to eliminate "toil".

The Interview Journey

We value your ability to solve problems under pressure more than your ability to memorize documentation.

Technical Round 1 (DevOps Leads): A live session focused on real-world debugging scenarios and Linux fundamentals.
Technical Round 2 (Hiring Manager / Pod Lead): An assessment of your architectural thinking, automation strategy, and team alignment.
Technical Round 3 (SVP Engineering / VP DevOps): Strategic discussion on scalability, infrastructure vision, and technical leadership.
Final Round (CEO): Mission alignment, cultural fit, and the "big picture" at TestMu AI.

Growth Timeline

This is a high-visibility role. You will receive direct mentorship from our senior engineering leadership. As you master our production environment, you will have a clear path to move into Senior DevOps Engineer or Infrastructure Architect roles as our pods scale.

Perks That Matter

Health Cover: Comprehensive insurance for you and your family.

Fresh Meals: Daily catered meals at the office.

Transport: Safe cab facilities for eligible shifts.

Pod Budgets: Dedicated engagement budgets for team building and offsites.

About TestMu AI (Formerly LambdaTest)

Founded: 2017

Type: Product

Size: 100-500

Stage: Raised funding

About

TestMu AI (formerly LambdaTest) is the world’s first full-stack Agentic AI Quality Engineering Platform.

We built TestMu AI for a reality where software is written by AI and must be shipped at machine speed.

Go Programming (Golang)

Python

React.js

Terraform

Kubernetes

Docker

Similar jobs

DevOps Engineer

at Bell Techlogix

Posted by Pemmraju VenkatVandita

Hyderabad

5 - 10 yrs

₹15L - ₹20L / yr

CI/CD

Terraform

MLOps

Machine Learning (ML)

Powershell

The DevOps Engineer will play a critical role in operationalizing artificial intelligence across Bell Techlogix client environments. This role focuses on building and supporting cloud infrastructure, CI/CD pipelines, and automation frameworks that power AI and machine learning workloads. The ideal candidate has experience supporting AI platforms such as Azure AI, Azure Machine Learning, Azure OpenAI, and ServiceNow or conversational AI platforms, and understands the operational requirements of production AI systems, including reliability, scalability, and security.

Key Responsibilities

• Design, build, and operate cloud infrastructure and platform services that support AI and machine learning workloads in production, SLA-driven managed services environments

• Implement CI/CD and MLOps pipelines to enable automated training, testing, deployment, and rollback of AI and ML models

• Develop and maintain Infrastructure as Code to provision AI-ready environments consistently across dev/test/prod

• Support AI platform operations including monitoring model health, pipeline execution, compute utilization, and data dependencies

• Partner with Machine Learning Engineers and Data Engineers to standardize deployment patterns for AI services and LLM-based solutions

• Enable secure and scalable AI integrations using APIs, messaging, and event-driven architectures

• Implement observability solutions for AI platforms, including logging, metrics, alerting, and drift detection integrations

• Troubleshoot AI platform incidents, perform root cause analysis, and implement remediation to improve reliability and automation coverage

• Apply security best practices for AI environments including secrets management, identity and access controls, network isolation, and policy enforcement

• Support AI-driven automation use cases across platforms such as Microsoft Copilot, ServiceNow, and conversational AI tools

• Collaborate with service desk, security, and architecture teams to continuously improve AI service delivery and operational maturity

Required Qualifications

• Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience

• 5+ years of experience in DevOps, cloud engineering, or platform operations, with exposure to AI or data workloads

• Hands-on experience with Microsoft Azure, including compute, networking, storage, and monitoring services

• Experience building CI/CD pipelines using Azure DevOps, GitHub Actions, or similar tools

• Working knowledge of Infrastructure as Code (Terraform and/or Bicep/ARM)

• Scripting experience using PowerShell and/or Python

• Experience supporting production platforms with incident management, change control, and root cause analysis

• Understanding of cloud security fundamentals and enterprise governance requirements

Preferred Qualifications

• Experience with Azure Machine Learning, Azure AI Services, Azure OpenAI, or MLOps frameworks

• Exposure to containerization and orchestration technologies (Docker, Kubernetes, AKS)

• Experience supporting data pipelines or feature stores used by machine learning systems

• Familiarity with ServiceNow, AI-driven ITSM workflows, or automation platforms

• Experience with observability tools

• Knowledge of Responsible AI, data governance, and compliance considerations for AI systems

• Relevant certifications (Microsoft Azure Administrator, Azure DevOps Engineer, Azure AI Engineer)