NAS management Jobs in Delhi, NCR and Gurgaon

11+ NAS management Jobs in Delhi, NCR and Gurgaon | NAS management Job openings in Delhi, NCR and Gurgaon

Apply to 11+ NAS management Jobs in Delhi, NCR and Gurgaon on CutShort.io. Explore the latest NAS management Job opportunities across top companies like Google, Amazon & Adobe.

L3 Linux Engineer

It's a Vadodara based IT company

Agency job

via Gendroit SR Solution by Sheeba Abrar

Noida, Vadodara, NCR (Delhi | Gurgaon | Noida)

10 - 15 yrs

₹10L - ₹26L / yr

RHEL

centos

RHEV

Nutanix

VMWare

+10 more

Hands-on experience on server’s OS - RHEL/CentOS/Ubuntu/RHEV
 Problem troubleshooting & Solving skills
 Hands-on Hyper Converged Infrastructure & Virtualization technology Like: VMWare, RHEV And Nutanix.
 Experience in Monitoring tools: Nagios, Icinga etc.
 Knowledge of Backup Technologies like Commvault Etc.
 Hands-on experience on storage Systems i.e. SAN/NAS, Net Backup- Dell EMC
 Knowledge of CIS Security benchmarks.
 Expert on UNIX, Shell, Bash Scripting.

Specialist I - DevOps Engineering

Global Digital Transformation Solutions Provider

Agency job

via Peak Hire Solutions by Dharati Thakkar

Bengaluru (Bangalore), Chennai, Hyderabad, Kochi (Cochin), Noida, Pune, Thiruvananthapuram

7 - 10 yrs

₹21L - ₹30L / yr

Perforce

DevOps

Git

GitHub

Python

+7 more

JOB DETAILS:

* Job Title: Specialist I - DevOps Engineering

* Industry: Global Digital Transformation Solutions Provider

* Salary: Best in Industry

* Experience: 7-10 years

* Location: Bengaluru (Bangalore), Chennai, Hyderabad, Kochi (Cochin), Noida, Pune, Thiruvananthapuram

Job Description

Job Summary:

As a DevOps Engineer focused on Perforce to GitHub migration, you will be responsible for executing seamless and large-scale source control migrations. You must be proficient with GitHub Enterprise and Perforce, possess strong scripting skills (Python/Shell), and have a deep understanding of version control concepts.

The ideal candidate is a self-starter, a problem-solver, and thrives on challenges while ensuring smooth transitions with minimal disruption to development workflows.

Key Responsibilities:

Analyze and prepare Perforce repositories — clean workspaces, merge streams, and remove unnecessary files.
Handle large files efficiently using Git Large File Storage (LFS) for files exceeding GitHub’s 100MB size limit.
Use git-p4 fusion (Python-based tool) to clone and migrate Perforce repositories incrementally, ensuring data integrity.
Define migration scope — determine how much history to migrate and plan the repository structure.
Manage branch renaming and repository organization for optimized post-migration workflows.
Collaborate with development teams to determine migration points and finalize migration strategies.
Troubleshoot issues related to file sizes, Python compatibility, network connectivity, or permissions during migration.

Required Qualifications:

Strong knowledge of Git/GitHub and preferably Perforce (Helix Core) — understanding of differences, workflows, and integrations.
Hands-on experience with P4-Fusion.
Familiarity with cloud platforms (AWS, Azure) and containerization technologies (Docker, Kubernetes).
Proficiency in migration tools such as git-p4 fusion — installation, configuration, and troubleshooting.
Ability to identify and manage large files using Git LFS to meet GitHub repository size limits.
Strong scripting skills in Python and Shell for automating migration and restructuring tasks.
Experience in planning and executing source control migrations — defining scope, branch mapping, history retention, and permission translation.
Familiarity with CI/CD pipeline integration to validate workflows post-migration.
Understanding of source code management (SCM) best practices, including version history and repository organization in GitHub.
Excellent communication and collaboration skills for cross-team coordination and migration planning.
Proven practical experience in repository migration, large file management, and history preservation during Perforce to GitHub transitions.

Skills: Github, Kubernetes, Perforce, Perforce (Helix Core), Devops Tools

Must-Haves

Git/GitHub (advanced), Perforce (Helix Core) (advanced), Python/Shell scripting (strong), P4-Fusion (hands-on experience), Git LFS (proficient)

JOB DETAILS:

* Job Title: Specialist I - DevOps Engineering

* Industry: Global Digital Transformation Solutions Provider

* Salary: Best in Industry

* Experience: 7-10 years

* Location: Bengaluru (Bangalore), Chennai, Hyderabad, Kochi (Cochin), Noida, Pune, Thiruvananthapuram

Job Description

Job Summary:

The ideal candidate is a self-starter, a problem-solver, and thrives on challenges while ensuring smooth transitions with minimal disruption to development workflows.

Key Responsibilities:

Analyze and prepare Perforce repositories — clean workspaces, merge streams, and remove unnecessary files.
Handle large files efficiently using Git Large File Storage (LFS) for files exceeding GitHub’s 100MB size limit.
Use git-p4 fusion (Python-based tool) to clone and migrate Perforce repositories incrementally, ensuring data integrity.
Define migration scope — determine how much history to migrate and plan the repository structure.
Manage branch renaming and repository organization for optimized post-migration workflows.
Collaborate with development teams to determine migration points and finalize migration strategies.
Troubleshoot issues related to file sizes, Python compatibility, network connectivity, or permissions during migration.

Required Qualifications:

Strong knowledge of Git/GitHub and preferably Perforce (Helix Core) — understanding of differences, workflows, and integrations.
Hands-on experience with P4-Fusion.
Familiarity with cloud platforms (AWS, Azure) and containerization technologies (Docker, Kubernetes).
Proficiency in migration tools such as git-p4 fusion — installation, configuration, and troubleshooting.
Ability to identify and manage large files using Git LFS to meet GitHub repository size limits.
Strong scripting skills in Python and Shell for automating migration and restructuring tasks.
Experience in planning and executing source control migrations — defining scope, branch mapping, history retention, and permission translation.
Familiarity with CI/CD pipeline integration to validate workflows post-migration.
Understanding of source code management (SCM) best practices, including version history and repository organization in GitHub.
Excellent communication and collaboration skills for cross-team coordination and migration planning.
Proven practical experience in repository migration, large file management, and history preservation during Perforce to GitHub transitions.

Skills: Github, Kubernetes, Perforce, Perforce (Helix Core), Devops Tools

Must-Haves

Git/GitHub (advanced), Perforce (Helix Core) (advanced), Python/Shell scripting (strong), P4-Fusion (hands-on experience), Git LFS (proficient)

Senior DevOps Engineer

Media and Entertainment Industry

Agency job

via Peak Hire Solutions by Dharati Thakkar

Noida

5 - 7 yrs

₹15L - ₹25L / yr

DevOps

Amazon Web Services (AWS)

CI/CD

Infrastructure

Scripting

+28 more

Required Skills: Advanced AWS Infrastructure Expertise, CI/CD Pipeline Automation, Monitoring, Observability & Incident Management, Security, Networking & Risk Management, Infrastructure as Code & Scripting

Criteria:

5+ years of DevOps/SRE experience in cloud-native, product-based companies (B2C scale preferred)
Strong hands-on AWS expertise across core and advanced services (EC2, ECS/EKS, Lambda, S3, CloudFront, RDS, VPC, IAM, ELB/ALB, Route53)
Proven experience designing high-availability, fault-tolerant cloud architectures for large-scale traffic
Strong experience building & maintaining CI/CD pipelines (Jenkins mandatory; GitHub Actions/GitLab CI a plus)
Prior experience running production-grade microservices deployments and automated rollout strategies (Blue/Green, Canary)
Hands-on experience with monitoring & observability tools (Grafana, Prometheus, ELK, CloudWatch, New Relic, etc.)
Solid hands-on experience with MongoDB in production, including performance tuning, indexing & replication
Strong scripting skills (Bash, Shell, Python) for automation
Hands-on experience with IaC (Terraform, CloudFormation, or Ansible)
Deep understanding of networking fundamentals (VPC, subnets, routing, NAT, security groups)
Strong experience in incident management, root cause analysis & production firefighting

Description

Role Overview

Company is seeking an experienced Senior DevOps Engineer to design, build, and optimize cloud infrastructure on AWS, automate CI/CD pipelines, implement monitoring and security frameworks, and proactively identify scalability challenges. This role requires someone who has hands-on experience running infrastructure at B2C product scale, ideally in media/OTT or high-traffic applications.

Key Responsibilities

1. Cloud Infrastructure — AWS (Primary Focus)

Architect, deploy, and manage scalable infrastructure using AWS services such as EC2, ECS/EKS, Lambda, S3, CloudFront, RDS, ELB/ALB, VPC, IAM, Route53, etc.
Optimize cloud cost, resource utilization, and performance across environments.
Design high-availability, fault-tolerant systems for streaming workloads.

2. CI/CD Automation

Build and maintain CI/CD pipelines using Jenkins, GitHub Actions, or GitLab CI.
Automate deployments for microservices, mobile apps, and backend APIs.
Implement blue/green and canary deployments for seamless production rollouts.

3. Observability & Monitoring

Implement logging, metrics, and alerting using tools like Grafana, Prometheus, ELK, CloudWatch, New Relic, etc.
Perform proactive performance analysis to minimize downtime and bottlenecks.
Set up dashboards for real-time visibility into system health and user traffic spikes.

4. Security, Compliance & Risk Highlighting

• Conduct frequent risk assessments and identify vulnerabilities in:

o Cloud architecture

o Access policies (IAM)

o Secrets & key management

o Data flows & network exposure

• Implement security best practices including VPC isolation, WAF rules, firewall policies, and SSL/TLS management.

5. Scalability & Reliability Engineering

Analyze traffic patterns for OTT-specific load variations (weekends, new releases, peak hours).
Identify scalability gaps and propose solutions across:
o Microservices
o Caching layers
o CDN distribution (CloudFront)
o Database workloads
Perform capacity planning and load testing to ensure readiness for 10x traffic growth.

6. Database & Storage Support

Administer and optimize MongoDB for high-read/low-latency use cases.
Design backup, recovery, and data replication strategies.
Work closely with backend teams to tune query performance and indexing.

7. Automation & Infrastructure as Code

Implement IaC using Terraform, CloudFormation, or Ansible.
Automate repetitive infrastructure tasks to ensure consistency across environments.

Required Skills & Experience

Technical Must-Haves

5+ years of DevOps/SRE experience in cloud-native, product-based companies.
Strong hands-on experience with AWS (core and advanced services).
Expertise in Jenkins CI/CD pipelines.
Solid background working with MongoDB in production environments.
Good understanding of networking: VPCs, subnets, security groups, NAT, routing.
Strong scripting experience (Bash, Python, Shell).
Experience handling risk identification, root cause analysis, and incident management.

Nice to Have

Experience with OTT, video streaming, media, or any content-heavy product environments.
Familiarity with containers (Docker), orchestration (Kubernetes/EKS), and service mesh.
Understanding of CDN, caching, and streaming pipelines.

Personality & Mindset

Strong sense of ownership and urgency—DevOps is mission critical at OTT scale.
Proactive problem solver with ability to think about long-term scalability.
Comfortable working with cross-functional engineering teams.

Why Join company?

• Build and operate infrastructure powering millions of monthly users.

• Opportunity to shape DevOps culture and cloud architecture from the ground up.

• High-impact role in a fast-scaling Indian OTT product.

Criteria:

5+ years of DevOps/SRE experience in cloud-native, product-based companies (B2C scale preferred)
Strong hands-on AWS expertise across core and advanced services (EC2, ECS/EKS, Lambda, S3, CloudFront, RDS, VPC, IAM, ELB/ALB, Route53)
Proven experience designing high-availability, fault-tolerant cloud architectures for large-scale traffic
Strong experience building & maintaining CI/CD pipelines (Jenkins mandatory; GitHub Actions/GitLab CI a plus)
Prior experience running production-grade microservices deployments and automated rollout strategies (Blue/Green, Canary)
Hands-on experience with monitoring & observability tools (Grafana, Prometheus, ELK, CloudWatch, New Relic, etc.)
Solid hands-on experience with MongoDB in production, including performance tuning, indexing & replication
Strong scripting skills (Bash, Shell, Python) for automation
Hands-on experience with IaC (Terraform, CloudFormation, or Ansible)
Deep understanding of networking fundamentals (VPC, subnets, routing, NAT, security groups)
Strong experience in incident management, root cause analysis & production firefighting

Description

Role Overview

Key Responsibilities

1. Cloud Infrastructure — AWS (Primary Focus)

Architect, deploy, and manage scalable infrastructure using AWS services such as EC2, ECS/EKS, Lambda, S3, CloudFront, RDS, ELB/ALB, VPC, IAM, Route53, etc.
Optimize cloud cost, resource utilization, and performance across environments.
Design high-availability, fault-tolerant systems for streaming workloads.

2. CI/CD Automation

Build and maintain CI/CD pipelines using Jenkins, GitHub Actions, or GitLab CI.
Automate deployments for microservices, mobile apps, and backend APIs.
Implement blue/green and canary deployments for seamless production rollouts.

3. Observability & Monitoring

Implement logging, metrics, and alerting using tools like Grafana, Prometheus, ELK, CloudWatch, New Relic, etc.
Perform proactive performance analysis to minimize downtime and bottlenecks.
Set up dashboards for real-time visibility into system health and user traffic spikes.

4. Security, Compliance & Risk Highlighting

• Conduct frequent risk assessments and identify vulnerabilities in:

o Cloud architecture

o Access policies (IAM)

o Secrets & key management

o Data flows & network exposure

• Implement security best practices including VPC isolation, WAF rules, firewall policies, and SSL/TLS management.

5. Scalability & Reliability Engineering

Analyze traffic patterns for OTT-specific load variations (weekends, new releases, peak hours).
Identify scalability gaps and propose solutions across:
o Microservices
o Caching layers
o CDN distribution (CloudFront)
o Database workloads
Perform capacity planning and load testing to ensure readiness for 10x traffic growth.

6. Database & Storage Support

Administer and optimize MongoDB for high-read/low-latency use cases.
Design backup, recovery, and data replication strategies.
Work closely with backend teams to tune query performance and indexing.

7. Automation & Infrastructure as Code

Implement IaC using Terraform, CloudFormation, or Ansible.
Automate repetitive infrastructure tasks to ensure consistency across environments.

Required Skills & Experience

Technical Must-Haves

5+ years of DevOps/SRE experience in cloud-native, product-based companies.
Strong hands-on experience with AWS (core and advanced services).
Expertise in Jenkins CI/CD pipelines.
Solid background working with MongoDB in production environments.
Good understanding of networking: VPCs, subnets, security groups, NAT, routing.
Strong scripting experience (Bash, Python, Shell).
Experience handling risk identification, root cause analysis, and incident management.

Nice to Have

Experience with OTT, video streaming, media, or any content-heavy product environments.
Familiarity with containers (Docker), orchestration (Kubernetes/EKS), and service mesh.
Understanding of CDN, caching, and streaming pipelines.

Personality & Mindset

Strong sense of ownership and urgency—DevOps is mission critical at OTT scale.
Proactive problem solver with ability to think about long-term scalability.
Comfortable working with cross-functional engineering teams.

Why Join company?

• Build and operate infrastructure powering millions of monthly users.

• Opportunity to shape DevOps culture and cloud architecture from the ground up.

• High-impact role in a fast-scaling Indian OTT product.

Senior DevOps Engineer

at Fonada

Posted by Karandeep Singh

Noida

5 - 8 yrs

₹15L - ₹20L / yr

DevOps

Amazon Web Services (AWS)

Microsoft Windows Azure

Google Cloud Platform (GCP)

VMware vSphere

+8 more

About the Role

We are looking for a Senior DevOps Engineer to lead the design, automation, and scaling of our hybrid cloud infrastructure spanning public cloud and private/on-premises environments. You will partner closely with software engineering, security, and product teams to build reliable, secure, and high-performance systems that support rapid product delivery. This is a hands-on role with significant influence over our infrastructure strategy, deployment workflows, and engineering culture.

Key Responsibilities

Architect, deploy, and maintain scalable, highly available infrastructure across both public cloud (AWS, Azure, GCP) and private cloud platforms (OpenStack, VMware vSphere/Tanzu, Nutanix, or similar).
Operate and maintain on-premises infrastructure: hypervisors, compute, storage (Ceph, NetApp, SAN/NAS), networking (SDN, VLANs, BGP, MPLS), and hardware capacity planning, alongside their public cloud equivalents.
Design and own CI/CD pipelines that deploy seamlessly across public and private environments.
Implement and manage Infrastructure as Code (Terraform, Ansible, Pulumi) with strong version control and review practices, using providers for both public and private cloud platforms.
Manage container orchestration (Kubernetes, ECS, OpenShift, Rancher) across managed cloud services and self-managed/bare-metal clusters, including upgrades, autoscaling, and workload reliability.
Build observability into all systems through logging, metrics, tracing, and alerting (Prometheus, Grafana, Datadog, ELK, or similar) with unified visibility across hybrid environments.
Champion security best practices: secrets management, IAM hardening, network segmentation, vulnerability scanning, and compliance (SOC 2, ISO 27001, HIPAA, or data-sovereignty requirements).
Lead incident response, root-cause analysis, and post-mortems; drive long-term reliability improvements and SLO/SLA adherence.
Optimize cost, capacity, and resource utilization across public cloud spend and on-premises hardware without compromising performance or availability.
Partner with data center operations and network providers on hardware provisioning, firmware management, MPLS circuit management, and lifecycle planning.
Mentor junior DevOps and software engineers; promote DevOps culture, automation-first thinking, and shared ownership of production.
Evaluate and introduce new tools, platforms, and processes that improve developer productivity and system reliability.

Required Qualifications

5+ years of experience in DevOps, SRE, or Platform Engineering roles, with at least 2 years at a senior level.
Deep expertise with at least one major public cloud provider (AWS, Azure, or GCP) in production.
Hands-on experience operating private cloud or virtualization platforms (OpenStack, VMware, Nutanix, or equivalent) in production.
Strong experience with virtualization, storage systems, and enterprise networking in on-premises environments.
Strong hands-on experience with Kubernetes in production, including both managed cloud and self-managed/bare-metal clusters.
Proficiency in Infrastructure as Code (Terraform and Ansible strongly preferred).
Solid scripting and programming skills in Python, Go, Bash, or similar.
Experience designing and operating CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or ArgoCD.
Strong Linux systems administration and networking fundamentals (TCP/IP, DNS, load balancing, VPNs, firewalls, routing, MPLS).
Experience with monitoring and observability stacks (Prometheus, Grafana, Datadog, New Relic, ELK, or OpenTelemetry).
Proven track record of leading incident response and improving system reliability.
Excellent communication skills and the ability to collaborate across engineering, security, infrastructure, and product teams.

Preferred Qualifications

Experience designing hybrid and multi-cloud architectures, including secure connectivity (Direct Connect, ExpressRoute, MPLS, VPN, SD-WAN) between public and private environments.
Familiarity with service meshes (Istio, Linkerd), API gateways, and GitOps workflows (ArgoCD, Flux).
Background in security-focused or regulated environments and exposure to compliance frameworks.
Experience with database administration (PostgreSQL, MySQL, Redis, MongoDB) in cloud-managed and self-hosted setups.
Contributions to open-source DevOps or cloud infrastructure tooling.
Relevant certifications (AWS Solutions Architect / DevOps Engineer, Azure Administrator, CKA, CKAD, RHCE, VMware VCP, OpenStack Certified Administrator, HashiCorp Terraform Associate).

About the Role

Key Responsibilities

Architect, deploy, and maintain scalable, highly available infrastructure across both public cloud (AWS, Azure, GCP) and private cloud platforms (OpenStack, VMware vSphere/Tanzu, Nutanix, or similar).
Operate and maintain on-premises infrastructure: hypervisors, compute, storage (Ceph, NetApp, SAN/NAS), networking (SDN, VLANs, BGP, MPLS), and hardware capacity planning, alongside their public cloud equivalents.
Design and own CI/CD pipelines that deploy seamlessly across public and private environments.
Implement and manage Infrastructure as Code (Terraform, Ansible, Pulumi) with strong version control and review practices, using providers for both public and private cloud platforms.
Manage container orchestration (Kubernetes, ECS, OpenShift, Rancher) across managed cloud services and self-managed/bare-metal clusters, including upgrades, autoscaling, and workload reliability.
Build observability into all systems through logging, metrics, tracing, and alerting (Prometheus, Grafana, Datadog, ELK, or similar) with unified visibility across hybrid environments.
Champion security best practices: secrets management, IAM hardening, network segmentation, vulnerability scanning, and compliance (SOC 2, ISO 27001, HIPAA, or data-sovereignty requirements).
Lead incident response, root-cause analysis, and post-mortems; drive long-term reliability improvements and SLO/SLA adherence.
Optimize cost, capacity, and resource utilization across public cloud spend and on-premises hardware without compromising performance or availability.
Partner with data center operations and network providers on hardware provisioning, firmware management, MPLS circuit management, and lifecycle planning.
Mentor junior DevOps and software engineers; promote DevOps culture, automation-first thinking, and shared ownership of production.
Evaluate and introduce new tools, platforms, and processes that improve developer productivity and system reliability.

Required Qualifications

5+ years of experience in DevOps, SRE, or Platform Engineering roles, with at least 2 years at a senior level.
Deep expertise with at least one major public cloud provider (AWS, Azure, or GCP) in production.
Hands-on experience operating private cloud or virtualization platforms (OpenStack, VMware, Nutanix, or equivalent) in production.
Strong experience with virtualization, storage systems, and enterprise networking in on-premises environments.
Strong hands-on experience with Kubernetes in production, including both managed cloud and self-managed/bare-metal clusters.
Proficiency in Infrastructure as Code (Terraform and Ansible strongly preferred).
Solid scripting and programming skills in Python, Go, Bash, or similar.
Experience designing and operating CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or ArgoCD.
Strong Linux systems administration and networking fundamentals (TCP/IP, DNS, load balancing, VPNs, firewalls, routing, MPLS).
Experience with monitoring and observability stacks (Prometheus, Grafana, Datadog, New Relic, ELK, or OpenTelemetry).
Proven track record of leading incident response and improving system reliability.
Excellent communication skills and the ability to collaborate across engineering, security, infrastructure, and product teams.

Preferred Qualifications

Experience designing hybrid and multi-cloud architectures, including secure connectivity (Direct Connect, ExpressRoute, MPLS, VPN, SD-WAN) between public and private environments.
Familiarity with service meshes (Istio, Linkerd), API gateways, and GitOps workflows (ArgoCD, Flux).
Background in security-focused or regulated environments and exposure to compliance frameworks.
Experience with database administration (PostgreSQL, MySQL, Redis, MongoDB) in cloud-managed and self-hosted setups.
Contributions to open-source DevOps or cloud infrastructure tooling.
Relevant certifications (AWS Solutions Architect / DevOps Engineer, Azure Administrator, CKA, CKAD, RHCE, VMware VCP, OpenStack Certified Administrator, HashiCorp Terraform Associate).

DevOps Engineer

at TestMu AI (Formely LambdaTest)

3 recruiters

Posted by Himanshi Tomer

Noida

1 - 5 yrs

₹6L - ₹25L / yr

DevOps

Kubernetes

Docker

Amazon Web Services (AWS)

Windows Azure

+9 more

DevOps Engineer (Cloud & Infrastructure)

📍 Noida | 🕐 Full-Time | 🧭 Experience: 2–4 years

About TestMu AI

TestMu AI (formerly LambdaTest) is an AI-native platform designed to move software testing beyond simple automation into the era of agentic intelligence. It provides end-to-end AI agents that manage the entire Quality Engineering lifecycle.

Full-Stack AI Agents: Autonomously plan, author, execute, and analyze tests across the SDLC.
Comprehensive Coverage: Supports web, mobile, and enterprise applications.
Real-World Testing: Scale execution across real devices, browsers, and custom environments.

About the Role

This isn't a role for someone who just wants to "maintain" systems. As a DevOps Engineer at TestMu AI, you are the architect of the automated highways that power our AI agents. You will step into a fast-paced environment where you bridge the gap between cloud-native automation and core infrastructure.

You will manage complex CI/CD pipelines, troubleshoot deep-seated Linux issues, and ensure our hybrid-cloud environment (AWS/Azure) is as resilient as the code it runs.

Key Responsibilities: The Pillars of Growth

A. DevOps & Automation (50% Focus)

Platform Orchestration: Lead the migration to modular, self-healing Terraform and Helm templates.
Agentic CI/CD: Architect GitHub Actions workflows that treat AI agents as first-class citizens, automating environment promotion and risk scoring.
Kubernetes Mastery: Advanced management of Docker and K8s clusters to support scalable production workloads.
Predictive Observability: Use Prometheus, Grafana, and ELK to move from reactive alerts to autonomous anomaly detection.

B. Networking & Data Center Mastery (30% Focus)

Hybrid Networking: Design and troubleshoot VPCs and subnets in Azure/AWS/GCP, paired with physical VLANs and switches in our data centers.
Bare-Metal Lifecycle: Automate hardware provisioning, RAID setup, and firmware updates for our real-device cloud.
Remote Admin: Master out-of-band management (iDRAC, iLO, IPMI) to ensure 100% remote operational capability.
Core Protocols: Own the lifecycle of DNS, DHCP, Load Balancing, and IPAM across distributed environments.

C. Development & Scripting (20% Focus)

Backend Integration: Debug and optimize Python or Go code; understanding how logic interacts with system-level resources.
Advanced Scripting: Write idempotent Bash/Python scripts to automate complex, multi-server operational tasks.
Agentic Tooling: Support the integration of LLM-based developer tools into DevOps workflows to eliminate "toil".

The Interview Journey

We value your ability to solve problems under pressure more than your ability to memorize documentation.

Technical Round 1 (DevOps Leads): A live session focused on real-world debugging scenarios and Linux fundamentals.
Technical Round 2 (Hiring Manager / Pod Lead): An assessment of your architectural thinking, automation strategy, and team alignment.
Technical Round 3 (SVP Engineering / VP DevOps): Strategic discussion on scalability, infrastructure vision, and technical leadership.
Final Round (CEO): Mission alignment, cultural fit, and the "big picture" at TestMu AI.

Growth Timeline

This is a high-visibility role. You will receive direct mentorship from our senior engineering leadership. As you master our production environment, you will have a clear path to move into Senior DevOps Engineer or Infrastructure Architect roles as our pods scale.

Perks That Matter

Health Cover: Comprehensive insurance for you and your family.

Fresh Meals: Daily catered meals at the office.

Transport: Safe cab facilities for eligible shifts.

Pod Budgets: Dedicated engagement budgets for team building and offsites.

DevOps Engineer (Cloud & Infrastructure)

📍 Noida | 🕐 Full-Time | 🧭 Experience: 2–4 years

About TestMu AI

Full-Stack AI Agents: Autonomously plan, author, execute, and analyze tests across the SDLC.
Comprehensive Coverage: Supports web, mobile, and enterprise applications.
Real-World Testing: Scale execution across real devices, browsers, and custom environments.

About the Role

You will manage complex CI/CD pipelines, troubleshoot deep-seated Linux issues, and ensure our hybrid-cloud environment (AWS/Azure) is as resilient as the code it runs.

Key Responsibilities: The Pillars of Growth

A. DevOps & Automation (50% Focus)

Platform Orchestration: Lead the migration to modular, self-healing Terraform and Helm templates.
Agentic CI/CD: Architect GitHub Actions workflows that treat AI agents as first-class citizens, automating environment promotion and risk scoring.
Kubernetes Mastery: Advanced management of Docker and K8s clusters to support scalable production workloads.
Predictive Observability: Use Prometheus, Grafana, and ELK to move from reactive alerts to autonomous anomaly detection.

B. Networking & Data Center Mastery (30% Focus)

Hybrid Networking: Design and troubleshoot VPCs and subnets in Azure/AWS/GCP, paired with physical VLANs and switches in our data centers.
Bare-Metal Lifecycle: Automate hardware provisioning, RAID setup, and firmware updates for our real-device cloud.
Remote Admin: Master out-of-band management (iDRAC, iLO, IPMI) to ensure 100% remote operational capability.
Core Protocols: Own the lifecycle of DNS, DHCP, Load Balancing, and IPAM across distributed environments.

C. Development & Scripting (20% Focus)

Backend Integration: Debug and optimize Python or Go code; understanding how logic interacts with system-level resources.
Advanced Scripting: Write idempotent Bash/Python scripts to automate complex, multi-server operational tasks.
Agentic Tooling: Support the integration of LLM-based developer tools into DevOps workflows to eliminate "toil".

The Interview Journey

We value your ability to solve problems under pressure more than your ability to memorize documentation.

Technical Round 1 (DevOps Leads): A live session focused on real-world debugging scenarios and Linux fundamentals.
Technical Round 2 (Hiring Manager / Pod Lead): An assessment of your architectural thinking, automation strategy, and team alignment.
Technical Round 3 (SVP Engineering / VP DevOps): Strategic discussion on scalability, infrastructure vision, and technical leadership.
Final Round (CEO): Mission alignment, cultural fit, and the "big picture" at TestMu AI.

Growth Timeline

Perks That Matter

Health Cover: Comprehensive insurance for you and your family.

Fresh Meals: Daily catered meals at the office.

Transport: Safe cab facilities for eligible shifts.

Pod Budgets: Dedicated engagement budgets for team building and offsites.

OpenStack Engineer

at Variyas Labs Pvt. Ltd.

2 candid answers

Posted by Sales Team

Delhi, Gurugram, Noida

4 - 6 yrs

₹15L - ₹24L / yr

OpenStack

Linux/Unix

openshift

Kubernetes

Job Overview:

We are looking for a seasoned OpenStack Administrator with strong expertise in managing large-scale production environments. The ideal candidate should have hands-on experience with Linux, Kubernetes, and OpenShift, and be capable of performing routine maintenance, upgrades, and troubleshooting in complex cloud infrastructures.

The candidate must also be comfortable working with Red Hat support, managing escalations, and communicating effectively with both internal teams and external clients.

Key Skills & Qualifications:

Proven experience managing OpenStack infrastructure in production.
Strong proficiency in Linux system administration (RHEL/CentOS preferred).
Hands-on experience with Kubernetes and OpenShift.
Experience with system monitoring, log management, and troubleshooting tools.
Familiarity with RH support portal, managing cases, and following up on resolutions.
Excellent problem-solving skills and ability to work under pressure.
Strong client communication skills and ability to articulate technical issues clearly.
Proven ability to work in and manage large-scale production environments.

Candidates with OpenStack certification will be preferred.

Job Overview:

The candidate must also be comfortable working with Red Hat support, managing escalations, and communicating effectively with both internal teams and external clients.

Key Skills & Qualifications:

Proven experience managing OpenStack infrastructure in production.
Strong proficiency in Linux system administration (RHEL/CentOS preferred).
Hands-on experience with Kubernetes and OpenShift.
Experience with system monitoring, log management, and troubleshooting tools.
Familiarity with RH support portal, managing cases, and following up on resolutions.
Excellent problem-solving skills and ability to work under pressure.
Strong client communication skills and ability to articulate technical issues clearly.
Proven ability to work in and manage large-scale production environments.

Candidates with OpenStack certification will be preferred.

Lead - DevOps

at Classplus

1 video

4 recruiters

Posted by Peoples Office

Noida

5 - 8 yrs

Best in industry

Docker

Kubernetes

DevOps

Amazon Web Services (AWS)

Google Cloud Platform (GCP)

+11 more

About us

Classplus is India's largest B2B ed-tech start-up, enabling 1 Lac+ educators and content creators to create their digital identity with their own branded apps. Starting in 2018, we have grown more than 10x in the last year, into India's fastest-growing video learning platform.
Over the years, marquee investors like Tiger Global, Surge, GSV Ventures, Blume, Falcon, Capital, RTP Global, and Chimera Ventures have supported our vision. Thanks to our awesome and dedicated team, we achieved a major milestone in March this year when we secured a “Series-D” funding.

Now as we go global, we are super excited to have new folks on board who can take the rocketship higher🚀. Do you think you have what it takes to help us achieve this? Find Out Below!

What will you do?

• Define the overall process, which includes building a team for DevOps activities and ensuring that infrastructure changes are reviewed from an architecture and security perspective

• Create standardized tooling and templates for development teams to create CI/CD pipelines

• Ensure infrastructure is created and maintained using terraform

• Work with various stakeholders to design and implement infrastructure changes to support new feature sets in various product lines.

• Maintain transparency and clear visibility of costs associated with various product verticals, environments and work with stakeholders to plan for optimization and implementation

• Spearhead continuous experimenting and innovating initiatives to optimize the infrastructure in terms of uptime, availability, latency and costs

You should apply, if you

1. Are a seasoned Veteran: Have managed infrastructure at scale running web apps, microservices, and data pipelines using tools and languages like JavaScript(NodeJS), Go, Python, Java, Erlang, Elixir, C++ or Ruby (experience in any one of them is enough)

2. Are a Mr. Perfectionist: You have a strong bias for automation and taking the time to think about the right way to solve a problem versus quick fixes or band-aids.

3. Bring your A-Game: Have hands-on experience and ability to design/implement infrastructure with GCP services like Compute, Database, Storage, Load Balancers, API Gateway, Service Mesh, Firewalls, Message Brokers, Monitoring, Logging and experience in setting up backups, patching and DR planning

4. Are up with the times: Have expertise in one or more cloud platforms (Amazon WebServices or Google Cloud Platform or Microsoft Azure), and have experience in creating and managing infrastructure completely through Terraform kind of tool

5. Have it all on your fingertips: Have experience building CI/CD pipeline using Jenkins, Docker for applications majorly running on Kubernetes. Hands-on experience in managing and troubleshooting applications running on K8s

6. Have nailed the data storage game: Good knowledge of Relational and NoSQL databases (MySQL,Mongo, BigQuery, Cassandra…)

7. Bring that extra zing: Have the ability to program/script is and strong fundamentals in Linux and Networking.

8. Know your toys: Have a good understanding of Microservices architecture, Big Data technologies and experience with highly available distributed systems, scaling data store technologies, and creating multi-tenant and self hosted environments, that’s a plus

Being Part of the Clan

At Classplus, you’re not an “employee” but a part of our “Clan”. So, you can forget about being bound by the clock as long as you’re crushing it workwise😎. Add to that some passionate people working with and around you, and what you get is the perfect work vibe you’ve been looking for!

It doesn’t matter how long your journey has been or your position in the hierarchy (we don’t do Sirs and Ma’ams); you’ll be heard, appreciated, and rewarded. One can say, we have a special place in our hearts for the Doers! ✊🏼❤️

Are you a go-getter with the chops to nail what you do? Then this is the place for you.

Azure DevOps Lead

at Celebal Technologies Pvt Ltd

2 candid answers

Posted by Anjani Upadhyay

Remote, Jaipur, Delhi, Gurugram, Noida, Ghaziabad, Faridabad, Pune

5 - 10 yrs

₹18L - ₹25L / yr

Windows Azure

Docker

Kubernetes

DevOps

Architecture

+7 more

About Us -Celebal Technologies is a premier software services company in the field of Data Science, Big Data and Enterprise Cloud. Celebal Technologies helps you to discover the competitive advantage by employing intelligent data solutions using cutting-edge technology solutions that can bring massive value to your organization. The core offerings are around "Data to Intelligence", wherein we leverage data to extract intelligence and patterns thereby facilitating smarter and quicker decision making for clients. With Celebal Technologies, who understands the core value of modern analytics over the enterprise, we help the business in improving business intelligence and more data-driven in architecting solutions.

Key Responsibilities

• As a part of the DevOps team, you will be responsible for configuration, optimization, documentation, and support of the CI/CD components.

• Creating and managing build and release pipelines with Azure DevOps and Jenkins.

• Assist in planning and reviewing application architecture and design to promote an efficient deployment process.

• Troubleshoot server performance issues & handle the continuous integration system.

• Automate infrastructure provisioning using ARM Templates and Terraform.

• Monitor and Support deployment, Cloud-based and On-premises Infrastructure.

• Diagnose and develop root cause solutions for failures and performance issues in the production environment.

• Deploy and manage Infrastructure for production applications

• Configure security best practices for application and infrastructure

Essential Requirements

• Good hands-on experience with cloud platforms like Azure, AWS & GCP. (Preferably Azure)

• Strong knowledge of CI/CD principles.

• Strong work experience with CI/CD implementation tools like Azure DevOps, Team city, Octopus Deploy, AWS Code Deploy, and Jenkins.

• Experience of writing automation scripts with PowerShell, Bash, Python, etc.

• GitHub, JIRA, Confluence, and Continuous Integration (CI) system.

• Understanding of secure DevOps practices

Good to Have -

• Knowledge of scripting languages such as PowerShell, Bash

• Experience with project management and workflow tools such as Agile, Jira, Scrum/Kanban, etc.

• Experience with Build technologies and cloud services. (Jenkins, TeamCity, Azure DevOps, Bamboo, AWS Code Deploy)

• Strong communication skills and ability to explain protocol and processes with team and management.

• Must be able to handle multiple tasks and adapt to a constantly changing environment.

• Must have a good understanding of SDLC.

• Knowledge of Linux, Windows server, Monitoring tools, and Shell scripting.

• Self-motivated; demonstrating the ability to achieve in technologies with minimal supervision.

• Organized, flexible, and analytical ability to solve problems creatively.