Cutshort logo
Kubernetes jobs

50+ Kubernetes Jobs in India

Apply to 50+ Kubernetes Jobs on CutShort.io. Find your next job, effortlessly. Browse Kubernetes Jobs and apply today!

icon
Unico Connect Private Limited
Mumbai
2 - 4 yrs
Best in industry
DevOps
skill iconAmazon Web Services (AWS)
skill iconKubernetes
Terraform
CI/CD
+8 more

DevOps Engineer

AWS Infrastructure, CI/CD & Production Operations

Mumbai (On-site) | Full-time | 2-4 years


About the role:

Unico Connect is an AI-first technology partner that builds custom mobile, web, and AI products for clients across multiple geographies. We are hiring a DevOps Engineer who will own day-to-day cloud infrastructure, deployment automation, and production operations across active customer engagements.

The mandatory requirement for this role is hands-on production experience on AWS, with infrastructure as code, container orchestration, and CI/CD pipelines owned end to end on at least one live customer workload. The role is hands-on. Expect to operate Kubernetes clusters, build CI/CD pipelines, automate environment provisioning, manage TLS and DNS, set up observability, and partner with backend and AI engineers to ship reliably. A typical week includes a Terraform refactor, a deployment pipeline build for a new service, an incident response on a production cluster, and a cost review.


Responsibilities:

  • AWS infrastructure: Design and operate production infrastructure on AWS using EC2, EKS or ECS, S3, RDS, IAM, VPC, CloudFront, and Route53. Own configuration, networking, and cost.
  • Infrastructure as code: Write and maintain Terraform or Pulumi modules. Drive consistency across environments and tenants through IaC rather than manual configuration.
  • Kubernetes and containers: Operate production EKS clusters. Manage Helm charts, Ingress, autoscaling, secrets, and workload isolation.
  • CI/CD pipelines: Build and maintain pipelines using GitHub Actions, GitLab CI, or equivalent. Include automated tests, security scans, and rollback paths.
  • TLS, DNS, and CDN automation: Automate domain provisioning, TLS issuance (Let's Encrypt, cert-manager, ACM), and CDN configuration (CloudFront, Cloudflare).
  • Observability and incident response: Set up monitoring, logging, and alerting using Prometheus, Grafana, ELK, Loki, or CloudWatch. Lead incident response and write postmortems.
  • Secrets and security: Manage secrets through Vault, AWS Secrets Manager, or KMS. Apply least-privilege IAM and review access regularly.
  • Cost monitoring: Track and optimise AWS spend across environments. Surface waste and propose remediations.

Requirements:

  • Hands-on AWS production experience (mandatory). Must have personally operated production workloads on AWS, with responsibility for IaC, deployments, and incident response on at least one live customer or internal-platform deployment. POCs and lab environments do not qualify.
  • 2 to 4 years of hands-on DevOps or infrastructure experience. Candidates with slightly less experience but strong demonstrated ownership are welcome to apply.
  • AWS depth. Hands-on with EC2, S3, IAM, VPC, EKS or ECS, RDS, CloudFront, and Route53. Working knowledge of CloudWatch and AWS cost tooling.
  • Kubernetes in production. Hands-on operation of EKS or equivalent. Comfort with Helm, Ingress controllers, autoscaling, and resource quotas.
  • Infrastructure as code. Strong with Terraform (preferred) or Pulumi. Modular code, state management, and review discipline.
  • CI/CD pipelines. Production experience with GitHub Actions, GitLab CI, or equivalent. Comfort with multi-environment pipelines and release strategies.
  • Scripting and automation. Strong Bash and Python (or Go) for tooling. Linux fluency at the command line.
  • Observability stack. Hands-on with Prometheus, Grafana, ELK or Loki, and at least one APM tool (Datadog, New Relic, or equivalent).
  • Networking, TLS, and security fundamentals. Comfortable with DNS, TLS certificate lifecycle, VPC peering, and security groups.



Nice to have: multi-tenant SaaS infrastructure experience; service mesh (Istio, Linkerd); GitOps (ArgoCD, Flux); sandboxed execution environments (Firecracker, gVisor); exposure to platform engineering or developer-platform teams.


Read more
A leading data & analytics intelligence technology solutions provider

A leading data & analytics intelligence technology solutions provider

Agency job
via HyrHub by Neha Koshy
Remote only
3 - 7 yrs
₹10L - ₹27L / yr
DevOps
Windows Azure
CI/CD
Terraform
Linux/Unix
+2 more

Contract Job: DevOps / Azure DevOps

Contract Term: Max 3-6 months

Looking For Immediate joiners only

Remote Opportunity

  • CI/CD, deployment, environment management, and release support
  • This can be a shared capability/position as well


Read more
 A Digital Product Engineering company

A Digital Product Engineering company

Agency job
via Unique Occupational by Mantasha Naaz
Gurugram
5.5 - 7.5 yrs
₹28L - ₹32L / yr
skill iconKubernetes
ArgoCD
NewRelic
Crossplane
skill iconAmazon Web Services (AWS)
+2 more

Job Details:

  • Role: Staff Engineer, ArgoCD
  • Experience: 5.5-7.5 Years
  • Employment Type: Full-time
  • Work Mode: Gurugram (Hybrid)

Job Description

REQUIREMENTS:

  • Strong hands-on experience with Kubernetes (K8s) administration, deployment, and troubleshooting
  • Expertise in GitOps implementation using ArgoCD
  • Strong experience with Crossplane for infrastructure provisioning and orchestration
  • Hands-on experience with New Relic for monitoring, observability, and performance management
  • Experience building and maintaining CI/CD pipelines and deployment automation
  • Strong knowledge of Infrastructure as Code (IaC) using Terraform
  • Experience working with AWS cloud services and cloud-native architectures
  • Hands-on experience with Docker and containerization technologies
  • Strong Linux administration and scripting skills
  • Experience implementing platform reliability, security, and automation best practices
  • Strong understanding of monitoring, logging, and observability frameworks

RESPONSIBILITIES:

  • Manage, maintain, and optimize Kubernetes-based infrastructure and application deployments
  • Implement and support GitOps workflows using ArgoCD
  • Design and manage infrastructure provisioning using Crossplane
  • Monitor platform performance, reliability, and user experience using New Relic
  • Build, enhance, and maintain CI/CD pipelines for automated software delivery
  • Collaborate with development and platform engineering teams to deliver scalable cloud-native solutions
  • Implement Infrastructure as Code practices using Terraform and automation tools
  • Ensure platform stability, security, scalability, and operational excellence
  • Troubleshoot infrastructure, deployment, and performance-related issues
  • Drive continuous improvement initiatives across DevOps processes, tooling, and automation practices
  • Support cloud infrastructure management and containerized application environments on AWS
  • Promote DevOps best practices, governance, and operational standards across teams

Qualifications

Bachelor’s or master’s degree in computer science, Information Technology, or a related fields   

Read more
Credilio Financial Technologies Pvt. Ltd.
Munjal Dhamecha
Posted by Munjal Dhamecha
Mumbai
4 - 8 yrs
Best in industry
skill iconAmazon Web Services (AWS)
skill iconKubernetes
Terraform

DevOps Engineer

We are looking for a hands-on DevOps Engineer to manage and scale our cloud infrastructure, Kubernetes-based microservice deployments, monitoring systems, and data engineering infrastructure.

The person will be responsible for building reliable, secure, scalable, and cost-efficient infrastructure using automation-first practices. This role is important for supporting a high-growth B2C platform where availability, deployment velocity, observability, security, and cost efficiency are critical.


Key Responsibilities

  • Manage and automate cloud infrastructure using Terraform.
  • Deploy, manage, and troubleshoot microservices on Kubernetes.
  • Build and maintain CI/CD pipelines to ensure reliable, controlled deployments.
  • Implement safe release practices, including rolling deployments, rollback, and zero-downtime deployments.
  • Manage monitoring, logging, alerting, dashboards, and production runbooks.
  • Support incident response, production debugging, RCA, and preventive action closure.
  • Ensure infrastructure is scalable, secure, highly available, and cost-optimised.
  • Support data engineering infrastructure, including ClickHouse, PeerDB, Airflow, Kafka, and related platform components.
  • Maintain infra-level security controls, backups, disaster recovery, and access governance.

Required Skills

  • Strong experience with Terraform, Infrastructure as Code, and AWS.
  • Strong experience with Kubernetes, Docker, Helm, ingress, and autoscaling.
  • Experience with CI/CD tools such as GitHub Actions, GitLab CI, Jenkins, ArgoCD, or similar.
  • Experience with monitoring and observability tools such as Prometheus, Grafana, ELK/OpenSearch, New Relic, or similar.
  • Good understanding of cloud networking, DNS, load balancers, VPC/VPN, SSL/TLS, firewalls, and WAF.
  • Experience with Linux administration, shell scripting, and automation.
  • Understanding of cloud security, IAM, secrets management, and access governance.
  • Exposure to databases, queues, caches, and data infrastructure tools such as ClickHouse, PeerDB, Airflow, Kafka, or similar.
  • Strong debugging and problem-solving skills during production incidents.
  • Ability to work closely with engineering teams to improve deployment, monitoring, cost, and reliability. 
Read more
WITS Innovation Lab
Prabhnoor Kaur
Posted by Prabhnoor Kaur
Mohali
3 - 7 yrs
₹4L - ₹13L / yr
skill iconAmazon Web Services (AWS)
Azure
Google Cloud Platform (GCP)
skill iconKubernetes
skill iconJenkins
+9 more

Job Description:

Job Description: DevOps Engineer

Experience-3+yrs

As a DevOps Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure that supports our platform and solutions. You will work closely and collaborate with various development and operations teams to ensure seamless integration and deployment of our solutions.

Responsibilities:

  • Design, implement, and manage scalable and reliable IoT infrastructure on Raspberry Pi OS, various backend services using Java and AI stack Automate deployment, monitoring, and management of IoT applications.
  • Collaborate with development teams to ensure continuous integration and continuous deployment (CI/CD) pipelines are efficient and effective. Manage various AWS services such as EC2, S3, Redis, RDS, Lambda, and VPC, Cognito etc.,
  • Manage various Cloud infrastructure using both Kubernetes(k8) and AWS stacks for both lower and production environments Implement security best practices to protect both platform and IoT data and infrastructures.
  • Develop and maintain infrastructure as code (IaC) using tools like Terraform, Ansible, or CloudFormation. Troubleshoot and resolve infrastructure-related issues along with operations and development teams
  • Monitor system performance, identify issues, and implement solutions to ensure high availability and reliability. Plan and provision on-demand and stable various Cloud infrastructures in AWS and other Clouds with guided planning, scoping, costing and visibility to whole platform infrastructures with cost optimizations and utilization efficiency
  • Stay up-to-date with the latest industry trends and technologies DevOps Familiarity with Dev SecOps & ML-Ops, Git-Ops practices

Requirements:

  • Bachelor's degree in Computer Science, Engineering, or a related field. Strong knowledge of cloud platforms (AWS, Azure, GCP) and containerization technologies using Docker, Kubernetes etc.,
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or Circle CI. Proficiency in scripting languages (Python, Bash, Shell etc.).
  • Good understanding and hands-on with Networking, security, and system administration in AWS environment will be ideal Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.


Read more
NeoGenCode Technologies Pvt Ltd
Akshay Patil
Posted by Akshay Patil
Noida, Bengaluru (Bangalore), Pune, Hyderabad, Chennai
6 - 8 yrs
₹6L - ₹12L / yr
Data engineering
databricks
Snow flake schema
skill iconPython
Apache Spark
+8 more

Job Title : Data Engineer – Databricks

Experience : 6+ Years

Location : Noida / Hyderabad / Chennai / Pune / Bengaluru (Hybrid)

Shift : IST (Normal Shift)


Job Summary :

We are seeking an experienced Data Engineer with strong expertise in Databricks, Snowflake, Python, and Spark to build and optimize scalable data pipelines and support AI/ML model deployments. The ideal candidate should have experience working with cloud-based data platforms and preferably possess exposure to the Healthcare domain.


Required Skills :

  • Databricks (Preferred)
  • Snowflake
  • Python
  • Apache Spark
  • SQL
  • Azure Cloud
  • Kubernetes
  • Apache Airflow
  • GitHub & CI/CD Pipelines
  • AI/ML Model Deployment
  • Data Analytics

Preferred :

  • Experience in the Healthcare domain.
  • Strong understanding of scalable data engineering architectures and best practices.
Read more
Searce Inc
Mumbai
4 - 8 yrs
Best in industry
Google Cloud Platform (GCP)
Terraform
skill iconKubernetes
GKE
Site reliability
+3 more

Senior Cloud Security & Reliability Engineer (GCP)

Searce Inc | Mumbai, Maharashtra | On-site | 4–7 Years

About Searce

Searce is a global, AI-native, engineering-led technology consultancy and a Premier Google Cloud Partner — recognized as the Google Cloud Workplace AI Transformation Partner of the Year, APAC (2026). With 20+ years of experience and 3,000+ clients across 10+ countries, we help businesses stay ahead of the cloud curve. The Role We're looking for a Senior Cloud Security & Reliability Engineer with deep GCP expertise to join our Mumbai MSP team. You'll own reliability, security, and optimization of enterprise client GCP environments — 24/7.


What You'll Do

Own Reliability — Lead 24x7 GCP cloud operations and incident management. Define and enforce SLOs.

Engineer the Blueprint — Build scalable, secure GCP architectures and maintain IaC modules and playbooks.

Automate Everything — Embed AI-driven automation and AIOps to eliminate toil and preempt incidents.

Drive FinOps — Own GCP cost optimization for clients with quantified impact.

Be the Expert — Represent deep GCP expertise in client conversations and documentation.

Mentor & Elevate — Coach junior engineers through code reviews and problem-solving.


What We're Looking For

Experience

4–7 years total with 4+ years on GCP cloud infrastructure Background in Cloud Managed Services / MSP environments

2+ years in a client-facing technical role


Technical Skills (Must-Have)

GCP: GKE, IAM, VPC, Cloud Monitoring, Stackdriver — in work experience

Kubernetes: GKE — demonstrated in production

IaC: Terraform — module-level, demonstrated in work experience Observability: Prometheus + Grafana minimum

Security: GCP IAM, VPC controls, Security Command Center

Scripting: Python


Nice to Have

GCP Professional Cloud Architect / Pro DevOps Engineer certification BFSI or enterprise domain experience Thanos, Vault, Istio, ArgoCD FinOps Foundation Certification


Why Searce?

🏆 Google Cloud Partner of the Year — APAC 2026

🌍 Enterprise clients across US, APAC, and India

🤖 AI-first, engineering-led culture

📈 Fast-growing MSP team with real career ownership

🤝 HAPPIER values — Humble, Adaptable, Positive, Passionate, Innovative, Excellence, Responsible

📧 Interested? Share your profile and let's connect.

Read more
CLOUDSUFI

at CLOUDSUFI

3 recruiters
Ayushi Dwivedi
Posted by Ayushi Dwivedi
Noida
3 - 6 yrs
₹15L - ₹35L / yr
Maven
DevOps
Google Cloud Platform (GCP)
skill iconKubernetes
skill iconDocker
+2 more

About Us

CLOUDSUFI, a Google Cloud Premier Partner, is a global leading provider of data-driven digital transformation across cloud-based enterprises. With a global presence and focus on Software & Platforms, Life sciences and Healthcare, Retail, CPG, financial services and supply chain, CLOUDSUFI is positioned to meet customers where they are in their data monetization journey.


Hybrid - 2 days in a week from Noida office


Key Responsibilities

  • Design, develop, and maintain robust and scalable big data solutions
  • Implement and manage CI/CD pipelines to automate the build, test, and deployment of data applications.
  • Write high-quality, maintainable, and efficient code in Java.
  • Create and manage build configurations using tools like Maven, Gradle, or Ant.
  • Utilize Git for version control and to manage code repositories.
  • Develop automation scripts using Bash and Python.
  • Perform in-depth logging and debugging to identify and resolve issues in complex data systems.
  • Develop and execute comprehensive test scripts to ensure the quality and reliability of data pipelines.

Requirement:

  • Cloud Computing: Practical knowledge of GCP services, including VMs, GCS, and Dataproc or similar cloud data services.
  • Programming: Proven proficiency in Java development.
  • Build Tools: Solid experience with build automation tools like Maven, Gradle, or Ant.
  • Version Control: Proficient in using Git.
  • Big Data: Strong hands-on experience with the Hadoop ecosystem and Apache Spark.
  • IDE: Experience with IntelliJ IDEA or similar development environments.
  • CI/CD: A strong understanding of how CI/CD pipelines work and experience with relevant tools (e.g., Jenkins, GitLab CI).
  • Scripting: Proficiency in Bash and Python.
  • Operating Systems: In-depth knowledge of Linux distributions (Debian, Ubuntu, Rocky).
  • Core Competencies:
  • Excellent logging and debugging skills.
  • Experience in writing test scripts and a commitment to software quality.
  • Familiarity with the process of CVE resolutions.

Behavioural competencies required:

  • Must have worked with US/Europe based clients in onsite/offshore delivery model
  • Should have very good verbal and written communication, technical articulation, listening and presentation skills
  • Should have proven analytical and problem solving skills
  • Should have demonstrated effective task prioritization, time management and internal/external stakeholder management skills
  • Should be a quick learner and team player
  • Should have experience of working under stringent deadlines in a Matrix organization structure
  • Should have demonstrated appreciable Organizational Citizenship Behavior (OCB) in past organizations


Read more
AbleCredit

at AbleCredit

2 candid answers
Arpita Das
Posted by Arpita Das
Bengaluru (Bangalore)
5 - 15 yrs
₹20L - ₹40L / yr
Information security
IT security
Security architecture
Cloud Security
Risk Management
+14 more

About the Role

We are looking for a senior Information Security leader who can operate at two levels simultaneously:

  • Drive the company’s security architecture and governance internally.
  • Serving as the primary security representative to banks, NBFCs, insurers and other enterprise customers.


This is a mix of client-facing + security expert role for someone comfortable discussing cloud security, application security, audits, risk management and regulatory requirements with CISOs, security teams, auditors and executive stakeholders.


What You'll Own?

• Act as the executive face of security during customer audits, InfoSec reviews, RFPs and procurement processes

• Build trusted relationships with CISOs, security leaders and risk teams across BFSI organizations

• Own security responses for customer questionnaires, control discussions and architecture reviews

• Drive security strategy, risk management and security governance across the company

• Establish and continuously improve internal audit, compliance and evidence-collection processes

• Own security readiness for ISO 27001, SOC 2, RBI, DPDP and related regulatory requirements

• Guide cloud, infrastructure, identity, access control, secrets management and deployment security practices

• Partner with engineering teams to embed security into product development and operations

• Lead incident response, vulnerability management and security improvement initiatives


Ideal Background

• Security Engineering, DevSecOps, Cloud Security, Security Architecture or CISO/Deputy CISO experience

• Experience working with banks, NBFCs, insurers, fintechs or BFSI-focused SaaS companies

• Strong understanding of AWS, Kubernetes, IAM, CI/CD and modern cloud security practices

• Experience handling customer-facing security reviews, audits and compliance programs

• Familiarity with ISO 27001, SOC 2, RBI guidelines, DPDP and related security frameworks

• Ability to translate technical controls into business language for customers and executives


Why This Role Is Different?

Rather than being a back-office compliance function, this role sits at the intersection of security, customer trust and business growth. You will work directly with the founders, influence product and infrastructure decisions, and play a key role in helping the company win and expand enterprise BFSI accounts.

Read more
NeoGenCode Technologies Pvt Ltd
Akshay Patil
Posted by Akshay Patil
Remote only
8 - 12 yrs
₹8L - ₹12L / yr
skill iconC#
skill icon.NET
Microsoft Windows Azure
Azure Functions
Azure Kubernetes Service (AKS)
+8 more

Job Title : API Lead

Experience : 8+ Years

Location : Remote (Laptop pickup required for 2 days from Noida, Hyderabad, Chennai, Pune, or Bengaluru)

Shift : IST (Regular Day Shift)


Role Overview :

We are looking for an experienced API Lead with strong expertise in C#.NET, Azure Cloud, Azure Functions, and Azure Kubernetes Service (AKS) to design, develop, and lead scalable, secure, and high-performance APIs and microservices for cloud-native applications.


Mandatory Skills :

C#, .NET, .NET Core/.NET 6+, Azure Cloud Services, Azure Functions, Azure Kubernetes Service (AKS), REST API Development, Docker, Kubernetes, Azure SQL/Cosmos DB, and CI/CD using Azure DevOps or GitHub Actions.


Mandatory Skills :

  • C#.NET (.NET Core/.NET 6+)
  • Azure Cloud Services
  • Azure Functions
  • Azure Kubernetes Service (AKS)
  • REST API Development
  • Docker and Kubernetes
  • Azure SQL and/or Cosmos DB
  • CI/CD (Azure DevOps or GitHub Actions)


Key Responsibilities :

  • Design and develop RESTful APIs and microservices using C# and .NET Core/.NET 6+.
  • Build and integrate solutions using Azure services such as Azure Functions, APIM, App Services, and Storage.
  • Deploy and manage containerized applications using Docker and AKS.
  • Implement API security using OAuth2, JWT, and Managed Identities.
  • Optimize API performance and collaborate with Architecture, DevOps, and QA teams.


Required Technical Skills :

Must-Have Skills :

  • Strong expertise in C#, .NET Core, and .NET 6+.
  • Hands-on experience with Azure Functions and Azure Kubernetes Service (AKS).
  • Strong understanding of REST API architecture and best practices.
  • Experience with Docker and Kubernetes concepts.
  • Knowledge of Azure API Management (APIM) and Azure PaaS services.
  • Experience with Azure SQL Database and Cosmos DB.
  • Familiarity with CI/CD pipelines using Azure DevOps and/or GitHub Actions.
  • Strong understanding of API authentication and security mechanisms (OAuth2, JWT, Managed Identities).
  • Experience with monitoring, troubleshooting, and performance tuning of APIs.

Good to Have :

  • Experience with microservices architecture and distributed systems.
  • Exposure to Infrastructure as Code (Terraform/Bicep).
  • Experience with Application Insights and Azure Monitor.
  • Knowledge of Agile/Scrum methodologies.
Read more
Improving
Vishakha Deshmukh
Posted by Vishakha Deshmukh
Bengaluru (Bangalore), Mumbai
4 - 10 yrs
₹15L - ₹25L / yr
WAF
OWASP
skill iconKubernetes
CI/CD
ArgoCD
+1 more

We are seeking an experienced SRE Security Engineer to design, secure, and operate cloud-native platforms running on Kubernetes (K8s). The ideal candidate will have strong expertise in Helm, ArgoCD/GitOps, and Kubernetes security best practices.

Key Responsibilities

  • Manage and secure Kubernetes-based infrastructure and workloads.
  • Implement and maintain GitOps workflows using ArgoCD and CI/CD pipelines.
  • Deploy and manage applications using Helm.
  • Monitor, detect, and respond to security threats using Wazuh and Falco.
  • Perform container and image vulnerability scanning using Trivy.
  • Design, implement, and maintain WAF controls aligned with OWASP security standards.
  • Collaborate with platform, DevOps, and development teams to embed security into the software delivery lifecycle.

Required Skills

  • Strong hands-on experience with Kubernetes (K8s).
  • Expertise in Helm, ArgoCD, and GitOps/CI-CD practices.
  • Experience with Wazuh, Falco, and Trivy.
  • Good understanding of Web Application Firewalls (WAF) and OWASP Top 10.
  • Knowledge of container, cloud-native, and application security best practices.

Preferred: Experience securing large-scale Kubernetes environments and implementing DevSecOps practices.

Read more
Impacto Digifin Technologies

at Impacto Digifin Technologies

4 candid answers
1 recruiter
Navitha Reddy
Posted by Navitha Reddy
Bengaluru (Bangalore)
3 - 5 yrs
₹7L - ₹10L / yr
css3
skill iconJavascript
skill iconReact.js
TypeScript
skill iconNextJs (Next.js)
+15 more

Job Description

Team Lead – Full Stack Developer

Experience

3–5 Years

Location

Bangalore (Onsite)

Role Summary

We are looking for a highly skilled, hands-on, and ownership-driven Team Lead – Full Stack Developer who can independently architect, design, develop, deploy, and support enterprise-grade web, mobile, and cloud-native applications.

The ideal candidate should have strong expertise in ReactJS, Next.js, Java, Spring Boot, Microservices, Distributed Systems, Cloud Technologies, DevOps, AI-Assisted Development, and Product Engineering.

The candidate will be responsible for owning end-to-end product development, including frontend architecture, backend services, APIs, databases, deployment, production support, stakeholder communication, sprint planning, team leadership, and successful delivery of scalable, secure, and high-performance solutions.

Key Responsibilities

Architecture & Product Ownership

  • Design end-to-end application architecture, HLDs, LLDs, deployment architecture, integration architecture, and security architecture.
  • Build products from scratch and independently drive projects from concept to production.
  • Convert business requirements into technical solutions, user stories, sprint plans, and delivery roadmaps.
  • Design scalable, secure, fault-tolerant, multi-tenant, and enterprise-grade applications.
  • Evaluate and recommend suitable technologies, frameworks, and development approaches.

Full Stack Development

Frontend Development

  • Design and develop responsive and modern web applications.
  • Build reusable UI components and scalable frontend architectures.
  • Develop Single Page Applications (SPA) using ReactJS and Next.js.
  • Implement state management using Redux Toolkit and Context API.
  • Integrate REST APIs, GraphQL APIs, and WebSocket-based applications.
  • Ensure application accessibility, responsiveness, performance optimization, and SEO best practices.
  • Collaborate closely with UI/UX designers to deliver intuitive user experiences.

Backend Development

  • Develop enterprise-grade backend applications using Java and Spring Boot.
  • Design and develop REST APIs, Microservices, SOAP Services, gRPC Services, and Webhooks.
  • Build reusable frameworks, service layers, and integration modules.
  • Implement authentication, authorization, API security, and enterprise integrations.
  • Optimize application performance, scalability, reliability, and maintainability.

Distributed Systems & Enterprise Solutions

  • Design distributed systems using Event-Driven Architecture, CQRS, Saga Pattern, DDD, Hexagonal Architecture, and Clean Architecture.
  • Implement resilience patterns, service discovery, circuit breakers, distributed transactions, and failover mechanisms.
  • Build workflow automation platforms, document management systems, enterprise integration solutions, and business process automation systems.

Team Leadership & Delivery

  • Lead and mentor development teams.
  • Conduct code reviews, architecture reviews, and technical discussions.
  • Drive sprint planning, stand-ups, reviews, and retrospectives.
  • Collaborate with Product, Business, QA, DevOps, AI, and Management teams.
  • Ensure successful project delivery while maintaining engineering standards and best practices.
  • Support hiring, onboarding, mentoring, and technical growth of team members.

DevOps, Deployment & Production Support

  • Manage CI/CD pipelines and deployment processes.
  • Work with Docker, Kubernetes, and cloud-native environments.
  • Implement Blue-Green, Canary, Rolling, and Zero-Downtime deployment strategies.
  • Monitor application health and troubleshoot production issues.
  • Ensure application reliability, security, scalability, and operational excellence.

AI & Innovation

  • Leverage AI tools to improve development productivity and software quality.
  • Build AI-powered Proof of Concepts (POCs) and enterprise applications.
  • Utilize AI for development, testing, documentation, architecture reviews, and automation.
  • Collaborate with AI teams to integrate intelligent capabilities into business applications.

Mandatory Technical Skills

Frontend Technologies

  • ReactJS
  • Next.js
  • JavaScript (ES6+)
  • TypeScript
  • HTML5
  • CSS3
  • Tailwind CSS
  • Material UI
  • Bootstrap
  • Redux Toolkit
  • Context API
  • React Query
  • Responsive Design
  • Progressive Web Applications (PWA)

Backend Technologies

  • Java 8/11/17/21
  • Core Java
  • Spring Boot
  • Spring MVC
  • Spring Security
  • Spring Data JPA
  • Spring Cloud
  • Spring Batch
  • Spring WebFlux
  • Spring Integration
  • Spring AOP
  • Spring Validation

APIs & Enterprise Integrations

  • REST APIs
  • SOAP Services
  • GraphQL
  • gRPC
  • Webhooks
  • OpenAPI / Swagger
  • OAuth2
  • JWT
  • OpenID Connect
  • SAML

Microservices & Distributed Systems

  • Microservices
  • Event-Driven Architecture
  • CQRS
  • Saga Pattern
  • Domain-Driven Design (DDD)
  • Event Sourcing
  • Hexagonal Architecture
  • Clean Architecture
  • Service Discovery
  • Circuit Breakers

Messaging & Streaming

  • Apache Kafka
  • RabbitMQ
  • Event Streaming
  • Message Brokers
  • Asynchronous Processing

Databases

  • PostgreSQL
  • Oracle Database
  • Oracle FLEXCUBE
  • MySQL
  • SQL Server
  • MongoDB
  • Redis
  • Database Design
  • Query Optimization
  • Indexing
  • Migration
  • Replication

DevOps & Cloud

  • Docker
  • Kubernetes
  • Jenkins
  • GitHub Actions
  • GitLab CI/CD
  • AWS
  • Azure
  • GCP

Monitoring & Quality

  • Grafana
  • Prometheus
  • ELK Stack
  • OpenTelemetry
  • JUnit
  • Mockito
  • SonarQube
  • Unit Testing
  • Integration Testing
  • API Testing
  • Performance Testing
  • Security Testing

AI-Assisted Development Tools

  • ChatGPT
  • Claude
  • Cursor
  • GitHub Copilot
  • Replit
  • Lovable
  • IntelliJ IDEA
  • Prompt Engineering
  • AI-Based Code Review
  • AI-Based Testing & Documentation

Preferred Domain Experience

  • Banking & Financial Services
  • FinTech
  • Enterprise SaaS
  • Workflow Automation
  • Digital Transformation
  • AI-Powered Applications
  • OCR & Document Processing
  • Payment Systems
  • Enterprise Integration Platforms
  • Identity & Access Management
  • Multi-Tenant Platforms

Soft Skills

  • Strong leadership and ownership mindset.
  • Excellent communication and stakeholder management skills.
  • Strong analytical and problem-solving abilities.
  • Ability to manage multiple projects simultaneously.
  • Strong mentoring and team collaboration skills.
  • Innovation-driven and continuous learning attitude.

Education

Bachelor's Degree or Master's Degree in:

  • Computer Science
  • Information Technology
  • Software Engineering
  • Electronics & Communication
  • Or equivalent practical experience

Preferred Candidate Profile

A highly motivated Full Stack Engineering Leader with experience building products from scratch, owning frontend and backend architecture, leading engineering teams, driving innovation, and delivering scalable enterprise-grade applications across web, mobile, cloud, and enterprise platforms. The candidate should be capable of independently managing architecture, development, deployment, production support, and successful project delivery while mentoring and growing engineering teams.

mandatory one w

Read more
Improving
Leena Lahari
Posted by Leena Lahari
Bengaluru (Bangalore)
4 - 8 yrs
₹12L - ₹30L / yr
skill iconKubernetes
Google Cloud Platform (GCP)
Terraform
helm
ArgoCD
+4 more

Site Reliability Engineer

Experience - 4 - 8 Years

Location - Bangalore (Hybrid) 


We are seeking a highly skilled Site Reliability Engineer (SRE) to design, build, and operate scalable, reliable, and secure cloud-native platforms. The ideal candidate will have strong experience with Kubernetes ecosystems, cloud infrastructure, automation, observability, and GitOps practices.


Key Responsibilities

  • Manage and optimize Kubernetes-based platforms, including Cilium, Istio, Ingress Controllers, and related ecosystem components.
  • Design, deploy, and maintain infrastructure on Google Cloud Platform (GCP).
  • Automate infrastructure provisioning and lifecycle management using Terraform.
  • Implement and manage GitOps workflows using ArgoCD and GitLab.
  • Deploy and maintain Helm charts for Kubernetes applications.
  • Manage secrets, service discovery, and distributed systems using Vault and Consul.
  • Build and maintain monitoring, logging, and observability platforms using Prometheus Operator and the Grafana Stack (Grafana, Mimir, Loki, Alloy, Tempo, and Pyroscope).
  • Collaborate with development teams to improve platform reliability, performance, scalability, and operational excellence.
  • Develop CI/CD pipelines and automation to support modern cloud-native deployments.


Required Skills

  • Strong hands-on experience with Kubernetes (K8s) and cloud-native technologies.
  • Experience with GCP, Terraform, Helm, and ArgoCD.
  • Knowledge of Service Mesh technologies, particularly Istio and Cilium.
  • Experience with Vault, Consul, and infrastructure security best practices.
  • Strong expertise in observability tools including Prometheus and the Grafana ecosystem.
  • Proficiency with GitOps, GitLab, CI/CD pipelines, and automation.
  • Good understanding of Linux systems, networking, and troubleshooting in distributed environments.


Preferred Qualifications

  • Experience operating large-scale production environments.
  • Knowledge of SRE principles, incident management, capacity planning, and reliability engineering.
  • Relevant cloud-native certifications (CKA, GCP, Terraform, etc.) are a plus.


Read more
Searce Inc

at Searce Inc

3 recruiters
Karthika Senthilkumar
Posted by Karthika Senthilkumar
Pune
7 - 12 yrs
Best in industry
Google Cloud Platform (GCP)
Terraform
skill iconKubernetes
GKE
Reliability engineering
+1 more

About Searce

Searce is a global, AI-native, engineering-led technology consultancy and a Premier Google

Cloud Partner — recognized as the Google Cloud Workplace AI Transformation Partner of the

Year, APAC (2026). With 20+ years of experience and 3,000+ clients across 10+ countries, we

help businesses stay ahead of the cloud curve.


The Role

We're looking for a Lead Cloud Security & Reliability Engineer with deep GCP expertise to own

end-to-end cloud reliability and security forAPAC enterprise clients. As Lead, you'll set the architectural direction, mentor your squad, and drive measurable client outcomes across multi-

cloud environments.


What You'll Do

Own Client Delivery — Lead 24x7 GCP cloud operations forAPAC clients. Define SLO frameworks and ensure adherence.


Architect Solutions — Design scalable, secure GCP-primary architectures with multi-cloud awareness.


Drive Reliability — Lead incident response, RCA, and long-term remediation across production systems.


Mentor & Elevate — Coach and grow a squad of Senior CSREs.


Drive FinOps — Own cloud cost governance and optimization with quantified impact.


Be the Expert — Represent Searce's technical depth in global client conversations.


What We're Looking For

Experience

7–12 years total with 5+ years on GCP cloud infrastructure

Strong background in Cloud Managed Services / MSP environments

Proven experience leading a team in client-facing delivery

Multi-cloud exposure (AWS/Azure secondary) preferred


Technical Skills (Must-Have)

  • GCP: GKE, IAM, VPC, Cloud Monitoring, Stackdriver, KMS — demonstrated in work
  • experience
  • Kubernetes: GKE — production cluster management, Helm
  • IaC: Terraform — module-level, reusable frameworks
  • Observability: Prometheus, Grafana, Thanos or equivalent
  • Security: IAM, Zero-trust, DevSecOps, CSPM tools
  • Scripting: Python or Go
  • FinOps: GCP cost governance demonstrated


Nice to Have

  • GCP Professional Cloud Architect / Pro DevOps Engineer certification
  • AWS / Azure secondary experience
  • CKA (Certified Kubernetes Administrator)
  • ITIL / change management awareness
  • APAC client delivery experience


Why Searce?

🏆 Google Cloud Partner of the Year — APAC 2026

🌍 Work with APAC enterprise clients across multiple industries

🤖 AI-first, engineering-led culture

📈 Lead-level ownership with real career growth

🤝 HAPPIER values — Humble, Adaptable, Positive, Passionate, Innovative, Excellence,

Responsible

Read more
P99soft
Uma Sahithya Parimi
Posted by Uma Sahithya Parimi
Hyderabad
4 - 5 yrs
₹10L - ₹18L / yr
Reliability engineering
AWS CloudFormation
DevOps
RCA
skill iconAmazon Web Services (AWS)
+11 more

Job Title: Site Reliability Engineer

Experience: 4 - 6 Years

Location: Hyderabad (Hybrid)


Job Summary

P99soft is hiring a Site Reliability / Cloud Engineer to manage and optimize scalable cloud infrastructure and microservices environments. The role involves implementing automation, monitoring systems, and ensuring high availability and performance across AWS/GCP platforms. You will work closely with cross-functional teams to handle deployments, incident management, and continuous improvements. Strong expertise in DevOps tools, containerization, and infrastructure as code is essential.


RESPONSIBILITIES:

* Implement, Own, maintain, monitor & support the backend servers & micro-services infrastructure for the studio titles which runs on wide-variety of tech stack

* Implement/maintain various automation tools for development, testing, operations and IT infrastructure

* Be available for on-call duty during production outages in 24/7 PAGERDUTY support

* Work very closely with all the disciplines/stakeholders and keep them communicated on all impacted aspects

* Defining and setting development, test, release, update, and support processes for the SRE operations

* Excellent troubleshooting skills in areas of systems Infrastructure engineering

* Monitoring the processes during the entire lifecycle for its adherence and updating or creating new processes for improvement and minimising the workflow times

* Encouraging and building automated processes wherever possible

* Identifying and deploying cybersecurity measures by continuously performing vulnerability assessment and risk management

* Incidence management and root cause analysis

* Monitoring and measuring customer experience and KPIs



Key Skills : -

Containerisation & Orchestration : Docker, Kubernetes, Rancher, EKS, ECS, GKE, Elastic Beanstalk, Google App Engine

Cloud Platform: AWS, GCP

IaaC: Terraform, AWS-CloudFormation / GCP-CloudDeploymentManager, Ansible

Infra Monitoring : Prometheus, Datadog, Alert Manager, Thanos, AWS Cloudwatch

CI/CD : GITLAB CI-CD, Jenkins

Scripting : Python, Golang

VCS : GITLAB, Perforce, Subversion

OS : UBUNTU, CENTOS, Amazon LINUX, Redhat Linux


Nice to Have: -

1. Experience with supporting build workloads running on On-Prem VMWare based VMs.

2. Experience supporting Mobile App/Mobile Gaming build workloads that involves development platforms like UNITY, UNREAL, Xcode, Android Studio & Fastlane


Read more
DotPe

at DotPe

2 recruiters
Pawan Shukla
Posted by Pawan Shukla
Gurugram
2 - 5 yrs
₹12L - ₹20L / yr
skill iconAmazon Web Services (AWS)
skill iconKubernetes
skill iconMongoDB
Scripting

Role Overview

We are looking for a DevOps Engineer with 2 – 5 years of experience working on AWS-based production systems. This role strictly requires hands-on experience managing production databases (MongoDB / SQL).

👉 Please apply only if you have hands-on experience managing databases in a production environment.


Key Responsibilities:

  • Build and maintain AWS infrastructure with focus on availability, security, and cost.
  • Deploy and operate Kubernetes (EKS) clusters and containerized applications in production.
  • Manage and scale production databases (backup, replication, performance tuning, upgrades).
  • Design and maintain CI/CD pipelines for automated deployments.
  • Manage MongoDB or SQL in high-scale environments.
  • Implement Infrastructure as Code using Terraform.
  • Set up monitoring, logging, and alerting.
  • Handle production incidents, on-call support, and RCA.
  • Automate ops tasks using Bash / Python.


Requirements:

  • 2 – 5 years of DevOps experience.
  • Hands-on experience managing production databases (MongoDB or SQL).
  • Strong AWS & Kubernetes (EKS) experience in production.
  • Experience with CI/CD tools (Jenkins / GitLab CI / GitHub Actions / ArgoCD), Terraform, Linux and networking fundamentals.
  • Scripting: Bash / Python.
  • Immediate joiners preferred.


Experience with MongoDB is preferred, though candidates with solid SQL database experience will also be considered.


Read more
Service Co

Service Co

Agency job
via Vikash Technologies by Rishika Teja
Delhi, Gurugram, Noida, Ghaziabad, Faridabad
9 - 12 yrs
₹25L - ₹35L / yr
skill iconPython
skill iconDjango
FastAPI
RESTful APIs
Microservices
+6 more

Hiring for Lead Python Developer


Exp : 9+ yrs

Edu : BE/B.Tech

Work Location : Gurugram Hybrid


Skills :


Strong expertise in Python programming. 


Experience with frameworks such as: Django Flask FastAPI 


Strong understanding of: REST APIs Microservices Architecture OOPs Concepts Design Patterns 


Experience with databases: PostgreSQL MySQL MongoDB Redis 


Hands-on experience with: Docker Kubernetes CI/CD Pipelines Git/GitHub/GitLab 


Cloud platform experience: AWS / Azure / GCP 


Knowledge of message brokers: Kafka / RabbitMQ 


Experience with unit testing and automation frameworks. 


Leadership Skills Strong team handling and mentoring experience. 


Ability to drive technical discussions and architecture decisions. 


Excellent stakeholder management and communication skills. 


Experience managing Agile/Scrum teams. 


Preferred Qualifications Bachelor’s/Master’s degree in Computer Science or related field.


 Experience in large-scale enterprise applications.


Exposure to AI/ML integrations is an added advantage. Certifications in cloud or Python technologies are preferred.



Read more
Cibirix

at Cibirix

2 recruiters
Madhuri Rathore
Posted by Madhuri Rathore
Indore
10 - 15 yrs
₹10L - ₹45L / yr
skill iconReact.js
skill iconNodeJS (Node.js)
skill iconAmazon Web Services (AWS)
skill iconDocker
DevOps
+6 more

Key Focus Areas:-


• Scalable SaaS and distributed systems


• Microservices architecture


• Backend performance and API optimization


• Cloud infrastructure and DevOps


• Multi-tenant platforms


• Reliability, scalability, and engineering governance


Main Responsibilities:-


• Design fault-tolerant backend systems


• Lead architecture decisions and technical planning


• Improve system performance, scalability, and deployment workflows


• Guide senior developers and enforce engineering standards


• Reduce technical debt and solve complex production issues


• Collaborate with DevOps on CI/CD, monitoring, and AWS infrastructure.


• You will work on complex, enterprise-scale systems including large SaaS platforms, distributed backend architectures, pricing intelligence engines, CRM solutions, 3D configuration ecosystems, route optimization modules, payment integrations, high-volume APIs, and modern cloud infrastructure.


• We are not looking for a “people manager who used to code.


Required Skills:-


• Backend: Node.js, TypeScript, REST APIs, event-driven systems


• Frontend Understanding: React.js architecture concepts


• Databases: PostgreSQL, MySQL, Redis, query optimization


• Cloud/DevOps: AWS, Docker, CI/CD, Kubernetes (preferred)


• System Design: Microservices, distributed systems, high-availability SaaS platforms


Preferred Experience:-


• SaaS product companies


• Enterprise CRM systems


• Large-scale production platforms


• Pricing engines, WebGL/3D systems, AI-assisted workflows


• U.S.-based product engineering environments


Experience Needed:-


• 10–15 years in software engineering


• 3+ years in architect/principal engineer/engineering lead roles


• Proven experience building scalable production systems


Ideal Candidate:-


A technically strong, ownership-driven architect who:


• Thinks strategically


• Stays calm under pressure


• Solves complex engineering problems


• Mentors teams effectively


• Builds long-term scalable systems rather than quick fixes

Read more
Pune
4 - 7 yrs
Best in industry
DevSecOps
skill iconAmazon Web Services (AWS)
DevOps
Github Actions
sonarqube
+18 more

About NonStop io Technologies

NonStop io Technologies is a value-driven company with a strong focus on process-oriented software engineering. We specialize in Product Development and have a decade's worth of experience in building web and mobile applications across various domains. NonStop io Technologies follows core principles that guide its operations and believes in staying invested in a product's vision for the long term. We are a small but proud group of individuals who believe in the 'givers gain' philosophy and strive to provide value in order to seek value. We are committed to and specialize in building cutting-edge technology products and serving as trusted technology partners for startups and enterprises. We pride ourselves on fostering innovation, learning, and community engagement. Join us to work on impactful projects in a collaborative and vibrant environment.


Brief Description

We are looking for a skilled DevSecOps Engineer who can help design, automate, and secure cloud-native platforms for healthcare and life sciences clients. The ideal candidate will have hands-on experience with cloud security, infrastructure automation, CI/CD pipelines, compliance controls, and platform operations in regulated environments.


You will work closely with engineering teams, architects, security stakeholders, and client representatives to build secure-by-design systems that meet healthcare security and compliance requirements. Experience supporting AI/ML platforms, healthcare data platforms, or regulated workloads is highly desirable.


Roles and Responsibilities

  • Design and implement security controls aligned with healthcare regulations, including HIPAA, HITRUST, and industry security best practices
  • Ensure secure handling of Protected Health Information (PHI), Personally Identifiable Information (PII), and sensitive healthcare datasets
  • Support client security reviews, vendor assessments, penetration testing remediation, and compliance audits
  • Partner with engineering teams to establish secure SDLC practices and shift-left security initiatives
  • Implement cloud governance policies, security baselines, and compliance automation across multiple client environments
  • Build and maintain audit-ready logging, monitoring, and evidence collection mechanisms
  • Support disaster recovery, business continuity, and security incident response processes
  • Collaborate with healthcare product teams working on FHIR APIs, healthcare integrations, clinical applications, genomics platforms, or AI-enabled healthcare solutions
  • Experience working with healthcare, life sciences, biotech, genomics, digital health, or regulated SaaS platforms is strongly preferred
  • Understanding of PHI, PII, healthcare security controls, and healthcare compliance requirements
  • Familiarity with healthcare interoperability standards such as FHIR, HL7, SMART on FHIR, or healthcare APIs is a plus
  • Experience securing healthcare data platforms, analytics environments, AI/ML workloads, or regulated cloud environments is highly desirable
  • Ability to work directly with client stakeholders and communicate security risks, recommendations, and remediation plans
  • Experience participating in security assessments, audits, compliance reviews, and client-facing technical discussions
  • Strong documentation and security governance skills


Requirements

  • 4–7 years of experience in DevOps, DevSecOps, SRE, or Platform Engineering
  • Strong experience with AWS, Azure, or GCP and cloud security best practices
  • Hands-on experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI, or Azure DevOps
  • Experience with security tools, including SonarQube, Snyk, Checkmarx, Fortify, Veracode, or similar platforms
  • Strong understanding of vulnerability management, IAM, threat detection, and security scanning
  • Experience implementing compliance controls aligned with one or more of the following frameworks:
  • HIPAA
  • HITRUST
  • SOC 2
  • ISO 27001
  • NIST Cybersecurity Framework
  • PCI-DSS (where applicable)
  • FDA-regulated software environments (preferred)
  • Proficiency with Terraform, CloudFormation, ARM, Docker, Kubernetes, Linux, and shell scripting
  • Experience with monitoring and observability tools such as Prometheus, Grafana, ELK, or Datadog
  • Exposure to MLOps/AI platforms, model deployment, or AI workload management is desirable
  • Strong troubleshooting, automation, networking, and cloud security skills


Why Join Us?

  • Opportunity to work on a cutting-edge healthcare product
  • A collaborative and learning-driven environment
  • Exposure to AI and software engineering innovations
  • Excellent work ethic and culture

If you're passionate about technology and want to work on impactful projects, we'd love to hear from you!


Read more
CAW.Tech

at CAW.Tech

5 recruiters
Ranjana Singh
Posted by Ranjana Singh
Hyderabad
6 - 10 yrs
₹30L - ₹45L / yr
Generative AI
Large Language Models (LLM)
skill iconAmazon Web Services (AWS)
SAGE
skill iconDocker
+13 more

Role:

We're scaling an AI platform that powers Computer Vision and real-time video analytics, and we need someone to own the architecture, not just contribute to it. Our AI workloads are moving to the cloud, our models need hardening for production, and our engineering practices need a north star. The gap between prototype and production is costing us velocity.


Responsibilities:

  • Architect and govern the end-to-end AI platform from data lake to model serving on AWS (SageMaker preferred).
  • Lead cloud migration of AI workloads with a cloud-native, containerised approach (Docker, Kubernetes, CI/CD).
  • Own the AI roadmap model accuracy, MLOps maturity, observability, and scalability.
  • Set engineering standards across ML development, deployment, and monitoring.
  • Mentor engineers and data scientists; reduce key-person dependency across the org.
  • Drive productionisation, turn research into reliable, monitored, high-performance systems.


Requirements:

  • Deep expertise in Computer Vision, Deep Learning, and Video/Real-Time Analytics.
  • Fluency in PyTorch and/or TensorFlow, Python, and ML architecture patterns.
  • Hands-on with AWS SageMaker, Docker, Kubernetes, CI/CD pipelines.
  • Experience with model monitoring, observability tools, and data lake architectures.
  • A track record of leading AI strategy, not just executing it.


Read more
CAW.Tech

at CAW.Tech

5 recruiters
Ranjana Singh
Posted by Ranjana Singh
Hyderabad
8 - 12 yrs
Best in industry
skill iconPython
skill iconJava
TypeScript
skill iconAmazon Web Services (AWS)
skill iconDocker
+9 more

About the Role

We're looking for a hands-on AI Architect to lead the design and delivery of AI-native applications. You will define the AI stack, establish engineering standards, and drive adoption of AI-assisted development across teams.


Responsibilities

  • Architect and build production-grade AI applications.
  • Evaluate and standardize AI frameworks, tools, and platforms.
  • Design agentic systems, RAG pipelines, and AI workflows.
  • Establish AI evaluation, testing, guardrails, and observability practices.
  • Drive AI-native development and delivery acceleration initiatives.
  • Partner with engineering and product teams on technical roadmap decisions.


Requirements

  • 7+ years of software engineering experience.
  • Strong experience with LLMs, RAG, embeddings, prompt engineering, and agent frameworks.
  • Experience with LangGraph, AutoGen, MCP, or similar ecosystems.
  • Hands-on experience with AI evaluations (DeepEval, Ragas), guardrails, and structured outputs.
  • Strong Python and TypeScript/Java skills.
  • Experience with Kubernetes, CI/CD, IaC, observability, and event-driven architectures.
  • Proven experience shipping AI applications to production.
Read more
Whitefield Careers
Whitefield Team
Posted by Whitefield Team
Bengaluru (Bangalore)
6 - 12 yrs
₹15L - ₹25L / yr
DevOps
skill iconAmazon Web Services (AWS)
skill icongrafana
prometheus
Terraform
+2 more

Required Skills

● Experience: Minimum of 5 years of professional experience in a DevOps Engineer

role

● Cloud Proficiency: Proven experience with at least one major cloud provider (AWS,

Azure, or GCP).

● Scripting & Programming: Strong scripting skills in languages such as Bash, Python,

or Go.

● IaC Tools: Hands-on experience with Terraform.

● Container Technology: Expertise in Docker and Kubernetes.

● CI/CD Tools: Proficient with CI/CD platforms like Jenkins, GitLab CI, or Travis CI.

● Configuration Management: Experience with configuration management tools like

Ansible, Chef, or Puppet.

● Version Control: Strong knowledge of Git and branching strategies.

● Problem-Solving:Excellent problem-solving abilities and a commitment to automation

and continuous improvement.

Read more
Improving
Leena Lahari
Posted by Leena Lahari
Bengaluru (Bangalore)
5 - 12 yrs
Best in industry
skill iconNodeJS (Node.js)
TypeScript
RESTful APIs
skill iconPostgreSQL
Microservices
+7 more

Backend Engineer (Node.js)

Experience - 4+ Years

Location - Bangalore (Hybrid)


We are looking for a Backend Engineer to help design, scale, and continuously improve the platform.


What You'll Do -

  • Engage in fast-paced agile application development teams in terms of organizing, managing and executing your work with minimal supervision.
  • Participate with the team and stakeholders on requirements understanding, design and solutions in line with the product architecture.
  • Build and maintain high performing, high quality backend services and APIs with end-to-end responsibility from development to technical QA (unit tests) by applying technology and best practices.
  • Design and manage scalable data pipelines, integrations, and microservices that power the platform.
  • Provide necessary technical documentation to enable visibility and maintainability of designs and code.
  • Play an integral role in acquiring and learning technology trends, expertise, and best practices to keep the team's collective knowledge up to date.


What We're Looking For

Education

  • Masters/bachelor's degree in engineering from a reputed university with an excellent academic record.
  • Any relevant certifications on technology from reputed sources is a plus.


Work Experience

  • 4+ years of relevant industry experience. Experience working in Energy, Oil & Gas or Engineering domain is a definite plus.
  • Experience working in distributed agile developments. Experience working in international teams is relevant.


Technical Skills

  • Hands-on experience building scalable backend services and REST/GraphQL APIs using Node.js and TypeScript. Further hands-on experience with NestJS or Express.js is a plus.
  • Strong understanding of databases including relational (PostgreSQL, SQL Server) and NoSQL (MongoDB, Redis) systems, including query optimization and data modeling.
  • Good understanding of event-driven architecture and messaging systems such as Kafka, RabbitMQ, or Azure Service Bus.
  • Good understanding of the concepts and application of unit and integration tests in backend development. Hands-on experience using Jest or Mocha is a plus.
  • It is an advantage if you have hands-on experience working in Cloud (Azure) applications especially in areas of DevOps, CI/CD, and Containerization (Docker, Kubernetes).
  • It will be a valuable addition if you have experience and knowledge working for energy, oil & gas or engineering industries.
Read more
Leading provider of Capital Market solutions in India

Leading provider of Capital Market solutions in India

Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
4 - 7 yrs
₹12L - ₹18L / yr
skill iconPython
skill iconGo Programming (Golang)
RESTful APIs
Linux/Unix
Object Oriented Programming (OOPs)
+3 more

Core Responsibilities:

  • Design & Development: Architect and implement scalable backend services and APIs using Python or Golang, ensuring high performance, resilience, and extensibility.
  • System Ownership: Take end-to-end ownership of critical modules, from design and development to deployment and support.
  • Technical Leadership: Conduct design and code reviews, enforce best practices, and mentor junior engineers to raise the team’s technical bar.
  • Collaboration: Work closely with product managers, architects, and other engineers to translate business requirements into technical solutions.
  • Performance & Reliability: Troubleshoot complex issues in production systems, identify root causes, and design sustainable long-term solutions.
  • Innovation: Evaluate new technologies, contribute to proof-of-concepts, and recommend tools that can improve developer productivity.
  • Process Improvement: Drive initiatives to improve coding standards, CI/CD pipelines, and automated testing practices.
  • Knowledge Sharing: Document designs, create technical guides, and share insights with the broader engineering team.


Experience and Expertise:

  • 4–7 years of backend development experience with Python or Golang.
  • Strong expertise in designing, developing, and scaling microservices and distributed systems.
  • Solid understanding of concurrency, multi-threading, and performance optimization.
  • Proficiency with databases (SQL/NoSQL), caching systems (Redis, Memcached), and messaging systems (Kafka, RabbitMQ, etc.).
  • Hands-on experience with Linux development, Docker, and Kubernetes.
  • Familiarity with cloud platforms (AWS/GCP/Azure) and related services.
  • Strong debugging, profiling, and optimization skills for production-grade systems.
  • Experience with AI-powered development tools is a strong plus; familiarity with concepts like 'agentic coding' for workflow automation or 'context engineering' for leveraging LLMs in system design is highly desirable.


Skills:

  • Strong problem-solving ability, with experience handling complex technical challenges.
  • Ability to lead technical initiatives and mentor junior engineers.
  • Excellent communication skills to collaborate with cross-functional teams and articulate trade-offs.
  • Self-motivated, proactive, and able to operate independently while aligning with team goals.
  • Passionate about engineering culture, quality, and developer productivity.


Read more
Leading provider of Capital Market solutions in India

Leading provider of Capital Market solutions in India

Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
2 - 4 yrs
₹8L - ₹12L / yr
skill iconPython
skill iconGo Programming (Golang)
RESTful APIs
skill iconDocker
skill iconKubernetes
+3 more

Core Responsibilities:

  • Design, develop, and maintain backend services and APIs using Python or Golang.
  • Write high-quality, testable, and maintainable code with a focus on performance and scalability.
  • Implement automated tests and contribute to CI/CD pipelines.
  • Collaborate with product, QA, and DevOps teams for end-to-end feature delivery.
  • Troubleshoot production issues and provide timely resolutions.
  • Participate in design and architecture discussions to improve system efficiency.
  • Contribute to improving development processes, coding standards, and best practices.


Experience and Expertise:

  • 2–4 years of experience in backend development with Python or Golang.
  • Solid understanding of RESTful APIs, microservices, and distributed systems.
  • Strong knowledge of data structures, algorithms, and OOPS principles.
  • Hands-on experience with relational and/or NoSQL databases.
  • Familiarity with Linux development, Docker, and basic cloud concepts (AWS/GCP/Azure).
  • Proficiency with Git and version control workflows.
  • Familiarity with AI-powered development tools or exposure to projects involving large language models (LLMs) is a plus.


Skills:

  • Strong analytical and debugging skills with the ability to solve complex problems.
  • Good communication and collaboration skills across teams.
  • Ability to work independently with minimal supervision while being a strong team player.
  • Growth mindset – eagerness to learn new technologies and improve continuously.
Read more
Leading provider of Capital Market solutions in India

Leading provider of Capital Market solutions in India

Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
1 - 2 yrs
₹2L - ₹7L / yr
skill iconPython
skill iconGo Programming (Golang)
Linux/Unix
skill iconDocker
skill iconKubernetes
+3 more

Core Responsibilities:

  • Design, develop, and maintain backend services and APIs using Python or Golang.
  • Write high-quality, testable, and maintainable code with a focus on performance and scalability.
  • Implement automated tests and contribute to CI/CD pipelines.
  • Collaborate with product, QA, and DevOps teams for end-to-end feature delivery.
  • Troubleshoot production issues and provide timely resolutions.
  • Participate in design and architecture discussions to improve system efficiency.
  • Contribute to improving development processes, coding standards, and best practices.


Experience and Expertise:

  • 1-2 years of experience in backend development with Python or Golang.
  • Solid understanding of RESTful APIs, microservices, and distributed systems.
  • Strong knowledge of data structures, algorithms, and OOPS principles.
  • Hands-on experience with relational and/or NoSQL databases.
  • Familiarity with Linux development, Docker, and basic cloud concepts (AWS/GCP/Azure).
  • Proficiency with Git and version control workflows.
  • Familiarity with AI-powered development tools or exposure to projects involving large language models (LLMs) is a plus.


Skills:

  • Strong analytical and debugging skills with the ability to solve complex problems.
  • Good communication and collaboration skills across teams.
  • Ability to work independently with minimal supervision while being a strong team player.
  • Growth mindset – eagerness to learn new technologies and improve continuously.
Read more
Maropost
Rishika Mehra
Posted by Rishika Mehra
Chandigarh, Mohali
5 - 15 yrs
₹25L - ₹32L / yr
Google Cloud Platform (GCP)
skill iconPython
Bash
skill iconGo Programming (Golang)
skill iconKubernetes
+3 more

Everything we do is for our customers!


Featured on Deloitte's Technology Fast 500 list and G2's leaderboard, Maropost offers a unified commerce experience that our customers need, transforming ecommerce, retail, marketing automation, merchandising, helpdesk and AI operations with one platform designed to scale for fast-growing businesses. With a relentless focus on our customers’ success, we are motivated by customer obsession, extreme urgency, excellence and resourcefulness to to power 5,000+ global brands while we head to 100,000+.


Driven by the same customer-centric mentality as above, we empower businesses to achieve their goals and grow alongside us. If you're a driver and not passenger and are ready to make a significant impact and be part of our transformative journey, Maropost is the place for you.


The Opportunity:


Thrive on change and grow beyond limits! We are looking for a bold thinker who sees a chance to learn and define what's possible with every challenge! Ready to make an impact? Welcome to Maropost and you can turn ideas into action!


We are seeking an experienced Senior Platform Engineer to join our growing team responsible for building, scaling, and maintaining the core infrastructure of our SaaS product on Google Cloud Platform (GCP). You will architect and implement robust, secure, and scalable systems using Kubernetes for orchestration and terraform for infrastructure-as-code.


What you'll be responsible for:


  • Design, implement, and optimize cloud-native infrastructure on GCP to support our SaaS product. 
  • Lead the deployment and management of Kubernetes clusters, ensuring high availability and security. 
  • Develop and maintain Terraform scripts for automated provisioning of cloud resources and environments. 
  • Collaborate with development and product teams to deliver scalable and reliable solutions. 
  • Establish CI/CD pipelines and automate infrastructure deployment and application delivery workflows. 
  • Set up and monitor observability solutions (logging, monitoring, alerting) to ensure operational excellence. 
  • Identify and resolve performance bottlenecks, contributing to continuous platform optimization. 
  • Drive adoption of DevOps practices and mentor junior engineers in Kubernetes, GCP, and Terraform best practices. 
  • Participate in on-call rotations and support incident response as a senior technical resource in 24/7 rotational environment.


What you'll bring to Maropost:


  • 5+ years of experience in platform engineering, SRE, or DevOps, with a focus on cloud-native SaaS environments. 
  • Strong hands-on expertise with GCP services (Compute Engine, GKE, Cloud SQL, IAM, etc.). 
  • Advanced skills in Kubernetes cluster management, workload orchestration, and troubleshooting. 
  • Deep proficiency in Terraform and infrastructure-as-code concepts for cloud automation. 
  • Experience designing and maintaining CI/CD pipelines (Bitbucket, Build kite, Argo CD , or similar). 
  • Proficient in scripting/programming (Python, Go, or Bash). 
  • Solid understanding of networking, security, and compliance in cloud environments. 
  • Familiarity with monitoring and observability tools (Prometheus, Grafana, Stack driver, etc.). 
  • Excellent problem-solving, communication, and team collaboration skills. 
  • Exposure to service mesh (Istio, Linkerd), cloud cost optimization, or FinOps. 
  • Experience with other public clouds (AWS, Azure). 
  • Certifications in GCP, Kubernetes, or Terraform. 
  • You exemplify Maropost’s Values: 
  • Customer Obsessed 
  • Extreme Urgency 
  • Excellence 
  • Resourceful


Message from the Founders: Maropost is looking for builders - people who want to drive our business forward at all costs in order to achieve the goals we have both short and long term for the results and outcomes that that will bring to us all.

 

If that isn't for you that’s ok, for those of you that it is please get in touch with us!

Read more
The supreme consultancy
Pune
4 - 7 yrs
₹23L - ₹38L / yr
SQL
skill iconPython
skill iconMachine Learning (ML)
skill iconDeep Learning
Image Embeddings
+17 more

Role & Responsibilities


Responsibilities


• Contribute to the development and optimization of enterprise-wide search systems and models.

• Design and implement algorithms to improve indexing, query relevance, and search accuracy.

• Support taxonomy, ontology, and metadata model creation for better search outcomes.

• Collaborate with business units (Loans, Insurance, Investments) to build AI-enabled search features.

• Conduct analysis of user behavior and system metrics to refine search performance.

• Work with engineers, product managers, and designers to deliver integrated search solutions.

• Develop production-grade ML systems for ranking, personalization, and recommendations.

• Participate in proof-of-concept initiatives with internal and external partners.

• Follow best practices in software engineering including CI/CD, testing, and monitoring.

• Keep abreast of emerging developments in AI/ML to apply them in practical solutions.


Ideal Candidate


Strong Data Scientist / AI Engineer / Machine Learning Engineer profiles.

Mandatory (Experience 1) – Must have minimum 5+ years of hands-on experience in Data Science, Machine Learning, Applied AI, NLP, Deep Learning, or Generative AI solutions.

Mandatory (Experience 2) – Must have strong hands-on experience in Python programming, SQL, data analysis, feature engineering, model development, and production-grade ML applications.

Mandatory (Experience 3) – Must have experience working with Machine Learning and Deep Learning frameworks such as PyTorch, TensorFlow, Keras, Scikit-learn, or equivalent.

Mandatory (Experience 4) – Must have hands-on experience working on NLP, embeddings, semantic search, text classification, document understanding, recommendation systems, or similar AI/ML use cases.

Mandatory (Experience 5) – Must have experience working with Large Language Models (LLMs) such as GPT, Llama, Mistral, Claude, Gemini, Phi, or similar foundation models.

Mandatory (Experience 6) – Must have hands-on experience building or implementing RAG (Retrieval Augmented Generation) systems, vector search, knowledge retrieval, embeddings, chunking, indexing, or semantic retrieval solutions.

Mandatory (Experience 7) – Must have experience working with Git, CI/CD practices, production environments, and scalable AI/ML systems.

Mandatory (CTC) – The CTC breakup offered will be 75% fixed + 25% variable, as per company policy.

Mandatory (Age) - Candidate's Age should be below 30 Years

Preferred (Experience 1) – Experience with MLFlow, Kubeflow, Airflow, Prefect, Feature Stores, Model Registry, or MLOps/LLMOps frameworks.

Preferred (Experience 2) – Experience working with Vector Databases, Spark, PySpark, distributed ML pipelines, large-scale data processing, or real-time ML systems..

Preferred (Experience 3) – Familiarity with Docker, Kubernetes, Azure, AWS, GCP, cloud-native AI deployments, and scalable ML architecture.

Preferred (Company) – Candidates from AI-first startups, Fintech, Banking, Lending, Fraud Analytics, Risk Analytics, Product Companies, SaaS organizations, or data-driven technology companies.


Kindly provide the following details while sending your CV: (Mandatory details)


1) Date of Birth

2) Current Location-

3) Current CTC-

4) Expected CTC-

5) Notice Period-

6) Ready to relocate to Pune?



Regards,

The Supreme Consultancy

Website- https://lnkd.in/eawfxfxU

Read more
Bengaluru (Bangalore)
5 - 12 yrs
₹20L - ₹45L / yr
skill iconJavascript
skill iconJava
skill iconNodeJS (Node.js)
skill iconPython
skill iconGo Programming (Golang)
+12 more

Job Title : Generalist Fullstack Engineer (Java / Node.js / React / AI-Driven Engineering)

Experience : 5 to 12 Years

Location : Bengaluru (Koramangala) – Hybrid (3 Days WFO)

Employment Model : C2H

Joining : Immediate to 20 Days Notice Only

Open Positions : 4


About the Role :

We are hiring a hands-on Generalist Fullstack Engineer for a high-impact engineering team. This is a 90% coding role focused on building scalable products across frontend, backend, cloud, and modern AI-enabled engineering practices.


Mandatory Skills :

Full Stack Development (Frontend + Backend), JavaScript / Java / Node.js / Python / Go, ReactJS or modern UI frameworks, Microservices, AWS, Docker, Kubernetes, SQL/NoSQL, CI/CD, Terraform (IaC), Automated Testing (TDD), AI-enabled software delivery, and strong hands-on coding experience.


Key Requirements :

  • 5 to 12 years of software development experience.
  • Strong expertise in JavaScript, Java, Node.js, Python, or Go.
  • Hands-on experience with ReactJS / modern frontend frameworks.
  • Strong backend depth with microservices & event-driven architecture.
  • Experience with AWS, Docker, Kubernetes.
  • Exposure to AI tools/frameworks in software delivery.
  • Strong understanding of SQL/NoSQL databases & data modelling.
  • Experience with CI/CD, automated testing, Terraform (IaC).
  • Working knowledge of TDD, pair programming, continuous integration, security best practices.


Roles & Responsibilities :

  • Develop and own production-grade full-stack applications.
  • Build AI-first engineering workflows to improve delivery efficiency.
  • Drive architecture decisions and engineering best practices.
  • Deliver scalable cloud-native solutions.
  • Collaborate across product, QA, design, and architecture teams.
  • Improve system reliability through monitoring, automation, and testing.

Ideal Candidate :

✔ Strong end-to-end ownership (Frontend + Backend).

✔ Deep hands-on coding mindset (not people-management focused).

✔ Strong AWS + SQL exposure.

✔ Experience building and managing UI state and frontend components.

✔ Stable career progression (avoid frequent job changes).


Interview Process :

  1. Technical Round – GT (Assemble)
  2. Technical Round – Client
  3. Technical Round – Client
  4. HR Discussion & Offer
Read more
 Digital Product Engineering company

Digital Product Engineering company

Agency job
via Unique Occupational by Mantasha Naaz
Bengaluru (Bangalore)
9 - 11 yrs
₹32L - ₹35L / yr
Debian Linux
Edge technology
Proxmox
Linux administration
DevOps
+6 more

Associate Principal Engineer, Linux Administrator

Location: Bengaluru, India (Hybrid)

Employment Type: Full-time

Experience:9-11 years



Job description

REQUIREMENTS:


  • Strong experience in DevOps, Platform Engineering, and Infrastructure Automation
  • Deep hands-on expertise in Linux Administration (RHEL, CentOS, Ubuntu) – OS hardening, security, patching, and performance management (Must Have)
  • Strong experience with Cloud Technologies – Public & Private Cloud environments (Must Have)
  • Hands-on experience with Infrastructure as Code (IaC) using Terraform (Must Have)
  • Strong automation expertise using Ansible for configuration management and infrastructure provisioning (Must Have)
  • Experience building and managing CI/CD pipelines and end-to-end deployment automation
  • Strong experience with Kubernetes administration, orchestration, and cluster management (Must Have)
  • Hands-on experience with Docker containerization and Helm package management
  • Experience managing large-scale development and infrastructure environments
  • Strong understanding of Networking concepts, connectivity, design, troubleshooting, and network automation
  • Experience with Observability & Monitoring tools and best practices
  • Experience with Proxmox virtualization platform administration and management
  • Knowledge of Edge Technologies and distributed infrastructure environments
  • Basic understanding and administration of Active Directory (AD)
  • Experience implementing AI-driven Automation solutions and operational efficiencies
  • Strong understanding of infrastructure security, compliance, and governance
  • Experience working in Agile/Scrum environments
  • Strong troubleshooting, analytical, and problem-solving skills
  • Excellent communication and stakeholder management skills

RESPONSIBILITIES:

  • Design, build, and manage scalable infrastructure platforms across cloud and on-premise environments
  • Administer and maintain Linux servers including security hardening, patching, performance tuning, and troubleshooting
  • Develop and manage Infrastructure as Code (IaC) solutions using Terraform
  • Automate infrastructure provisioning, configuration management, and operational tasks using Ansible
  • Design, implement, and maintain CI/CD pipelines for application and infrastructure deployments
  • Deploy, manage, and optimize Kubernetes clusters and containerized workloads
  • Manage Docker environments and Helm-based application deployments
  • Design and implement network solutions ensuring security, reliability, and scalability
  • Monitor infrastructure health, performance, and availability using observability and monitoring tools
  • Manage and support Proxmox virtualization environments
  • Implement AI-driven automation initiatives to improve operational efficiency and reduce manual effort
  • Support edge infrastructure deployments and distributed computing environments
  • Collaborate with development, security, and operations teams to deliver reliable platform services
  • Troubleshoot production incidents and perform root cause analysis
  • Define infrastructure standards, automation frameworks, and operational best practices
  • Ensure high availability, scalability, security, and reliability of infrastructure platforms
  • Mentor junior engineers and provide technical leadership on DevOps and platform engineering initiatives
  • Participate in Agile ceremonies and contribute to continuous improvement initiatives
  • Work closely with stakeholders to understand infrastructure requirements and deliver optimal solutions

Qualifications

Bachelor’s or master’s degree in computer science, Information Technology, or a related fields

Read more
TalentXO
tabbasum shaikh
Posted by tabbasum shaikh
Bengaluru (Bangalore)
10 - 15 yrs
₹70L - ₹80L / yr
skill iconKubernetes
cloud architecture
Platform engineering
Observability
security and compliance
+3 more

Role & Responsibilities

Responsibilities:

  • Infrastructure at Scale
  • Design and evolve our cloud-native infrastructure (AWS/Kubernetes), ensuring availability, performance, and cost efficiency across regions and products.
  • Platform & Developer Experience
  • Build internal tools and platforms that help engineers deploy, monitor, and scale their services independently — with minimal friction and maximum confidence.
  • CI/CD & Release Automation
  • Architect secure, fast, and scalable CI/CD pipelines across multiple environments using tools like GitHub Actions, and Jenkins.
  • Reliability Engineering
  • Champion observability, SLOs, and incident response practices. Drive a culture of proactive performance monitoring and resilient system design.
  • Security & Governance
  • Integrate DevSecOps practices — from policy-as-code and automated audits to secure secrets management and vulnerability scanning.
  • Mentorship & Thought Leadership
  • Guide and mentor DevOps and SRE engineers. Partner closely with platform developers on infrastructure strategy, deployment patterns, and production readiness.

Ideal Candidate

  • Strong Principal DevOps Engineer Profile
  • Mandatory (Experience 1): Must have 10+ years in DevOps / SRE / Infrastructure roles with hands-on experience (clear scale signals like traffic, uptime, latency, infra size should be mentioned) in B2B SAAS companies
  • Mandatory (Experience 2): Must have worked in Principal / Staff / Lead DevOps / SRE / Platform Engineer role and demonstrated org-level ownership - setting infra roadmap, defining DevOps charter, or structuring the platform function not just domain-level technical ownership
  • Mandatory (Experience 3): Must show evidence of strategic authorship, defined multi-year infra/platform strategy, drove company-wide architectural shifts as an initiator (not implementer), or directly interfaced with VP Eng / CTO / product leadership on infra direction
  • Mandatory (Experience 4): Must have B2B SaaS company experience with multi-tenant architecture OR multiple production stacks (multi-env / multi-client systems)
  • Mandatory (Tech Skills 1 - Cloud & Infra): AWS (VPC, EKS, EC2, RDS, networking), Kubernetes (EKS) at scale, Designing high availability, multi-region systems
  • Mandatory (Tech Skills 2 - Automation & IaC): Terraform (must-have), Helm / GitOps, Strong scripting (Python / Go / Bash)
  • Mandatory (Tech Skills 4 - Reliability & Observability): SRE principles (SLOs, SLIs, error budgets), Monitoring tools (Prometheus, Grafana, Datadog), Alerting, on-call, incident management
  • Mandatory (Leadership): Must demonstrate leadership experience in an individual contributor capacity having mentored senior engineers, driven cross-team technical alignment, or anchored org-wide initiatives without having moved into a people management or engineering manager role
  • Mandatory (Company): Strong B2B SaaS product companies only
  • Preferred (Education): B.Tech in Computer Science or related fields


Read more
Fonada
Karandeep Singh
Posted by Karandeep Singh
Noida
5 - 8 yrs
₹15L - ₹20L / yr
DevOps
skill iconAmazon Web Services (AWS)
Microsoft Windows Azure
Google Cloud Platform (GCP)
VMware vSphere
+8 more


About the Role 

We are looking for a Senior DevOps Engineer to lead the design, automation, and scaling of our hybrid cloud infrastructure spanning public cloud and private/on-premises environments. You will partner closely with software engineering, security, and product teams to build reliable, secure, and high-performance systems that support rapid product delivery. This is a hands-on role with significant influence over our infrastructure strategy, deployment workflows, and engineering culture. 


Key Responsibilities 

  • Architect, deploy, and maintain scalable, highly available infrastructure across both public cloud (AWS, Azure, GCP) and private cloud platforms (OpenStack, VMware vSphere/Tanzu, Nutanix, or similar). 
  • Operate and maintain on-premises infrastructure: hypervisors, compute, storage (Ceph, NetApp, SAN/NAS), networking (SDN, VLANs, BGP, MPLS), and hardware capacity planning, alongside their public cloud equivalents. 
  • Design and own CI/CD pipelines that deploy seamlessly across public and private environments. 
  • Implement and manage Infrastructure as Code (Terraform, Ansible, Pulumi) with strong version control and review practices, using providers for both public and private cloud platforms. 
  • Manage container orchestration (Kubernetes, ECS, OpenShift, Rancher) across managed cloud services and self-managed/bare-metal clusters, including upgrades, autoscaling, and workload reliability. 
  • Build observability into all systems through logging, metrics, tracing, and alerting (Prometheus, Grafana, Datadog, ELK, or similar) with unified visibility across hybrid environments. 
  • Champion security best practices: secrets management, IAM hardening, network segmentation, vulnerability scanning, and compliance (SOC 2, ISO 27001, HIPAA, or data-sovereignty requirements). 
  • Lead incident response, root-cause analysis, and post-mortems; drive long-term reliability improvements and SLO/SLA adherence. 
  • Optimize cost, capacity, and resource utilization across public cloud spend and on-premises hardware without compromising performance or availability. 
  • Partner with data center operations and network providers on hardware provisioning, firmware management, MPLS circuit management, and lifecycle planning. 
  • Mentor junior DevOps and software engineers; promote DevOps culture, automation-first thinking, and shared ownership of production. 
  • Evaluate and introduce new tools, platforms, and processes that improve developer productivity and system reliability. 

Required Qualifications 

  • 5+ years of experience in DevOps, SRE, or Platform Engineering roles, with at least 2 years at a senior level. 
  • Deep expertise with at least one major public cloud provider (AWS, Azure, or GCP) in production. 
  • Hands-on experience operating private cloud or virtualization platforms (OpenStack, VMware, Nutanix, or equivalent) in production. 
  • Strong experience with virtualization, storage systems, and enterprise networking in on-premises environments. 
  • Strong hands-on experience with Kubernetes in production, including both managed cloud and self-managed/bare-metal clusters. 
  • Proficiency in Infrastructure as Code (Terraform and Ansible strongly preferred). 
  • Solid scripting and programming skills in Python, Go, Bash, or similar. 
  • Experience designing and operating CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or ArgoCD. 
  • Strong Linux systems administration and networking fundamentals (TCP/IP, DNS, load balancing, VPNs, firewalls, routing, MPLS). 
  • Experience with monitoring and observability stacks (Prometheus, Grafana, Datadog, New Relic, ELK, or OpenTelemetry). 
  • Proven track record of leading incident response and improving system reliability. 
  • Excellent communication skills and the ability to collaborate across engineering, security, infrastructure, and product teams. 

Preferred Qualifications 

  • Experience designing hybrid and multi-cloud architectures, including secure connectivity (Direct Connect, ExpressRoute, MPLS, VPN, SD-WAN) between public and private environments. 
  • Familiarity with service meshes (Istio, Linkerd), API gateways, and GitOps workflows (ArgoCD, Flux). 
  • Background in security-focused or regulated environments and exposure to compliance frameworks. 
  • Experience with database administration (PostgreSQL, MySQL, Redis, MongoDB) in cloud-managed and self-hosted setups. 
  • Contributions to open-source DevOps or cloud infrastructure tooling. 
  • Relevant certifications (AWS Solutions Architect / DevOps Engineer, Azure Administrator, CKA, CKAD, RHCE, VMware VCP, OpenStack Certified Administrator, HashiCorp Terraform Associate). 


Read more
HireTo

at HireTo

1 video
Anshul Saxena
Posted by Anshul Saxena
Hyderabad
8 - 10 yrs
₹25L - ₹40L / yr
TypeScript
skill iconNodeJS (Node.js)
skill iconReact.js
skill iconRedis
skill iconPostgreSQL
+6 more


Senior Software Engineer


  • Minimum 5 years in typescript and Total of minimum 8 Years of experience
  • Excellent communication skills. English Language fluency is Mandatory.


We are looking for strong backend senior engineers

  • 8+ years experience as Senior Software Engineer
  • Have worked in medium- to big-sized companies for at least 3 years ( companies with 50+ developers)
  • Strong architecture and planning skills
  • Strong AI coding skills (being fluent in developing agents and skills) and using them in their development routines
  • Distributed programming knowledge
  • Typescript, Postgres & Kubernetes knowledge.


Ways of working

The engineers will be lead/managed by a UK Multinational company engineer manager (initially one currently based in Europe), and we are going to work on hiring one in Asia if we succeed in putting the team together

Engineers will have a direct contract with UK Multinational Company as career employees (entitled to company benefits and career progression) and will be trained in the UK Multinational Company product portfolio and work on several different projects.

We work in an environment where each engineer is presented with a business problem (or user story) and must come up with a detailed plan on how to tackle it (and explain and get approval from the team), which we do during the refinement meetings. In summary, we do not tell the engineers what to do, our expectation from a Senior engineer is that he tells us what to do (he will be trained, of course, in the best practices and technical specs of our environment).

Hiring process

  • Hiring manager interview
  • System Design interview
  • Coding (with AI) interview


Read more
Technogise Private Limited
Vandana BM
Posted by Vandana BM
Bengaluru (Bangalore)
12 - 18 yrs
Best in industry
skill iconPython
skill iconReact.js
skill iconDocker
skill iconKubernetes
skill iconAmazon Web Services (AWS)
+1 more

How do Technogisers function?


Value: Exploring technologies and implementing them on the projects provided they make business sense and deliver value.

Engagement: Be it offshore or onshore, we engage ourselves daily with the clients. This assists in building a trustworthy relationship at the same time, collaborating to come up with strategic solutions to business problems.

Solution: We are involved in providing hands-on contributions towards Backend & Front-end design and development at the same time, flourishing our DevOps culture.

Thought Leadership: Attend or present technical meet-ups/workshops/conferences to share knowledge and help build Technogise brand.

Note: All our roles are customer-facing roles.


This is a full-time 5 days work from office role as a Technology Consultant (Lead) located in Bangalore (Indiranagar).


Core Skills:

  • Strong system design and architecture expertise.
  • Production-level experience with React (TypeScript) and Python (FastAPI) — able to own a feature end-to-end
  • Familiarity with Classic ASP and .NET for maintaining and migrating legacy applications
  • Solid AWS fundamentals: EKS, RDS, Lambda, IAM, S3, CloudFront, and VPC networking
  • ⁠Kubernetes and Docker fluency; comfortable reading and writing deployment YAMLs
  • Experience with OpenResty or Nginx-based reverse proxy and API gateway configuration
  • Familiarity with security practices: OAuth2, Okta, parameterized queries, secrets management (Doppler or equivalent), and static analysis tooling
  • Strong written communication — able to produce RFCs, runbooks, and architectural decision records
  • ⁠Experience with Redis, Celery, or similar caching and task queue systems is a plus
  • Exposure to product analytics SDKs (Amplitude or equivalent) is a plus
  • Experience with MSSQL and SQL Server in a production environment is a plus
  • Prior experience in a VC, PE, or fintech engineering context is advantageous but not required


Going beyond:

  • Establish credibility within the team as a result of technical and leadership skills
  • Mentoring fellow team members within the project team and providing technical guidance to others beyond project boundaries
  • Actively participate in organisational initiatives
Read more
Recruiting Bond

at Recruiting Bond

2 candid answers
Pavan Kumar
Posted by Pavan Kumar
Bengaluru (Bangalore)
7 - 12 yrs
₹70L - ₹110L / yr
Platform as a Service (PaaS)
Platform Engineering
Agentic AI
AI Agents
Model Context Protocol (MCP)
+41 more

About My Client Company

We're building the learning infrastructure that transforms AI agents into true digital workers. While today's agents can reason and plan, they fail to do meaningful work because they lack real experience operating in apps. My Client Product gives agents continuously improving, reusable skills across 1000+ production-grade app connectors including Gmail, Linear, and Hubspot. We handle authentication, tool routing, retries, failure handling, and observability, making every action safe and dependable.


About the Role

Every enterprise is racing to make AI work — not as a demo, but as infrastructure that runs their business. My Client Product is becoming the critical layer that makes this possible: the platform that connects AI agents to 250+ real-world applications with production-grade auth, execution, and reliability.

We've built this for the cloud. Now we need to build it for the enterprise — and that means rethinking the platform from the ground up with the right abstractions, primitives, and architectural decisions that let us serve a massive, diverse set of enterprise customers without bespoke engineering for each one. This is a founding role.


Your Impact

  • Agent infrastructure platform: The foundational layer that enterprise AI agents run on — governance, observability, and control planes for MCP-powered agent ecosystems. You'll define how organizations monitor, audit, and manage AI agents operating at scale across their systems
  • The integration gateway: The secure, reliable bridge between an enterprise's AI agents and the outside world — every SaaS tool, internal system, and API they need to act on. Not just connectors, but a platform-grade gateway with the right trust, permissioning, and routing primitives
  • Platform primitives for scale: Multi-tenancy, isolation, configuration, and extensibility abstractions that let Composio serve thousands of enterprise customers without linear engineering cost
  • Enterprise-grade architecture: Deployment flexibility, security, and compliance as first-class platform capabilities — not bolted-on afterthoughts
  • The repeatable deployment motion: Turn enterprise onboarding from a services engagement into a product experience. Shorter cycles, fewer custom touches, more self-serve


What you bring

  • You've built platforms at genuine scale — not just high user counts, but high complexity: many customer types, deployment models, and integration surfaces
  • You think in abstractions and primitives. Your instinct is to find the right foundational model, not to solve each problem individually
  • You've shipped enterprise product capabilities (deployment flexibility, security, admin tooling, compliance) and understand them as product problems, not just checkboxes
  • You've built or shipped an AI product — or you're the person who can't stop tinkering. You're building agents on weekends, stress-testing the latest models, experimenting with MCP, and forming your own opinions on where agent architectures are headed. You have a point of view on this space, not just a resume line
  • You're a force multiplier. When you join a team, the entire product moves faster because the platform decisions are right


Skills & Expertise

Platform Engineering, AI Infrastructure, Agentic AI, AI Agents, MCP (Model Context Protocol), Distributed Systems, Enterprise Architecture, Multi-Tenant Architecture, Backend Platform Engineering, Enterprise SaaS, API Platform Engineering, Integration Platforms, SaaS Connectors, Cloud Infrastructure, AWS, GCP, Kubernetes, Docker, Terraform, Microservices, Event-Driven Architecture, API Gateway, OAuth 2.0, RBAC, IAM, Observability, OpenTelemetry, Prometheus, Grafana, Reliability Engineering, SRE, Python, Golang, Node.js, TypeScript, REST APIs, GraphQL, AI Orchestration, LLM Infrastructure, LangChain, LangGraph, OpenAI APIs, Claude APIs, RAG, Workflow Automation, AI Tool Routing, Enterprise Security, Compliance Engineering, Deployment Architecture, Configuration Management, Extensible Systems, Scalability Engineering, High-Scale Systems, Technical Strategy, Platform Primitives, Developer Platforms, Enterprise Integrations, Infrastructure Engineering, Founding Engineer Mindset.


This role demands deep platform thinking. You've designed systems where the abstractions were the product — where getting the primitives right meant the difference between a product that scales and one that drowns in customer-specific code.


You've done this within large organizations and seen what "enterprise-grade" actually means when thousands of teams depend on your platform. But you've also operated in environments where you had to build fast, make tradeoffs, and ship before the architecture was perfect.


The combination matters. Big-company pattern recognition with small-company intensity.


What We Offer

  • Lunch and dinner are provided in the office
  • $200/month learning and development budget
  • $1,000/month AI tool experimentation budget to automate, accelerate, and improve how you work
  • High-ownership role with direct exposure to leadership and company-building decisions
  • Competitive salary and equity


Read more
Searce Inc

at Searce Inc

3 recruiters
Reena Bandekar
Posted by Reena Bandekar
Pune
4 - 9 yrs
₹10L - ₹26L / yr
DevOps
skill iconKubernetes
Reliability engineering
Network Security
Amazon VPC
+4 more

Lead Cloud Reliability Engineer


Job Responsibilities

● Lead and manage the Cloud Reliability teams to provide strong Managed Services support to end-customers.

● Isolate, troubleshoot and resolve issues reported by CMS clients in their cloud environment

● Drive the communication with the customer providing details about the issue, current steps, next plan of action, ETA

● Gather client's requirements related to use of specic cloud services and provide assistance in seing them up and resolving issues

● Create SOPs and knowledge articles for use by the L1 teams to resolve common issues

● Identify recurring issues, perform root cause analysis and propose/implement preventive actions

● Follow change management procedure to identify, record and implement changes

● Plan and deploy OS, security patches in Windows/Linux environment and upgrade k8s clusters

● Identify the recurring manual activities and contribute to automation

● Provide technical guidance and educate team members on development and operations. Monitor metrics and develop ways to improve.

● System troubleshooting and problem-solving across plaorm and application domains. Ability to use a wide variety of open-source technologies and cloud services.

● Build, maintain, and monitor conguration standards.

● Ensuring critical system security through using best-in-class cloud security solutions.


Qualifications

● 4-7 years experience in Cloud Infrastructure and Operations domains and IT operational experience preferably in a global enterprise environment.

● Specialize in one or two cloud deployment platforms: AWS, GCP

● Hands on experience with AWS/GCP services (EKS, ECS, EC2, VPC, RDS, Lambda, GKE, Compute Engine)

● Understanding of one or more programming languages (Python, JavaScript, Ruby, Java, .Net)

● Logging and Monitoring tools (ELK, Stackdriver, CloudWatch)

● Knowledge on Conguration Management tools such as Ansible, Terraform, Puppet, Chef

● Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Mesos)

● Good analytical, communication, problem solving, and learning skills.

● Knowledge on programming against cloud plaorms such as Google Cloud Platform and lean development methodologies.

● Strong service aitude and a commitment to quality.

● Willingness to work in shifts.

Read more
Pune
4 - 12 yrs
₹5L - ₹20L / yr
skill iconJava
skill iconSpring Boot
Hibernate (Java)
skill iconKubernetes
RESTful APIs
+3 more

Job Title: Lead Software Engineer

Experience: 4 - 12 yr

Department: Software

Reports To: Senior Software Engineer / Software Architect



Purpose of the Role

The incumbent will be responsible for designing and developing robust software solutions for products in the domains of Warehouse Automation, Industrial Automation, Robotics, and IoT. The role includes defining software architecture, ensuring scalability and performance, and mentoring the development team to drive technical excellence and innovation.


Technical Skills Required

  • Proven experience in designing, developing, and deploying high-volume, scalable applications.
  • Expertise in distributed systems, microservices, and central system architectures.
  • Programming & Frameworks: Proficiency in Java 17+.
  • Experience with frameworks such as Spring, Hibernate, Kubernetes, and RESTful APIs.
  • Knowledge of JPA, MS SQL, and database modelling/design.
  • Hands-on experience with GCP, AWS, or Azure for cloud architecture.
  • Familiarity with virtualization and containerization technologies.
  • Strong skills in data modelling and database design.
  • Knowledge of secure coding practices.
  • Tech stack: Java, MSSQL, MySQL, Spring Boot, Redis, Data Structures, Linux, basics of Kubernetes.


Behavioural Skills Required

  • Attention to Detail (Proficient)
  • Problem Solving
  • Decision Making
  • Collaborative approach
  • Adaptability to a volatile environment
  • Accountability
  • Good Leadership skills


Job Responsibilities

  • Understand requirements and define database and application structure under guidance of Software Architect.
  • Write high-quality, scalable, and efficient code.
  • Prepare Functional Requirement Documents (FRD) based on inputs from BA team.
  • Guide junior and mid-level developers and provide technical support.
  • Collaborate to identify and fix technical issues in UAT/Production.
  • Work closely to meet project deadlines.
  • Take ownership of product implementations at customer sites.
  • Hands-on development for assigned modules/products.
  • Handle application performance in production.
  • Work with customers to understand automation requirements.
  • Review and merge code changes from the team.
  • Conduct sprint meetings, demos, and resolve development roadblocks.
  • Optimize code for performance and efficiency.
Read more
TalentWeave
Dudekula Lakshmi
Posted by Dudekula Lakshmi
Pune, Cognizant office is available
8 - 10 yrs
₹11L - ₹15L / yr
PowerBI
skill iconPython
Spark
skill iconAmazon Web Services (AWS)
ETL
+3 more

Role : AWS Data Engineer

Location : Anywhere in India - where cognizant office is available 

Contract duration : 12 months contract

Total Experience : 8-10 years

Budget-15LPA

Relevant Experience : 5+years with required skills & data engineering

Client : Cognizant


Job description : 


Python


Spark


Gradle 


AWS Services (ex: S3, Athena, Redshift, Transfer, SNS, SQS, Event Bridge, Lamda, Glue Data Catalog, RDS, EC2, IAM, Flink)


Kubernetes


Argo


Kafka / Kinesis streaming


SQL


ETL Data Pipelines


Data Modelling


Power BI/ Any reporting tools


New Relic / Terraform


Operational support - Batch monitoring, root cause analysis and fix

Read more
Bengaluru (Bangalore)
4 - 8 yrs
₹12L - ₹17L / yr
Azure
skill icongrafana
Scripting
prometheus
CI/CD
+4 more

Location: Bangalore 

Experience: 4-8 years

Interview Process - Two Rounds - First Round Virtual

Second Round-Face to Face at Bangalore


Key Skills Required

☁️ Cloud & Infrastructure

  • Strong hands-on experience with AWS Cloud Services
  • Proficiency in Terraform for Infrastructure as Code (IaC)
  • Experience in managing scalable cloud environments

⚙️ Containerization & Orchestration

  • Solid experience in Kubernetes (K8s) for container orchestration
  • Understanding of microservices architecture

🔄 CI/CD & DevOps

  • Hands-on experience with Azure DevOps (CI/CD pipelines)
  • Experience in build, release, and deployment automation

📊 Observability & Monitoring

  • Strong experience with Prometheus & Grafana
  • Expertise in setting up alerts, dashboards, and monitoring system health

🔐 API Gateway & Security

  • Experience with Kong or equivalent API Gateway
  • Understanding of API security controls (authentication, rate limiting, policies)

🧠 Core Technical Competencies

  • Strong Linux troubleshooting and system debugging skills
  • Proficiency in scripting (Bash / Python / Shell)
  • Understanding of networking concepts: TCP/IP, HTTP, DNS, Load Balancing
  • Experience with system architecture and distributed systems

🚨 SRE Responsibilities

  • Monitor system performance, reliability, and availability
  • Handle incidents, perform troubleshooting, and conduct RCA
  • Automate operational tasks to improve efficiency
  • Build and maintain scalable, resilient infrastructure
  • Collaborate with development and DevOps teams for system improvements

🧪 Good to Have

  • Experience with Finacle operations
  • Exposure to API/load testing tools like JMeter or Gatling
  • Familiarity with logging tools like Loki

🤝 Soft Skills

  • Strong communication and collaboration skills
  • Ability to document processes and technical workflows clearly

🎯 Ideal Candidate

A hands-on SRE/DevOps Engineer with strong exposure to:

  • AWS + Terraform + Kubernetes
  • CI/CD (Azure DevOps)
  • Monitoring + API Gateway security 


Read more
Wissen Technology

at Wissen Technology

4 recruiters
Shakthi M
Posted by Shakthi M
Bengaluru (Bangalore), Mumbai, Pune
5 - 14 yrs
Best in industry
skill iconJava
skill iconKubernetes
JMS/EMS

Strong Java Developer with hands-on experience in building, deploying, debugging, and supporting Java applications on Kubernetes-based container platforms. 

Focus is on application development and deployment support rather than Kubernetes administration or security management. Experience with JMS/Messaging systems is preferred.

Primary skills required:

  • Core Java
  • Kubernetes (Application Deployment & Execution)
  • Docker & Containerization
  • Build & Deployment Support
  • scripting can be python also if not java script 

Kubernetes Certified Application Developer (CKAD) certification would be an added advantage.

Read more
ChicMic Studios
Mohali
4 - 9 yrs
₹8L - ₹16L / yr
FastAPI
skill iconDocker
skill iconPostgreSQL
Celery
skill iconRedis
+3 more

Job Description:

Profile: Senior Python AI/ML Engineer

Required experience: 5-9 Years

Location: Mohali, Punjab (WFO only, no hybrid )

B.tech/ MCA preferred

Immediate joiners

We are looking for a Senior Python (AI/ML) Engineerto build and scale a production-grade AI-powered platform focused on image understanding, scene reconstruction, and automated media generation. This is a backend-heavy role requiring strong expertise in system design, asynchronous processing, and scalable API development.

You will work closely with AI/ML, frontend, and DevOps teams to deliver high-performance systems that power object detection, segmentation, and 3D reconstruction pipelines.

Key Responsibilities:

- Design and develop scalable backend services and APIs using FastAPI

- Build modular,maintainable systems with clear service contracts and abstractions

- Implement asynchronous processing pipelines for compute-heavy AI workloads

- Design and manage job queues (Celery, RQ, or similar) and integrate Redis

- Handle long-running and GPU-based tasks- Work with PostgreSQL and integrate object storage (S3 or equivalent)

- Integrate and productionize ML models

- Optimize inference pipelines for latency, batching, and throughput

- Ensure system reliability with retries, fallbacks, and monitoring

- Collaborate with Data Science, Frontend, and DevOps teams

Required Skills:

- 5+ years of Python development experience

- Hands-on experience with FastAPI and REST APIs

- Strong understanding of async programming and scalable systems

- Experience with Redis and queue systems (Celery, RQ, Kafka)

- Strong PostgreSQL experience

- Experience with Docker (mandatory); Kubernetes is a plus

- Familiarity with AWS, GCP, or Azure

Good to Have:

-Exposure to AI/ML pipelines (YOLO, DETR, SAM, etc.)

- Experience deploying ML models in production

- Understanding of GPU workloads and optimization

- Background in image/video processing or real-time systems

What We’re Looking For:

- Strong problem-solving and system design mindset

- Ability to handle real-world scale and unstructured data

- Experience in AI-integrated backend systems

Benefits:

  • Flexible schedule
  • Health insurance
  • Leave encashment
  • Provident Fund


Read more
Zocket

at Zocket

4 recruiters
Dhanesh Sridhar
Posted by Dhanesh Sridhar
Chennai
1 - 3 yrs
₹5L - ₹12L / yr
skill iconAmazon Web Services (AWS)
MLOps
skill iconJenkins
skill iconKubernetes
Amazon EKS
+4 more

Why this role exists

Our infrastructure footprint is growing faster than our headcount, and we believe most of that

gap should be closed by automation and AI agents — not by hiring more humans to do toil. We

need someone early in their career who treats manual work as a bug, ships scripts and agents

instead of tickets, and wants to grow into deeper ownership over the next two years.

You will not be the most senior person on the team. You will be the one who multiplies the team.

What you'll own

In your first 1 months

• Take ownership of one slice of our CI/CD pipeline and make it measurably

faster, more reliable, or cheaper. We expect a number on a dashboard to move.

• Build at least three internal automations that replace manual ops toil —

using AI agents (Claude Code, agentic CLIs, scripted LLM workflows) as your force

multiplier.

• Be the first responder for a defined set of alerts. Write the runbooks. Drive

the alert volume down.

• Support senior engineers on AI/ML infrastructure (GPU nodes, inference

services, model deployment) — observe, document, and gradually take on contained

changes under review.

By 3 months you should be

• The go-to person for at least two production systems.

• Shipping routine infrastructure changes without needing senior review.

• Treating "manual" as a code smell.

Required (we will reject without these)

• 0–3 years hands-on experience with one major cloud (AWS, GCP, or

Azure — one is fine, depth beats breadth).

• Fluent in Linux command line, bash, and at least one scripting language

(Python or Go preferred).

• Have shipped something to production that real users hit. A side project

counts; a graded coursework lab does not.

• Comfortable with Docker — you can explain what an image vs. a

container is and why it matters.

• Working knowledge of networking fundamentals: DNS, HTTP/HTTPS,

TLS, ports, basic subnets — enough to debug "it works on my machine."

• Git fluency: branches, merges, rebases, conflict resolution.

• CI/CD pipelines — you have authored or substantially modified pipelines

in GitHub Actions, GitLab CI, ArgoCD, Jenkins, or similar. Not just "I clicked Re-run."

• Kubernetes basics — kubectl for real work, can read pod logs,

understand deployments and services, can debug a CrashLoopBackOff without

panicking. You do not need to have run a cluster; you do need to have lived inside one.

• Active user of AI coding agents (Claude Code, Cursor, Copilot, agentic

CLIs, etc.). You should be able to walk us through specific tasks where they made you

faster, and specific tasks where they failed you and how you noticed. "I have tried it" is

not enough.

Bonus (real plus, not required)

• Infrastructure as Code: Terraform, Pulumi, or Ansible.

• Observability: Prometheus/Grafana, Datadog, OpenTelemetry, any APM.

• Have built or extended an LLM-based agent — a custom MCP server, a

scripted multi-step workflow, an internal tool that calls models in a loop. Anything beyond

chat-with-Claude.

• Exposure to GPU workloads, model serving (vLLM, Triton, TGI, etc.), or

ML pipelines.

What we don't care about

• Whether your degree is in CS — or whether you have a degree at all.

• Brand-name companies on your resume.

• Certifications. They are fine. They do not substitute for having shipped.

How we work

• We default to automation. If you do something manually twice, the third

time you script it or hand it to an agent.

• AI agents are part of the workflow, not a novelty. Expect interview

questions about exactly how you use them — and where you have caught them being

wrong.

• Small, reversible changes beat big-bang rollouts.

• Postmortems are blameless and written down.

• We push back on each other. If you only execute, you will be unhappy

here.

How to apply

Send:

• Your resume.

• A short note (≤200 words) describing one infra or automation problem you

solved, and how AI agents factored in — or did not, and why. We read these. Generic

notes get rejected.

Internal note — delete before posting externally

• Comp band, location policy, team name, and reporting line marked

[CONFIRM] need to be filled in before this goes external.

• The Required list is intentionally tight: CI/CD and Kubernetes basics

promoted from bonus. Expect this to filter ~80% of typical junior DevOps applicants. The

remaining pool will skew toward people who have actually shipped infra at a startup, not

bootcamp grads or pure cloud-cert holders.

• IaC, observability, agent-building, and GPU/ML serving stay as bonus.

Promoting any of these to required at 0–3 yrs collapses the pool to near-zero or forces

hiring senior people at junior comp. If you want IaC required, re-level this to mid (3–5

yrs) and raise the band.

• Screening implication: the resume screen should explicitly check for

CI/CD pipeline authorship and any K8s-touching production work. If neither is on the

resume, reject at screen. Do not waste interview slots.

• Pipeline watch: if fewer than ~15 qualified resumes after 2 weeks of

active sourcing, the first thing to relax is the AI-agent-fluency bar (move to bonus and

screen for it in interview instead). Do not relax the "shipped to production" requirement

— that is the load-bearing filter.

Read more
Mlops Solutions Pvt Ltd
Sowmyasri Parthi
Posted by Sowmyasri Parthi
Bengaluru (Bangalore)
5 - 10 yrs
₹1L - ₹37L / yr
skill iconSpring Boot
skill iconJava
skill iconReact.js
skill icon.NET
Spring MVC
+8 more

Senior Software Engineer

Responsibilities:

• Lead by the principle of "customer first" to analyse, debug, develop, and maintain customer-centric software.

• Collaborate closely with multidisciplinary teams to analyse, debug and fix issues with high quality code, zero regressions, scalable, innovative technical solutions.

• Optimizing components for maximum performance and scalability

• Participates in R&D, Proof of Concepts, Prototyping, Code review, Root Causing, etc.

• At least 2-3 yrs of experience in taking full ownership of software development lifecycle including planning, design, architecture, development, test & deployment. And 2+ years of experience in supporting production or customer issues and escalations.

• Review and analyze support tickets that are complex in nature and require more technical knowledge to analyze. Investigate issues to identify root causes and document findings clearly.

• Influences the development practices so that they follow best practices, policies, and procedures.

• Ensure software products meet all non-functional requirements including operational and security needs.

• Excellent verbal and written communication skills, problem solving skills.

• Address complex technical challenges within software systems, ensuring robustness, compliance, and customer satisfaction.

• Contribute to knowledge base

• Support the Lead and Mentor the team of software engineers and own the technical health of the service the team is working on.


Requirements:

• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.

• Minimum of 5 years of professional experience in Java development with expertise in core Java, JDK, data structures, and multithreading.

• Strong analytical skills with experience in root cause analysis and fixes • Strong experience with Spring and Spring Boot frameworks.

• Strong understanding of software design principles, architecture, and best practices

• Familiarity with server technologies, including Tomcat and WebLogic.

• Proficiency in working with relational databases such as Oracle and PostgreSQL

• Experience with messaging queues, particularly JMS MQ or Artemis MQ.

• Possess exceptional debugging and troubleshooting skills to resolve complex issues across the entire, cross-functional technology stack

• Awareness / exposure to debugging Frontend applications (ReactJS / .Net) is a good to have.

• Exposure to Kubernetes, containerization and cloud (AWS) technologies in building scalable, resilient, and distributed environments.

• Excellent problem-solving skills and the ability to work in a fast-paced and in production customer sensitive environments.

• Strong communication and collaboration skills. Quick learner.

• Experience and knowledge in working in the HRP application – specifically the Claims functionality. Added experience in other areas including Enrollment, Billing, Financials is a plus. Candidate must have worked previously in HRP

• Familiarity with ticketing systems (Jira, SalesForce) and production support workflows.

Read more
NeoGenCode Technologies Pvt Ltd
Akshay Patil
Posted by Akshay Patil
Bengaluru (Bangalore)
5 - 12 yrs
₹10L - ₹30L / yr
skill iconNodeJS (Node.js)
skill iconJavascript
TypeScript
skill iconExpress
Sails.js
+10 more

Job Title : Senior Node.js Developer (SDE 2)

Experience : 5+ Years

Location : Bengaluru (Whitefield)

Work Mode : Hybrid (3 Days WFO)

Openings : 2

Notice Period : Immediate–20 Days Preferred


Role Overview :

We are looking for an experienced Senior Node.js Developer (SDE 2) professional with strong expertise in building scalable backend applications, microservices, and distributed systems in Agile environments.


Mandatory Keywords :

Node.js, JavaScript, TypeScript, GraphQL, Microservices, Cloud, REST APIs, Docker, Kubernetes, Redis, System Design.


Key Responsibilities :

• Design, develop, and deploy scalable backend systems using Node.js & TypeScript.

• Build REST APIs and GraphQL services.

• Develop secure, reusable, and high-performance applications.

• Work on microservices, cloud platforms, and production support.

• Follow best practices including TDD, testing, and DevOps processes.

• Mentor team members and contribute to technical excellence.


Mandatory Skills :

• 5+ years of software development experience.

• Strong hands-on experience in Node.js, JavaScript, TypeScript.

• Experience with Express.js/Koa/Sails.js.

GraphQL, REST APIs, Microservices

AWS/Azure/GCP Cloud

Docker, Kubernetes, Redis

• Strong understanding of distributed systems, Git, and DevOps.


Good to Have :

Kafka, RabbitMQ, Apollo Federation, New Relic, Datadog, Splunk, React.js/Next.js, Jest/Mocha/Cucumber


Interview Process :

  • L1 – Technical
  • L2 – Coding
  • L3 – System Design (HLD + LLD)
Read more
Redtring
Keshav Senthil
Posted by Keshav Senthil
Hyderabad
3 - 6 yrs
₹20L - ₹25L / yr
skill iconJava
skill iconKotlin
skill iconAmazon Web Services (AWS)
skill iconRedis
Apache Kafka
+7 more

About Us:


We are hiring for a pre seed funded startup called Zeromoblt (https://zeromoblt.com/), a high-agency Hyderabad-based startup revolutionizing student transportation with lean, intelligent tech stacks.


Our mission: architect world-class systems from scratch—fast, scalable, and algorithmically sharp—using Kotlin, React, AWS (EC2, IoT, IAM), Google Maps, and multi-cloud setups. Stealth mode operations mean you're building 0→1 products with founders, not fixing tickets.


What You'll Do

  • Lead end-to-end ownership of complex systems: design, build, deploy, monitor, and iterate at scale.
  • Architect high-performance backends in Kotlin (or JVM langs) that handle real-time routing and IoT data.
  • Craft scalable React UIs that power ops dashboards and parent-facing apps.
  • Drive cloud decisions across AWS, Azure/GCP—optimising costs for our bootstrap runway.
  • Apply DSA/system design to solve hard problems like dynamic route optimization and predictive scaling.
  • Shape the engineering roadmap: propose, prioritise, and ship features with founders.
  • Mentor juniors while executing solo on high-impact bets—no layers, just results.


We're Looking For

  • 3-6 years of hands-on engineering where you've owned and shipped production systems (prove it with code/stories).
  • Elite CS fundamentals: advanced DSA, system design (distributed systems a must), design patterns.
  • Mastery of Kotlin/Java + modern React; real AWS experience (EC2, IAM, CLI—you know our stack).
  • Proven "leap-taker": startup grit, side projects, or open-source that screams hunger.
  • Figure-it-out velocity: you thrive in chaos, learn our domain overnight, and deliver 10x faster than peers.


This Role Is Not For You If…

  • You need structured roadmaps, PM hand-holding, or big-tech process.
  • Comfort > impact: stable salary over equity upside and chaos.
  • You've never worn all hats (dev, ops, product) in a resource-constrained environment.


Why Join Us

  • Massive ownership: lead tech for 10k+ students, direct founder access, shape ZeroMoblt's scale.
  • Flat, high-trust team: flexible Hyderabad/remote, no bureaucracy.
  • Hungry culture: we hire hustlers scaling from 700 to 10k students your wins are visible daily.
  • Hungry to Leap? Apply now!
Read more
MyOperator - VoiceTree Technologies

at MyOperator - VoiceTree Technologies

1 video
2 recruiters
Vijay Muthu
Posted by Vijay Muthu
Remote only
3 - 6 yrs
₹15L - ₹20L / yr
skill iconAmazon Web Services (AWS)
Amazon CloudWatch
skill icongrafana
prometheus
skill iconKubernetes
+3 more

About MyOperator

MyOperator is a Business AI Operator platform that enables businesses, teams, and AI agents to work together seamlessly for customer operations such as Sales, Support, Escalations, Feedback, and Refund processes. With 12,000+ businesses using our platform, we operate at meaningful scale and power mission-critical communication workflows including voice bots, WhatsApp automation, and intelligent call routing.


We are building for reliability, speed, and impact. MyOperator values ownership, critical thinking, and execution. This is a high-expectation, high-learning environment where engineers are empowered to solve complex problems and build systems that directly affect customer outcomes.


Role Overview

We are looking for a skilled and proactive Site Reliability Engineer (SRE) to take end-to-end ownership of production reliability, observability, and performance engineering across MyOperator’s AI-powered communication infrastructure.


This role is not operational-only — it requires strong system design thinking, deep troubleshooting ability, and a production ownership mindset. You will define reliability standards, build observability frameworks, lead incident response, and drive SLO-based engineering practices across distributed AWS and Kubernetes environments.


Key Responsibilities

  • Own production reliability, uptime, latency, and error budgets across critical services.
  • Design and manage production-grade monitoring using Grafana, VictoriaMetrics (Prometheus), and PromQl, AWS CloudWatch.
  • Define and enforce SLIs, SLOs, and SLA thresholds for AI communication systems (voice bots, WhatsApp APIs, call routing).
  • Build real-time operational dashboards for incident response, capacity planning, and leadership visibility.
  • Implement end-to-end distributed tracing using OpenTelemetry (OTEL Collector).
  • Design and maintain centralized logging with strong correlation between logs, metrics, and traces.
  • Create SLO-based alerting systems with minimal noise and fast incident detection.
  • Lead incident response lifecycle: alert triage, mitigation, RCA documentation, and preventive improvements.
  • Drive MTTR reduction through structured monitoring, automation, and reliability engineering practices.
  • Monitor and troubleshoot AWS EKS (Kubernetes) production workloads.
  • Instrument and monitor LLM API integrations, AI inference pipelines, and messaging systems.
  • Analyze logs using OpenSearch / ELK for anomaly detection and root cause identification.
  • Automate operational workflows using Python or Bash to eliminate manual toil.
  • Drive performance optimization, scalability improvements, and capacity planning.
  • Collaborate with engineering teams to instrument new services from day one.

Required Skills & Qualifications

  • 3–6 years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
  • Hands-on experience with:
  • VictoriaMetrics / Prometheus (time-series monitoring)
  • Grafana dashboards and visualization
  • PromQL for writing complex queries and alerts
  • Experience implementing distributed tracing using OpenTelemetry (Mandatory).
  • Strong experience with centralized logging systems (ELK / OpenSearch / Loki).
  • Experience with alerting frameworks such as Alertmanager or Grafana Alerts.
  • Strong understanding of SLIs, SLOs, SLA design, and reliability engineering principles.
  • Hands-on experience managing AWS production workloads (EC2, RDS, ELB, CloudWatch, IAM).
  • Experience with Kubernetes (AWS EKS preferred).
  • Familiarity with CI/CD pipelines and automation tools.
  • Good understanding of Linux systems, networking, and cloud infrastructure.
  • Experience handling production incidents and participating in on-call rotations.
  • Ability to automate operational tasks using Python or Bash.

Good to Have

  • Experience with OpenSearch / ELK log pipelines and anomaly detection.
  • Kubernetes monitoring (pod health, node metrics, autoscaling behavior).
  • CI/CD observability integration (Jenkins, GitHub Actions).
  • Experience in monitoring LLM APIs and AI inference pipelines.
  • Familiarity with MLOps or AI observability tools (Arize, WhyLabs, etc.).
  • Service mesh exposure (Istio).
  • Infrastructure as Code (Terraform, CloudFormation).
  • Experience with chaos engineering or load testing tools.
  • Multi-cluster or multi-region architecture exposure.

Key Expectations

  • Ownership of production systems and high availability.
  • Strong troubleshooting and debugging skills.
  • Focus on automation and reliability improvements.
  • Proactive approach to incident prevention.
  • Ability to reduce alert noise and improve signal quality.
  • Data-driven approach to reliability engineering.

This Role Is Not For

  • Candidates with purely development experience and no production ownership.
  • Candidates without real incident response or on-call experience.
  • Freshers or candidates with less than 3 years of experience.


Read more
OpsTree Global
Pragati Srivastava
Posted by Pragati Srivastava
Bengaluru (Bangalore), Mumbai
4 - 9 yrs
₹20L - ₹25L / yr
Google Cloud Platform (GCP)
skill iconKubernetes
Terraform
Monitoring
prometheus
+3 more

Immediate Hiring: GCP DevOps Engineer | Mumbai & Bengaluru (On-site)


OpsTree Global is urgently hiring a GCP DevOps Engineer with 4–9 years of experience for immediate requirements in Mumbai and Bengaluru.


Key Skills

  • Google Cloud Platform (GCP)
  • Terraform / Infrastructure as Code (IaC)
  • Kubernetes & Helm Charts
  • CI/CD – Jenkins, GitLab CI, GitHub Actions
  • Linux Administration
  • Scripting – Python / Go / Java

Role Responsibilities

  • Build and manage scalable cloud infrastructure on GCP
  • Automate deployments and infrastructure provisioning
  • Ensure system reliability, monitoring, and performance optimization
  • Collaborate with development and operations teams for seamless delivery


📍 Locations: Mumbai & Bengaluru (On-site)

⚡ Immediate Joiners Preferred

💼 Experience: 4–9 Years

Read more
Searce Inc

at Searce Inc

3 recruiters
Mohammed Rabidheen
Posted by Mohammed Rabidheen
Coimbatore
5 - 10 yrs
Best in industry
Microsoft Windows Azure
skill iconKubernetes
Terraform
Observability
Reliability engineering

About Searce

Searce (pronounced 'search') is a global, AI-native, and engineering-led modern technology consultancy. Founded in 2004 with a vision to "solve for better," we partner with organizations to "futurify" their businesses by leveraging the full power of Cloud, AI, and Data Engineering.

With a presence across 10+ countries—including the US, India, Singapore, and Australia—Searce has evolved over two decades into a trusted technology partner for over 3,000 clients. We are not just a service provider; we are a group of "solvers-at-heart" who thrive on complex technical challenges.

Why Join the "Solvers" Brigade?

  • Award-Winning Excellence: In 2026, Searce was recognized as the Google Cloud Workplace AI Transformation Partner of the Year (APAC). We are a Premier Google Cloud Partner and a top-tier Managed Services Provider (MSP).
  • AI-First Mindset: We specialize in Applied AI (Generative & Conventional), Cloud Modernization, and Location Intelligence, helping industries from FinServ and Healthcare to Retail and Manufacturing reinvent themselves.
  • The "Futurify" DNA: We don't just maintain; we improve. We use our proprietary EVLOS business innovation framework to ensure our clients aren't just moving to the cloud, but are staying ahead of the curve.

Our Culture: The HAPPIER Values

We look for individuals who live and breathe our HAPPIER values:

  • Humble: We learn from everyone.
  • Adaptable: We embrace change as the only constant.
  • Positive: We focus on solutions, not just problems.
  • Passionate: We are obsessed with engineering excellence.
  • Innovative: We challenge the status quo.
  • Excellence: We deliver impactful, futuristic outcomes.
  • Responsible: We take ownership of our work and its impact.


Your Mission: The Role

solving for better.

You are a reliability-owning, hands-on solver. Not just a "break-fix engineer."

As a DRI (directly responsible individual) for our clients' most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact – improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.

You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today's production environments.


Your Responsibilities

what you will wake up to solve.

This isn’t a “manage tickets” role. You are the architect, the executioner and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:

  • Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You'll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
  • Engineer the Blueprint: You'll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
  • Automate with Intelligence: You'll lead the charge from the keyboard to futurify our operations. You'll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
  • Drive FinOps & Impact: You'll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
  • Be the Expert in the Room: You'll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
  • Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you'll help build a high-performing squad that lives the “Always Hardened” mindset.


Experience & Relevance

We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.

  • Cloud Operations Pedigree: 5+ years of experience in Azure cloud infrastructure, with a significant portion in cloud managed services. Hands-on experience in Kubernetes is mandatory.
  • Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
  • Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.


Functional Skills:

  • Service Delivery Mindset: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
  • Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
  • Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
  • Repository builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.
Read more
Egnyte

at Egnyte

4 recruiters
Bhavana Kapalganti
Posted by Bhavana Kapalganti
Remote only
5 - 10 yrs
Best in industry
Performance Testing
skill iconKubernetes
Dynatrace
HP LoadRunner
skill iconDocker
+2 more

ABOUT EGNYTE


Egnyte is the secure multi-cloud platform for content security and governance that enables organizations to better protect and collaborate on their most valuable content. Established in 2008, Egnyte has democratized cloud content security for more than 23,000 organizations, helping customers improve data security, maintain compliance, prevent and detect ransomware threats, and boost employee productivity on any app, any cloud, anywhere. For more information, visit www.egnyte.com.


Egnyte is looking for a Performance Engineer to join our performance engineering team. As a Performance Engineer, you will drive proactive monitoring and improve automation for regular operational tasks.


WHAT YOU’LL DO:


  • Develop tools to measure & monitor performance bottlenecks within the application
  • Triage reported performance issues and translate them into reproducible test scenarios
  • Collaborate with production Engineering and application engineering teams to design and execute production scenarios that will assess code and 3rd party performance
  • Develop and run PSR tests and measure stats for them
  • Work with various sub teams to ensure SLA of their core apis are tracked and maintained release over release
  • Application and architecture code profiling
  • Infrastructure and application performance tuning
  • Troubleshooting performance issues
  • Ability to distill volumes of data, analyze performance results, and diagnose performance problems
  • Capacity estimating, modeling, or planning


YOUR QUALIFICATIONS:


  • Bachelor’s degree in computer science or related field. Advanced degree preferred
  • 5+ years of work experience in performance engineering
  • Expert knowledge and strong experience using tools, LoadRunner/JMeter etc. and understanding of APM solutions like Grafana, AppDynamics, Dynatrace etc
  • Experience with microservice architecture, Docker, Kubernetes, Jenkins, Azure, GCP and application monitoring tools
  • Strong expertise on monitoring and analyzing application logs, database reports, system metrics like CPU Utilization, Memory usage, Network usage, Garbage Collection and DB Parameters
  • Strong expertise on identifying potential performance issues and providing recommendations to improve performance
  • Proficiency in JVM technology and JVM troubleshooting skills
  • Proficiency in debugging application in production at a large scale
  • JVM Profiling, GC Analysis and Tuning experience
  • Experience with Performance, Load, Stress, and Scalability Testing
Read more
Virtana

at Virtana

3 candid answers
2 recruiters
Krutika Devadiga
Posted by Krutika Devadiga
Pune
5 - 9 yrs
Best in industry
Google Cloud Platform (GCP)
DevOps
Shell Scripting
skill iconPython
skill iconKubernetes
+11 more

Role Overview:

Virtana is looking for a Senior DevOps Engineer to join our R&D Infrastructure team. In this role, you won't just follow conventions — you'll help redefine them. You will own the architecture, build, and day-to-day operations of the GCP-based cloud platform that powers Virtana's SaaS products and the AI-driven observability experience our Global 2000 customers depend on. This is a hands-on senior individual contributor role with meaningful technical leadership scope, working alongside engineers and architects on a unified observability platform.


Work Location: Pune


Job Type: Hybrid


Role Responsibilities:

  • GCP Cloud Operations: Develop, deploy, operate, and support production cloud infrastructure primarily on GCP — leveraging GKE, BigTable, BigQuery, Dataflow, Cloud Storage, IAM, and core networking services.
  • Reliability & SLAs: Ensure production systems are running at all times with multiple levels of redundancy to meet committed SLAs; lead incident response, root cause analysis, and post-incident reviews.
  • Build & Release Automation: Design, implement, and continuously improve scalable CI/CD pipelines and test frameworks leveraged by QA and development teams across the company.
  • Infrastructure as Code: Manage large-scale, repeatable deployments using Terraform, Ansible, Puppet, or SaltStack; champion Git-based workflows and version control standards for distributed engineering teams.
  • Security & Availability: Maintain the ongoing maintenance, security, patching, and availability of services in line with tight operations, security, and procedural models.
  • Monitoring & Alerting: Plan and deliver high-value monitoring and alerting features to support operations, support, and customer-facing reliability — eating our own dog food with the Virtana Platform wherever possible.
  • Capacity & Cost: Forecast capacity, plan upgrades, patches, and migrations, and drive cloud cost efficiency across hybrid and multi-cloud environments.
  • Cross-Functional Partnership: Work with development, operations, and support personnel to identify, isolate, and diagnose issues; handle support escalations and drive permanent fixes.


Required Qualifications:

  • Bachelor's degree in Computer Science / Engineering or equivalent relevant experience.
  • 5–7 years of professional hands-on DevOps / SRE experience supporting production cloud environments.
  • Strong, demonstrable production experience on GCP — including GKE, BigTable, BigQuery, Dataflow, IAM, and core GCP networking services.
  • Deep, hands-on expertise with container orchestration (Kubernetes) and Docker in production.
  • Advanced proficiency with at least one infrastructure-as-code / configuration management tool: Terraform, Ansible, Puppet, or SaltStack.
  • Solid understanding of networking, firewalls, load balancers, DNS, and database operations.
  • Strong working knowledge of Git-based workflows and version control standards for distributed engineering teams.
  • Comfort operating hybrid environments that include both Linux and Windows ecosystems.
  • Excellent verbal and written communication skills, with the ability to explain highly technical topics to both technical and non-technical audiences.
  • Self-motivated, detail-oriented, and able to work both independently and within a globally distributed team.


Good to Have:

  • Strong scripting skills and a demonstrated ability to automate operational toil — Python preferred; Bash, Go, or Groovy a plus.
  • Hands-on experience designing and operating CI/CD pipelines with Jenkins (Spinnaker, GitHub Actions, or GitLab CI also welcome).
  • Exposure to AWS or other public clouds in addition to GCP.
  • Experience operating SaaS platforms built on microservices architectures.
Read more
Get to hear about interesting companies hiring right now
Company logo
Company logo
Company logo
Company logo
Company logo
Linkedin iconFollow Cutshort
Why apply via Cutshort?
Connect with actual hiring teams and get their fast response. No spam.
Find more jobs
Get to hear about interesting companies hiring right now
Company logo
Company logo
Company logo
Company logo
Company logo
Linkedin iconFollow Cutshort