Kubernetes jobs

50+ Kubernetes Jobs in India

Apply to 50+ Kubernetes Jobs on CutShort.io. Find your next job, effortlessly. Browse Kubernetes Jobs and apply today!

HireTo

at HireTo

1 video
Anshul Saxena
Posted by Anshul Saxena
Hyderabad
8 - 10 yrs
₹25L - ₹40L / yr
TypeScript
NodeJS (Node.js)
React.js
Redis
PostgreSQL
+6 more


Senior Software Engineer


  • Minimum 5 years in TypeScript and a total of at least 8 years of experience
  • Excellent communication skills. English Language fluency is Mandatory.


We are looking for strong backend senior engineers

  • 8+ years experience as Senior Software Engineer
  • Have worked in medium- to big-sized companies for at least 3 years (companies with 50+ developers)
  • Strong architecture and planning skills
  • Strong AI coding skills (fluency in developing agents and skills) and regular use of them in day-to-day development
  • Distributed programming knowledge
  • TypeScript, PostgreSQL, and Kubernetes knowledge.


Ways of working

The engineers will be led and managed by a UK Multinational Company engineering manager (initially one currently based in Europe), and we will work on hiring one in Asia if we succeed in putting the team together.

Engineers will have a direct contract with UK Multinational Company as career employees (entitled to company benefits and career progression) and will be trained in the UK Multinational Company product portfolio and work on several different projects.

We work in an environment where each engineer is presented with a business problem (or user story) and must come up with a detailed plan for tackling it, then explain it to the team and get approval, which we do during the refinement meetings. In summary, we do not tell the engineers what to do; our expectation of a Senior engineer is that they tell us what to do (they will be trained, of course, in the best practices and technical specs of our environment).

Hiring process

  • Hiring manager interview
  • System Design interview
  • Coding (with AI) interview


Technogise Private Limited
Vandana BM
Posted by Vandana BM
Bengaluru (Bangalore)
12 - 18 yrs
Best in industry
Python
React.js
Docker
Kubernetes
Amazon Web Services (AWS)
+1 more

How do Technogisers function?


Value: Exploring technologies and implementing them on the projects provided they make business sense and deliver value.

Engagement: Be it offshore or onshore, we engage ourselves daily with the clients. This assists in building a trustworthy relationship at the same time, collaborating to come up with strategic solutions to business problems.

Solution: We are involved in providing hands-on contributions towards Backend & Front-end design and development at the same time, flourishing our DevOps culture.

Thought Leadership: Attend or present technical meet-ups/workshops/conferences to share knowledge and help build Technogise brand.

Note: All our roles are customer-facing roles.


This is a full-time, 5-days-a-week work-from-office role as a Technology Consultant (Lead), located in Bangalore (Indiranagar).


Core Skills:

  • We are looking for a Lead engineer with industry experience specifically in Python, React, AWS, Kubernetes, and Docker, and with an AI-first delivery mindset.
  • You are also an advocate of good engineering practices
  • Influence technical decision-making and high-level design decisions - choice of frameworks and tech approach
  • Demonstrate the ability to understand different approaches for application and integration, and influence decisions by making appropriate trade-offs


Ways of working:

  • You communicate effectively with other roles in the project at the team and client levels
  • You drive discussions effectively at the team and client levels. Encourage others to participate


Going beyond:

  • Establish credibility within the team as a result of technical and leadership skills
  • Mentoring fellow team members within the project team and providing technical guidance to others beyond project boundaries
  • Actively participate in organisational initiatives


Recruiting Bond

at Recruiting Bond

2 candid answers
Pavan Kumar
Posted by Pavan Kumar
Bengaluru (Bangalore)
7 - 12 yrs
₹70L - ₹110L / yr
Platform as a Service (PaaS)
Platform Engineering
Agentic AI
AI Agents
Model Context Protocol (MCP)
+41 more

About My Client Company

We're building the learning infrastructure that transforms AI agents into true digital workers. While today's agents can reason and plan, they fail to do meaningful work because they lack real experience operating in apps. My Client Product gives agents continuously improving, reusable skills across 1000+ production-grade app connectors including Gmail, Linear, and Hubspot. We handle authentication, tool routing, retries, failure handling, and observability, making every action safe and dependable.


About the Role

Every enterprise is racing to make AI work — not as a demo, but as infrastructure that runs their business. My Client Product is becoming the critical layer that makes this possible: the platform that connects AI agents to 250+ real-world applications with production-grade auth, execution, and reliability.

We've built this for the cloud. Now we need to build it for the enterprise — and that means rethinking the platform from the ground up with the right abstractions, primitives, and architectural decisions that let us serve a massive, diverse set of enterprise customers without bespoke engineering for each one. This is a founding role.


Your Impact

  • Agent infrastructure platform: The foundational layer that enterprise AI agents run on — governance, observability, and control planes for MCP-powered agent ecosystems. You'll define how organizations monitor, audit, and manage AI agents operating at scale across their systems
  • The integration gateway: The secure, reliable bridge between an enterprise's AI agents and the outside world — every SaaS tool, internal system, and API they need to act on. Not just connectors, but a platform-grade gateway with the right trust, permissioning, and routing primitives
  • Platform primitives for scale: Multi-tenancy, isolation, configuration, and extensibility abstractions that let Composio serve thousands of enterprise customers without linear engineering cost
  • Enterprise-grade architecture: Deployment flexibility, security, and compliance as first-class platform capabilities — not bolted-on afterthoughts
  • The repeatable deployment motion: Turn enterprise onboarding from a services engagement into a product experience. Shorter cycles, fewer custom touches, more self-serve


What you bring

  • You've built platforms at genuine scale — not just high user counts, but high complexity: many customer types, deployment models, and integration surfaces
  • You think in abstractions and primitives. Your instinct is to find the right foundational model, not to solve each problem individually
  • You've shipped enterprise product capabilities (deployment flexibility, security, admin tooling, compliance) and understand them as product problems, not just checkboxes
  • You've built or shipped an AI product — or you're the person who can't stop tinkering. You're building agents on weekends, stress-testing the latest models, experimenting with MCP, and forming your own opinions on where agent architectures are headed. You have a point of view on this space, not just a resume line
  • You're a force multiplier. When you join a team, the entire product moves faster because the platform decisions are right


Skills & Expertise

Platform Engineering, AI Infrastructure, Agentic AI, AI Agents, MCP (Model Context Protocol), Distributed Systems, Enterprise Architecture, Multi-Tenant Architecture, Backend Platform Engineering, Enterprise SaaS, API Platform Engineering, Integration Platforms, SaaS Connectors, Cloud Infrastructure, AWS, GCP, Kubernetes, Docker, Terraform, Microservices, Event-Driven Architecture, API Gateway, OAuth 2.0, RBAC, IAM, Observability, OpenTelemetry, Prometheus, Grafana, Reliability Engineering, SRE, Python, Golang, Node.js, TypeScript, REST APIs, GraphQL, AI Orchestration, LLM Infrastructure, LangChain, LangGraph, OpenAI APIs, Claude APIs, RAG, Workflow Automation, AI Tool Routing, Enterprise Security, Compliance Engineering, Deployment Architecture, Configuration Management, Extensible Systems, Scalability Engineering, High-Scale Systems, Technical Strategy, Platform Primitives, Developer Platforms, Enterprise Integrations, Infrastructure Engineering, Founding Engineer Mindset.


This role demands deep platform thinking. You've designed systems where the abstractions were the product — where getting the primitives right meant the difference between a product that scales and one that drowns in customer-specific code.


You've done this within large organizations and seen what "enterprise-grade" actually means when thousands of teams depend on your platform. But you've also operated in environments where you had to build fast, make tradeoffs, and ship before the architecture was perfect.


The combination matters. Big-company pattern recognition with small-company intensity.


What We Offer

  • Lunch and dinner are provided in the office
  • $200/month learning and development budget
  • $1,000/month AI tool experimentation budget to automate, accelerate, and improve how you work
  • High-ownership role with direct exposure to leadership and company-building decisions
  • Competitive salary and equity


Searce Inc

at Searce Inc

3 recruiters
Reena Bandekar
Posted by Reena Bandekar
Pune
4 - 9 yrs
₹10L - ₹26L / yr
DevOps
Kubernetes
Reliability engineering
Network Security
Amazon VPC
+4 more

Lead Cloud Reliability Engineer


Job Responsibilities

● Lead and manage the Cloud Reliability teams to provide strong Managed Services support to end-customers.

● Isolate, troubleshoot and resolve issues reported by CMS clients in their cloud environment

● Drive the communication with the customer providing details about the issue, current steps, next plan of action, ETA

● Gather client's requirements related to use of specific cloud services and provide assistance in setting them up and resolving issues

● Create SOPs and knowledge articles for use by the L1 teams to resolve common issues

● Identify recurring issues, perform root cause analysis and propose/implement preventive actions

● Follow change management procedure to identify, record and implement changes

● Plan and deploy OS, security patches in Windows/Linux environment and upgrade k8s clusters

● Identify the recurring manual activities and contribute to automation

● Provide technical guidance and educate team members on development and operations. Monitor metrics and develop ways to improve.

● System troubleshooting and problem-solving across platform and application domains. Ability to use a wide variety of open-source technologies and cloud services.

● Build, maintain, and monitor configuration standards.

● Ensure critical system security by using best-in-class cloud security solutions.


Qualifications

● 4-7 years experience in Cloud Infrastructure and Operations domains and IT operational experience preferably in a global enterprise environment.

● Specialize in one or two cloud deployment platforms: AWS, GCP

● Hands on experience with AWS/GCP services (EKS, ECS, EC2, VPC, RDS, Lambda, GKE, Compute Engine)

● Understanding of one or more programming languages (Python, JavaScript, Ruby, Java, .Net)

● Logging and Monitoring tools (ELK, Stackdriver, CloudWatch)

● Knowledge on Configuration Management tools such as Ansible, Terraform, Puppet, Chef

● Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Mesos)

● Good analytical, communication, problem solving, and learning skills.

● Knowledge on programming against cloud platforms such as Google Cloud Platform and lean development methodologies.

● Strong service attitude and a commitment to quality.

● Willingness to work in shifts.

Pune
4 - 12 yrs
₹5L - ₹20L / yr
Java
Spring Boot
Hibernate (Java)
Kubernetes
RESTful APIs
+3 more

Job Title: Lead Software Engineer

Experience: 4 - 12 yr

Department: Software

Reports To: Senior Software Engineer / Software Architect



Purpose of the Role

The incumbent will be responsible for designing and developing robust software solutions for products in the domains of Warehouse Automation, Industrial Automation, Robotics, and IoT. The role includes defining software architecture, ensuring scalability and performance, and mentoring the development team to drive technical excellence and innovation.


Technical Skills Required

  • Proven experience in designing, developing, and deploying high-volume, scalable applications.
  • Expertise in distributed systems, microservices, and central system architectures.
  • Programming & Frameworks: Proficiency in Java 17+.
  • Experience with frameworks such as Spring and Hibernate, along with Kubernetes and RESTful APIs.
  • Knowledge of JPA, MS SQL, and database modelling/design.
  • Hands-on experience with GCP, AWS, or Azure for cloud architecture.
  • Familiarity with virtualization and containerization technologies.
  • Strong skills in data modelling and database design.
  • Knowledge of secure coding practices.
  • Tech stack: Java, MSSQL, MySQL, Spring Boot, Redis, Data Structures, Linux, basics of Kubernetes.


Behavioural Skills Required

  • Attention to Detail (Proficient)
  • Problem Solving
  • Decision Making
  • Collaborative approach
  • Adaptability to a volatile environment
  • Accountability
  • Good Leadership skills


Job Responsibilities

  • Understand requirements and define database and application structure under guidance of Software Architect.
  • Write high-quality, scalable, and efficient code.
  • Prepare Functional Requirement Documents (FRD) based on inputs from BA team.
  • Guide junior and mid-level developers and provide technical support.
  • Collaborate to identify and fix technical issues in UAT/Production.
  • Work closely to meet project deadlines.
  • Take ownership of product implementations at customer sites.
  • Hands-on development for assigned modules/products.
  • Handle application performance in production.
  • Work with customers to understand automation requirements.
  • Review and merge code changes from the team.
  • Conduct sprint meetings, demos, and resolve development roadblocks.
  • Optimize code for performance and efficiency.
TalentWeave
Dudekula Lakshmi
Posted by Dudekula Lakshmi
Pune, Cognizant office is available
8 - 10 yrs
₹11L - ₹15L / yr
PowerBI
Python
Spark
Amazon Web Services (AWS)
ETL
+3 more

Role : AWS Data Engineer

Location : Anywhere in India where a Cognizant office is available

Contract duration : 12 months contract

Total Experience : 8-10 years

Budget : 15 LPA

Relevant Experience : 5+ years with required skills & data engineering

Client : Cognizant


Job description : 


Python


Spark


Gradle 


AWS Services (e.g. S3, Athena, Redshift, Transfer, SNS, SQS, EventBridge, Lambda, Glue Data Catalog, RDS, EC2, IAM, Flink)


Kubernetes


Argo


Kafka / Kinesis streaming


SQL


ETL Data Pipelines


Data Modelling


Power BI/ Any reporting tools


New Relic / Terraform


Operational support - Batch monitoring, root cause analysis and fix

Bengaluru (Bangalore)
4 - 8 yrs
₹12L - ₹17L / yr
Azure
Grafana
Scripting
Prometheus
CI/CD
+4 more

Location: Bangalore 

Experience: 4-8 years

Interview Process: Two rounds. The first round is virtual; the second round is face to face in Bangalore.


Key Skills Required

☁️ Cloud & Infrastructure

  • Strong hands-on experience with AWS Cloud Services
  • Proficiency in Terraform for Infrastructure as Code (IaC)
  • Experience in managing scalable cloud environments

⚙️ Containerization & Orchestration

  • Solid experience in Kubernetes (K8s) for container orchestration
  • Understanding of microservices architecture

🔄 CI/CD & DevOps

  • Hands-on experience with Azure DevOps (CI/CD pipelines)
  • Experience in build, release, and deployment automation

📊 Observability & Monitoring

  • Strong experience with Prometheus & Grafana
  • Expertise in setting up alerts, dashboards, and monitoring system health

🔐 API Gateway & Security

  • Experience with Kong or equivalent API Gateway
  • Understanding of API security controls (authentication, rate limiting, policies)

🧠 Core Technical Competencies

  • Strong Linux troubleshooting and system debugging skills
  • Proficiency in scripting (Bash / Python / Shell)
  • Understanding of networking concepts: TCP/IP, HTTP, DNS, Load Balancing
  • Experience with system architecture and distributed systems

🚨 SRE Responsibilities

  • Monitor system performance, reliability, and availability
  • Handle incidents, perform troubleshooting, and conduct RCA
  • Automate operational tasks to improve efficiency
  • Build and maintain scalable, resilient infrastructure
  • Collaborate with development and DevOps teams for system improvements

🧪 Good to Have

  • Experience with Finacle operations
  • Exposure to API/load testing tools like JMeter or Gatling
  • Familiarity with logging tools like Loki

🤝 Soft Skills

  • Strong communication and collaboration skills
  • Ability to document processes and technical workflows clearly

🎯 Ideal Candidate

A hands-on SRE/DevOps Engineer with strong exposure to:

  • AWS + Terraform + Kubernetes
  • CI/CD (Azure DevOps)
  • Monitoring + API Gateway security 


Vivanet

at Vivanet

1 candid answer
Ashish Uikey
Posted by Ashish Uikey
Bengaluru (Bangalore)
10 - 12 yrs
Best in industry
Python
FastAPI
Django
Microservices
Distributed Systems
+4 more

Role Objective

We are seeking a Principal Python Engineer to lead the architecture, development, and scaling of our core backend systems. You will be responsible for defining technical standards, optimizing high-throughput distributed systems, and mentoring a team of senior developers. The ideal candidate has a deep mastery of the Python ecosystem and a proven track record of delivering enterprise-grade software.


Key Responsibilities

  • System Architecture: Design and implement scalable, resilient, and secure microservices architectures using Python.
  • Technical Leadership: Act as the final authority on technical decisions, code reviews, and architectural patterns.
  • Performance Optimization: Identify and resolve complex bottlenecks in high-concurrency environments (using asyncio, multiprocessing, etc.).
  • Infrastructure & Cloud: Oversee cloud-native deployments (AWS/Azure/GCP) using Docker, Kubernetes, and Terraform.
  • Database Strategy: Design complex schemas and optimize queries across Relational (PostgreSQL) and NoSQL (MongoDB, Redis, Cassandra) databases.
  • Mentorship: Lead by example through high-quality code and drive the professional growth of the engineering team.


Mandatory Skills & Requirements

  • Python Mastery: 10+ years of professional experience with deep knowledge of Python internals, memory management, and advanced OOP.
  • Backend Frameworks: Extensive experience with FastAPI, Django, or Flask in a production environment.
  • Distributed Systems: Proficiency with message brokers and task queues such as Kafka, RabbitMQ, or Celery.
  • Engineering Excellence: Strong command of TDD (Test Driven Development), CI/CD pipelines, and SOLID principles.
  • Security: Deep understanding of OAuth2, JWT, and web security best practices (OWASP).
  • Education: Bachelor’s or Master’s degree in Computer Science or a related engineering field.


Preferred / Bonus Skills

  • Polyglot Experience: Proficiency in Go, Rust, or Java for high-performance modules.
  • AI/ML Integration: Familiarity with integrating LLMs, LangChain, or PyTorch into production pipelines.
  • Data Engineering: Experience with Big Data tools like Apache Spark or Snowflake.


Top 5 Mandatory Skills

  1. Architectural Design: Ability to design end-to-end distributed systems from scratch.
  2. Concurrency Mastery: Expert-level use of asyncio and parallel processing.
  3. Database Optimization: Advanced PostgreSQL/NoSQL performance tuning.
  4. DevOps & Scalability: Hands-on experience with Kubernetes and scaling applications for millions of users.
  5. Technical Governance: Proven experience in setting coding standards and leading large-scale code reviews.
Wissen Technology

at Wissen Technology

4 recruiters
Shakthi M
Posted by Shakthi M
Bengaluru (Bangalore), Mumbai, Pune
5 - 14 yrs
Best in industry
Java
Kubernetes
JMS/EMS

Strong Java Developer with hands-on experience in building, deploying, debugging, and supporting Java applications on Kubernetes-based container platforms. 

Focus is on application development and deployment support rather than Kubernetes administration or security management. Experience with JMS/Messaging systems is preferred.

Primary skills required:

  • Core Java
  • Kubernetes (Application Deployment & Execution)
  • Docker & Containerization
  • Build & Deployment Support
  • Scripting (Python is acceptable in place of JavaScript)

Certified Kubernetes Application Developer (CKAD) certification would be an added advantage.

Vivanet

at Vivanet

1 candid answer
Ashish Uikey
Posted by Ashish Uikey
Pune
7 - 12 yrs
Best in industry
Apache Kafka
Neo4J
Issue resolution
Python
Data engineering
+7 more

Senior Data Engineer

• Data Engineering • Streaming Pipelines • Graph Databases • Entity Resolution


Role at a Glance

  • Level: Lead / Senior (Individual Contributor / Team Lead Track)
  • Experience: 7 - 10 years of relevant professional experience
  • Location: Remote (Pune Based Preferred)
  • Employment Type: Contract
  • Industry Preference: Any (Healthcare preferred - Payer / Provider experience strongly preferred)


About the Role

We are looking for a Senior Data Engineer with deep expertise in streaming architectures, graph database platforms, and large-scale data pipeline engineering. This is a high-ownership, hands-on role that sits at the intersection of real-time data infrastructure, entity resolution, and multi-database system design.


You will architect and build pipelines that drive a complex, multi-layered data platform - ingesting from diverse upstream sources, resolving entities at scale, and keeping graph, relational, search, and caching layers in sync. You will work closely with data architects, AI engineers, and product teams to deliver reliable, high-performance data infrastructure that powers downstream analytics and intelligent applications across any domain.


Key Responsibilities


Streaming & Ingestion Architecture

•Design and build production-grade CDC (Change Data Capture) pipelines using Apache Kafka, consuming events from PostgreSQL, SQL Server, and other RDBMS sources into a centralised knowledge graph.

•Architect multi-source ingestion connectors supporting schema evolution, backpressure handling, and at-least-once delivery guarantees across heterogeneous data sources.

•Configure and govern Confluent Schema Registry with Avro / Protobuf schemas across all Kafka topics; enforce backward and forward compatibility standards.

•Design micro-batch and streaming ETL/ELT workflows using Apache Spark or equivalent frameworks for bulk initial loads and ongoing incremental refresh patterns.

•Manage messaging workflows where required; define routing, dead-letter, and retry strategies appropriate to each integration pattern.


Graph Database Engineering

•Design, build, and optimise graph data models on a production graph database platform; Neo4j is preferred but experience with Amazon Neptune, ArangoDB, TigerGraph, or equivalent graph databases is valued.

•Author complex graph queries and traversal patterns - Cypher (Neo4j), Gremlin (Neptune/TinkerPop), or SPARQL - for both operational and analytical use cases.

•Own ingestion-side write strategies for the graph layer: batch import patterns, upsert logic, index management, and performance tuning under high write throughput.

•Collaborate with senior architects to ensure graph data models honour defined schema constraints and governance standards; apply constraint validation frameworks where applicable.

•Engineer reliable data flows across complementary stores - relational (PostgreSQL), search (Elasticsearch), caching (Redis), and time-series (TimescaleDB) - with consistent transaction semantics.


Entity Resolution & Data Quality

•Build probabilistic entity resolution engines for large-scale deduplication across master data domains - customers, products, entities, or records - leveraging record linkage concepts (Fellegi-Sunter model, blocking strategies, confidence thresholds) and libraries such as Splink, Zingg, or Dedupe.io.

•Define and enforce data quality validation rules at ingestion time; implement automated alerting for schema violations, volume anomalies, and SLA breaches.

•Design master data management patterns for cross-system entity matching and golden record creation; ensure consistency across all downstream consumers.


Data Platform & Lakehouse

•Design and implement data lakehouse patterns (Iceberg / Parquet on S3-compatible or Azure storage) for historical data retention, cost-efficient storage, and analytical workloads.

•Build and maintain ETL/ELT pipelines using Apache Spark or dbt; define transformation logic, partitioning strategies, and incremental processing patterns.

•Ensure data lineage, audit trail, and observability are built into pipeline design from the outset using OpenTelemetry or equivalent tooling.


Technical Leadership & Collaboration

•Contribute to a sub-team of data engineers; participate in sprint planning, design reviews, and on-call rotations for critical pipelines.

•Define and enforce coding standards, pipeline patterns, and infrastructure-as-code practices using Terraform, Docker, and Kubernetes.

•Drive proof-of-concept evaluations for new ingestion technologies, graph platforms, and data tooling relevant to the engagement.


Required Qualifications


Experience

•7 - 10 years of progressive experience in data engineering or a closely related discipline.

•Demonstrated track record of delivering production-grade streaming and CDC pipeline systems in enterprise environments across any industry vertical.

•Hands-on experience with graph database platforms in production - Neo4j preferred; Amazon Neptune, ArangoDB, TigerGraph, or equivalent is acceptable.

•Practical experience with entity resolution, fuzzy matching, or master data management at scale (500K+ records).

•Solid experience with multi-database architectures combining graph, relational, and search layers.

•Candidates from any industry are welcome; experience in regulated or data-intensive domains (financial services, retail, logistics, telecoms, healthcare) is advantageous.


Technical Skills

Streaming & CDC: Apache Kafka

Graph Databases: Production experience with at least one graph database platform - Neo4j (preferred), Amazon Neptune, ArangoDB, or TigerGraph; proficiency in the associated query language (Cypher, Gremlin, or SPARQL).

Supporting Databases: PostgreSQL (relational), Elasticsearch (search), Redis (caching).

Programming: Python (Advanced) - pipeline automation, data workflow scripting, testing; SQL at expert level for complex transformations and query optimisation.

Entity Resolution: Probabilistic record linkage concepts; practical experience with Splink, Zingg, Dedupe.io, or a comparable library.

Data Engineering: High-volume ETL/ELT pipeline design; Apache Spark for distributed processing; data lakehouse patterns (Iceberg, Parquet, Delta Lake).

Cloud & Infrastructure: AWS or Azure - production delivery on at least one platform; Docker, Kubernetes, Terraform.

Familiarity with semantic or schema standards - OWL 2, RDF, SHACL, JSON-LD - sufficient to write conformant graph data models against a defined schema.

Experience with OpenTelemetry, distributed tracing, or observability tooling for pipeline monitoring and incident response.

Prior work in compliance-driven data environments with audit trail, data masking, or access control requirements.

Exposure to graph analytics and visualisation tooling such as Neo4j Bloom, Gephi, or equivalent.

Experience with data governance platforms such as Microsoft Purview, Collibra, or Alation.


Preferred Qualifications

•Bachelor’s or Master’s degree in Computer Science, Information Systems, or a related engineering discipline.

Service Co

Service Co

Agency job
via Vikash Technologies by Rishika Teja
Bengaluru (Bangalore), Hyderabad, Chennai
7 - 11 yrs
₹20L - ₹30L / yr
.NET
React.js
C#
Windows Azure
Amazon Web Services (AWS)
+6 more

5+ years of Software Engineering experience with strong hands-on expertise in C# and .NET


Experience in React or other modern JavaScript front-end frameworks


Strong knowledge of cloud platforms such as Microsoft Azure, AWS, or Google Cloud


Hands-on experience with DevOps practices, including:

CI/CD pipelines

Docker & Kubernetes

Automated deployments


Experience with Git and Agile methodologies like Scrum/Kanban


Bachelor’s degree in Computer Science, Software Engineering, or related field


Strong collaboration skills working with cross-functional teams, architects, business analysts, and stakeholders

ChicMic Studios
Mohali
4 - 9 yrs
₹8L - ₹16L / yr
FastAPI
Docker
PostgreSQL
Celery
Redis
+3 more

Job Description:

Profile: Senior Python AI/ML Engineer

Required experience: 5-9 Years

Location: Mohali, Punjab (WFO only, no hybrid)

B.Tech / MCA preferred

Immediate joiners

We are looking for a Senior Python (AI/ML) Engineer to build and scale a production-grade AI-powered platform focused on image understanding, scene reconstruction, and automated media generation. This is a backend-heavy role requiring strong expertise in system design, asynchronous processing, and scalable API development.

You will work closely with AI/ML, frontend, and DevOps teams to deliver high-performance systems that power object detection, segmentation, and 3D reconstruction pipelines.

Key Responsibilities:

- Design and develop scalable backend services and APIs using FastAPI

- Build modular, maintainable systems with clear service contracts and abstractions

- Implement asynchronous processing pipelines for compute-heavy AI workloads

- Design and manage job queues (Celery, RQ, or similar) and integrate Redis

- Handle long-running and GPU-based tasks

- Work with PostgreSQL and integrate object storage (S3 or equivalent)

- Integrate and productionize ML models

- Optimize inference pipelines for latency, batching, and throughput

- Ensure system reliability with retries, fallbacks, and monitoring

- Collaborate with Data Science, Frontend, and DevOps teams

Required Skills:

- 5+ years of Python development experience

- Hands-on experience with FastAPI and REST APIs

- Strong understanding of async programming and scalable systems

- Experience with Redis and queue systems (Celery, RQ, Kafka)

- Strong PostgreSQL experience

- Experience with Docker (mandatory); Kubernetes is a plus

- Familiarity with AWS, GCP, or Azure

Good to Have:

- Exposure to AI/ML pipelines (YOLO, DETR, SAM, etc.)

- Experience deploying ML models in production

- Understanding of GPU workloads and optimization

- Background in image/video processing or real-time systems

What We’re Looking For:

- Strong problem-solving and system design mindset

- Ability to handle real-world scale and unstructured data

- Experience in AI-integrated backend systems

Benefits:

  • Flexible schedule
  • Health insurance
  • Leave encashment
  • Provident Fund


Zocket

at Zocket

4 recruiters
Dhanesh Sridhar
Posted by Dhanesh Sridhar
Chennai
1 - 3 yrs
₹5L - ₹12L / yr
Amazon Web Services (AWS)
MLOps
Jenkins
Kubernetes
Amazon EKS
+4 more

Why this role exists

Our infrastructure footprint is growing faster than our headcount, and we believe most of that gap should be closed by automation and AI agents, not by hiring more humans to do toil. We need someone early in their career who treats manual work as a bug, ships scripts and agents instead of tickets, and wants to grow into deeper ownership over the next two years.

You will not be the most senior person on the team. You will be the one who multiplies the team.

What you'll own

In your first month

• Take ownership of one slice of our CI/CD pipeline and make it measurably faster, more reliable, or cheaper. We expect a number on a dashboard to move.

• Build at least three internal automations that replace manual ops toil, using AI agents (Claude Code, agentic CLIs, scripted LLM workflows) as your force multiplier.

• Be the first responder for a defined set of alerts. Write the runbooks. Drive the alert volume down.

• Support senior engineers on AI/ML infrastructure (GPU nodes, inference services, model deployment): observe, document, and gradually take on contained changes under review.

By 3 months you should be

• The go-to person for at least two production systems.

• Shipping routine infrastructure changes without needing senior review.

• Treating "manual" as a code smell.

Required (we will reject without these)

• 0–3 years hands-on experience with one major cloud (AWS, GCP, or Azure; one is fine, depth beats breadth).

• Fluent in Linux command line, bash, and at least one scripting language (Python or Go preferred).

• Have shipped something to production that real users hit. A side project counts; a graded coursework lab does not.

• Comfortable with Docker: you can explain what an image vs. a container is and why it matters.

• Working knowledge of networking fundamentals: DNS, HTTP/HTTPS, TLS, ports, basic subnets; enough to debug "it works on my machine."

• Git fluency: branches, merges, rebases, conflict resolution.

• CI/CD pipelines: you have authored or substantially modified pipelines in GitHub Actions, GitLab CI, ArgoCD, Jenkins, or similar. Not just "I clicked Re-run."

• Kubernetes basics: kubectl for real work, can read pod logs, understand deployments and services, can debug a CrashLoopBackOff without panicking. You do not need to have run a cluster; you do need to have lived inside one.

• Active user of AI coding agents (Claude Code, Cursor, Copilot, agentic CLIs, etc.). You should be able to walk us through specific tasks where they made you faster, and specific tasks where they failed you and how you noticed. "I have tried it" is not enough.

Bonus (real plus, not required)

• Infrastructure as Code: Terraform, Pulumi, or Ansible.

• Observability: Prometheus/Grafana, Datadog, OpenTelemetry, any APM.

• Have built or extended an LLM-based agent: a custom MCP server, a scripted multi-step workflow, an internal tool that calls models in a loop. Anything beyond chat-with-Claude.

• Exposure to GPU workloads, model serving (vLLM, Triton, TGI, etc.), or ML pipelines.

What we don't care about

• Whether your degree is in CS — or whether you have a degree at all.

• Brand-name companies on your resume.

• Certifications. They are fine. They do not substitute for having shipped.

How we work

• We default to automation. If you do something manually twice, the third time you script it or hand it to an agent.

• AI agents are part of the workflow, not a novelty. Expect interview questions about exactly how you use them, and where you have caught them being wrong.

• Small, reversible changes beat big-bang rollouts.

• Postmortems are blameless and written down.

• We push back on each other. If you only execute, you will be unhappy here.

How to apply

Send:

• Your resume.

• A short note (≤200 words) describing one infra or automation problem you solved, and how AI agents factored in (or did not, and why). We read these. Generic notes get rejected.

Internal note — delete before posting externally

• Comp band, location policy, team name, and reporting line marked [CONFIRM] need to be filled in before this goes external.

• The Required list is intentionally tight: CI/CD and Kubernetes basics promoted from bonus. Expect this to filter ~80% of typical junior DevOps applicants. The remaining pool will skew toward people who have actually shipped infra at a startup, not bootcamp grads or pure cloud-cert holders.

• IaC, observability, agent-building, and GPU/ML serving stay as bonus. Promoting any of these to required at 0–3 yrs collapses the pool to near-zero or forces hiring senior people at junior comp. If you want IaC required, re-level this to mid (3–5 yrs) and raise the band.

• Screening implication: the resume screen should explicitly check for CI/CD pipeline authorship and any K8s-touching production work. If neither is on the resume, reject at screen. Do not waste interview slots.

• Pipeline watch: if fewer than ~15 qualified resumes after 2 weeks of active sourcing, the first thing to relax is the AI-agent-fluency bar (move to bonus and screen for it in interview instead). Do not relax the "shipped to production" requirement; that is the load-bearing filter.

Mlops Solutions Pvt Ltd
Sowmyasri Parthi
Posted by Sowmyasri Parthi
Bengaluru (Bangalore)
5 - 10 yrs
₹1L - ₹37L / yr
Spring Boot
Java
React.js
.NET
Spring MVC
+8 more

Senior Software Engineer

Responsibilities:

• Lead by the principle of "customer first" to analyse, debug, develop, and maintain customer-centric software.

• Collaborate closely with multidisciplinary teams to analyse, debug, and fix issues, delivering high-quality code with zero regressions and scalable, innovative technical solutions.

• Optimize components for maximum performance and scalability.

• Participate in R&D, proofs of concept, prototyping, code review, root-cause analysis, etc.

• At least 2-3 yrs of experience in taking full ownership of the software development lifecycle, including planning, design, architecture, development, test & deployment, and 2+ years of experience in supporting production or customer issues and escalations.

• Review and analyze support tickets that are complex in nature and require more technical knowledge to analyze. Investigate issues to identify root causes and document findings clearly.

• Influence development practices so that they follow best practices, policies, and procedures.

• Ensure software products meet all non-functional requirements including operational and security needs.

• Excellent verbal and written communication skills, problem solving skills.

• Address complex technical challenges within software systems, ensuring robustness, compliance, and customer satisfaction.

• Contribute to knowledge base

• Support the Lead and Mentor the team of software engineers and own the technical health of the service the team is working on.


Requirements:

• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.

• Minimum of 5 years of professional experience in Java development with expertise in core Java, JDK, data structures, and multithreading.

• Strong analytical skills with experience in root cause analysis and fixes

• Strong experience with Spring and Spring Boot frameworks.

• Strong understanding of software design principles, architecture, and best practices

• Familiarity with server technologies, including Tomcat and WebLogic.

• Proficiency in working with relational databases such as Oracle and PostgreSQL

• Experience with messaging queues, particularly JMS MQ or Artemis MQ.

• Possess exceptional debugging and troubleshooting skills to resolve complex issues across the entire, cross-functional technology stack

• Awareness of / exposure to debugging frontend applications (ReactJS / .NET) is good to have.

• Exposure to Kubernetes, containerization and cloud (AWS) technologies in building scalable, resilient, and distributed environments.

• Excellent problem-solving skills and the ability to work in fast-paced, customer-sensitive production environments.

• Strong communication and collaboration skills. Quick learner.

• Experience and knowledge in working in the HRP application – specifically the Claims functionality. Added experience in other areas including Enrollment, Billing, Financials is a plus. Candidate must have worked previously in HRP

• Familiarity with ticketing systems (Jira, Salesforce) and production support workflows.

NeoGenCode Technologies Pvt Ltd
Akshay Patil
Posted by Akshay Patil
Bengaluru (Bangalore)
5 - 12 yrs
₹10L - ₹30L / yr
NodeJS (Node.js)
JavaScript
TypeScript
Express
Sails.js
+10 more

Job Title : Senior Node.js Developer (SDE 2)

Experience : 5+ Years

Location : Bengaluru (Whitefield)

Work Mode : Hybrid (3 Days WFO)

Openings : 2

Notice Period : Immediate–20 Days Preferred


Role Overview :

We are looking for an experienced Senior Node.js Developer (SDE 2) with strong expertise in building scalable backend applications, microservices, and distributed systems in Agile environments.


Mandatory Keywords :

Node.js, JavaScript, TypeScript, GraphQL, Microservices, Cloud, REST APIs, Docker, Kubernetes, Redis, System Design.


Key Responsibilities :

• Design, develop, and deploy scalable backend systems using Node.js & TypeScript.

• Build REST APIs and GraphQL services.

• Develop secure, reusable, and high-performance applications.

• Work on microservices, cloud platforms, and production support.

• Follow best practices including TDD, testing, and DevOps processes.

• Mentor team members and contribute to technical excellence.


Mandatory Skills :

• 5+ years of software development experience.

• Strong hands-on experience in Node.js, JavaScript, TypeScript.

• Experience with Express.js/Koa/Sails.js.

• GraphQL, REST APIs, Microservices

• AWS/Azure/GCP Cloud

• Docker, Kubernetes, Redis

• Strong understanding of distributed systems, Git, and DevOps.


Good to Have :

Kafka, RabbitMQ, Apollo Federation, New Relic, Datadog, Splunk, React.js/Next.js, Jest/Mocha/Cucumber


Interview Process :

  • L1 – Technical
  • L2 – Coding
  • L3 – System Design (HLD + LLD)
Improving
Aayushi Vats
Posted by Aayushi Vats
Remote only
10 - 20 yrs
₹25L - ₹40L / yr
Engineering Management
Team Management
Team leadership
Kubernetes
Google Cloud Platform (GCP)
+3 more

Engineering Manager


Total Experience - 10+ Years

Work Mode - Remote 


What You’ll Do

Core Responsibilities

  • Lead end-to-end project delivery from planning to execution.
  • Communicate effectively with clients, stakeholders, and internal teams.
  • Prior experience in Infrastructure Engineering or Application Development environments.
  • Drive technical discussions around system architecture, scalability, and delivery.
  • Balance client expectations, project priorities, and team bandwidth effectively.
  • Mentor and guide team members while ensuring healthy work-life balance.
  • Continuously learn and stay updated with evolving technologies and industry practices.

Management Responsibilities

  • Act as the primary point of contact for clients and delivery teams.
  • Track milestones, risks, dependencies, and project schedules proactively.
  • Create and maintain regular project status reports and delivery metrics.
  • Establish efficient technical and operational processes for teams.
  • Provide accurate effort estimations and prioritize tasks across modules and teams.
  • Build strong feedback loops with internal and external stakeholders.

Organization Building

  • Drive key initiatives with organization-wide impact across teams and functions.
  • Improve reporting, operational efficiency, and delivery management practices.

What We’re Looking For

  • 10+ years of experience in software delivery and engineering management.
  • Experience managing and mentoring teams of 10+ members.
  • Strong understanding of SDLC, system design, and software architecture.
  • Hands-on experience delivering projects in both Fixed Scope and T&M models.
  • Background in engineering delivery, ideally having grown from a developer role.
  • Strong analytical and reporting skills with experience in delivery metrics.
  • Excellent client-facing communication and stakeholder management abilities.
  • Experience handling high-pressure situations, risks, and disaster management scenarios.
  • Ability to understand business priorities and align team execution accordingly.


Improving
Leena Lahari
Posted by Leena Lahari
Remote only
5 - 12 yrs
₹25L - ₹35L / yr
Software Testing (QA)
Apache Spark
Spark
Kubernetes
Apache Mesos
+5 more

Senior Spark QA Engineer – Functional & Performance Testing


Location: Remote

Experience: 5+ Years 


Job Description - 

We are looking for a Senior Spark QA Engineer with strong expertise in functional and performance testing of Apache Spark applications and distributed data platforms.


Key Responsibilities -

  • Perform manual and automated testing of Spark jobs, Spark SQL queries, and ETL pipelines.
  • Execute functional, scalability, and performance testing for Spark workloads.
  • Set up and manage Spark clusters on Standalone, YARN, Kubernetes, Mesos, Databricks, EMR, and Dataproc.
  • Conduct benchmarking, validation, and performance analysis of Spark applications.
  • Identify bottlenecks and troubleshoot distributed system issues.
  • Lead QA initiatives and mentor team members.

Required Skills - 

  • 5+ years of QA/testing experience with strong hands-on expertise in Apache Spark.
  • Experience in functional and performance testing of distributed systems.
  • Strong understanding of Spark architecture and optimization techniques.
  • Experience with Kubernetes and cloud-based Spark environments.
  • Proficiency in Python or Java.
  • Experience with automation frameworks and CI/CD pipelines.
Redtring
Keshav Senthil
Posted by Keshav Senthil
Hyderabad
3 - 6 yrs
₹20L - ₹25L / yr
Java
Kotlin
Amazon Web Services (AWS)
Redis
Apache Kafka
+7 more

About Us:


We are hiring for a pre-seed-funded startup called Zeromoblt (https://zeromoblt.com/), a high-agency Hyderabad-based startup revolutionizing student transportation with lean, intelligent tech stacks.


Our mission: architect world-class systems from scratch—fast, scalable, and algorithmically sharp—using Kotlin, React, AWS (EC2, IoT, IAM), Google Maps, and multi-cloud setups. Stealth mode operations mean you're building 0→1 products with founders, not fixing tickets.


What You'll Do

  • Lead end-to-end ownership of complex systems: design, build, deploy, monitor, and iterate at scale.
  • Architect high-performance backends in Kotlin (or JVM langs) that handle real-time routing and IoT data.
  • Craft scalable React UIs that power ops dashboards and parent-facing apps.
  • Drive cloud decisions across AWS, Azure/GCP—optimising costs for our bootstrap runway.
  • Apply DSA/system design to solve hard problems like dynamic route optimization and predictive scaling.
  • Shape the engineering roadmap: propose, prioritise, and ship features with founders.
  • Mentor juniors while executing solo on high-impact bets—no layers, just results.


We're Looking For

  • 3-6 years of hands-on engineering where you've owned and shipped production systems (prove it with code/stories).
  • Elite CS fundamentals: advanced DSA, system design (distributed systems a must), design patterns.
  • Mastery of Kotlin/Java + modern React; real AWS experience (EC2, IAM, CLI—you know our stack).
  • Proven "leap-taker": startup grit, side projects, or open-source that screams hunger.
  • Figure-it-out velocity: you thrive in chaos, learn our domain overnight, and deliver 10x faster than peers.


This Role Is Not For You If…

  • You need structured roadmaps, PM hand-holding, or big-tech process.
  • Comfort > impact: stable salary over equity upside and chaos.
  • You've never worn all hats (dev, ops, product) in a resource-constrained environment.


Why Join Us

  • Massive ownership: lead tech for 10k+ students, direct founder access, shape ZeroMoblt's scale.
  • Flat, high-trust team: flexible Hyderabad/remote, no bureaucracy.
  • Hungry culture: we hire hustlers scaling from 700 to 10k students; your wins are visible daily.
  • Hungry to Leap? Apply now!
MyOperator - VoiceTree Technologies

at MyOperator - VoiceTree Technologies

1 video
2 recruiters
Vijay Muthu
Posted by Vijay Muthu
Remote only
3 - 6 yrs
₹15L - ₹20L / yr
Amazon Web Services (AWS)
Amazon CloudWatch
Grafana
Prometheus
Kubernetes
+3 more

About MyOperator

MyOperator is a Business AI Operator platform that enables businesses, teams, and AI agents to work together seamlessly for customer operations such as Sales, Support, Escalations, Feedback, and Refund processes. With 12,000+ businesses using our platform, we operate at meaningful scale and power mission-critical communication workflows including voice bots, WhatsApp automation, and intelligent call routing.


We are building for reliability, speed, and impact. MyOperator values ownership, critical thinking, and execution. This is a high-expectation, high-learning environment where engineers are empowered to solve complex problems and build systems that directly affect customer outcomes.


Role Overview

We are looking for a skilled and proactive Site Reliability Engineer (SRE) to take end-to-end ownership of production reliability, observability, and performance engineering across MyOperator’s AI-powered communication infrastructure.


This role is not operational-only — it requires strong system design thinking, deep troubleshooting ability, and a production ownership mindset. You will define reliability standards, build observability frameworks, lead incident response, and drive SLO-based engineering practices across distributed AWS and Kubernetes environments.


Key Responsibilities

  • Own production reliability, uptime, latency, and error budgets across critical services.
  • Design and manage production-grade monitoring using Grafana, VictoriaMetrics (Prometheus), PromQL, and AWS CloudWatch.
  • Define and enforce SLIs, SLOs, and SLA thresholds for AI communication systems (voice bots, WhatsApp APIs, call routing).
  • Build real-time operational dashboards for incident response, capacity planning, and leadership visibility.
  • Implement end-to-end distributed tracing using OpenTelemetry (OTEL Collector).
  • Design and maintain centralized logging with strong correlation between logs, metrics, and traces.
  • Create SLO-based alerting systems with minimal noise and fast incident detection.
  • Lead incident response lifecycle: alert triage, mitigation, RCA documentation, and preventive improvements.
  • Drive MTTR reduction through structured monitoring, automation, and reliability engineering practices.
  • Monitor and troubleshoot AWS EKS (Kubernetes) production workloads.
  • Instrument and monitor LLM API integrations, AI inference pipelines, and messaging systems.
  • Analyze logs using OpenSearch / ELK for anomaly detection and root cause identification.
  • Automate operational workflows using Python or Bash to eliminate manual toil.
  • Drive performance optimization, scalability improvements, and capacity planning.
  • Collaborate with engineering teams to instrument new services from day one.

Required Skills & Qualifications

  • 3–6 years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
  • Hands-on experience with:
    • VictoriaMetrics / Prometheus (time-series monitoring)
    • Grafana dashboards and visualization
    • PromQL for writing complex queries and alerts
  • Experience implementing distributed tracing using OpenTelemetry (Mandatory).
  • Strong experience with centralized logging systems (ELK / OpenSearch / Loki).
  • Experience with alerting frameworks such as Alertmanager or Grafana Alerts.
  • Strong understanding of SLIs, SLOs, SLA design, and reliability engineering principles.
  • Hands-on experience managing AWS production workloads (EC2, RDS, ELB, CloudWatch, IAM).
  • Experience with Kubernetes (AWS EKS preferred).
  • Familiarity with CI/CD pipelines and automation tools.
  • Good understanding of Linux systems, networking, and cloud infrastructure.
  • Experience handling production incidents and participating in on-call rotations.
  • Ability to automate operational tasks using Python or Bash.

Good to Have

  • Experience with OpenSearch / ELK log pipelines and anomaly detection.
  • Kubernetes monitoring (pod health, node metrics, autoscaling behavior).
  • CI/CD observability integration (Jenkins, GitHub Actions).
  • Experience in monitoring LLM APIs and AI inference pipelines.
  • Familiarity with MLOps or AI observability tools (Arize, WhyLabs, etc.).
  • Service mesh exposure (Istio).
  • Infrastructure as Code (Terraform, CloudFormation).
  • Experience with chaos engineering or load testing tools.
  • Multi-cluster or multi-region architecture exposure.

Key Expectations

  • Ownership of production systems and high availability.
  • Strong troubleshooting and debugging skills.
  • Focus on automation and reliability improvements.
  • Proactive approach to incident prevention.
  • Ability to reduce alert noise and improve signal quality.
  • Data-driven approach to reliability engineering.

This Role Is Not For

  • Candidates with purely development experience and no production ownership.
  • Candidates without real incident response or on-call experience.
  • Freshers or candidates with less than 3 years of experience.


OpsTree Global
Pragati Srivastava
Posted by Pragati Srivastava
Bengaluru (Bangalore), Mumbai
4 - 9 yrs
₹20L - ₹25L / yr
Google Cloud Platform (GCP)
Kubernetes
Terraform
Monitoring
Prometheus
+3 more

Immediate Hiring: GCP DevOps Engineer | Mumbai & Bengaluru (On-site)


OpsTree Global is urgently hiring a GCP DevOps Engineer with 4–9 years of experience for immediate requirements in Mumbai and Bengaluru.


Key Skills

  • Google Cloud Platform (GCP)
  • Terraform / Infrastructure as Code (IaC)
  • Kubernetes & Helm Charts
  • CI/CD – Jenkins, GitLab CI, GitHub Actions
  • Linux Administration
  • Scripting – Python / Go / Java

Role Responsibilities

  • Build and manage scalable cloud infrastructure on GCP
  • Automate deployments and infrastructure provisioning
  • Ensure system reliability, monitoring, and performance optimization
  • Collaborate with development and operations teams for seamless delivery


📍 Locations: Mumbai & Bengaluru (On-site)

⚡ Immediate Joiners Preferred

💼 Experience: 4–9 Years

Searce Inc

at Searce Inc

3 recruiters
Mohammed Rabidheen
Posted by Mohammed Rabidheen
Coimbatore
4 - 8 yrs
Best in industry
Microsoft Windows Azure
Kubernetes
Terraform
Observability
Reliability engineering

About Searce

Searce (pronounced 'search') is a global, AI-native, and engineering-led modern technology consultancy. Founded in 2004 with a vision to "solve for better," we partner with organizations to "futurify" their businesses by leveraging the full power of Cloud, AI, and Data Engineering.

With a presence across 10+ countries—including the US, India, Singapore, and Australia—Searce has evolved over two decades into a trusted technology partner for over 3,000 clients. We are not just a service provider; we are a group of "solvers-at-heart" who thrive on complex technical challenges.

Why Join the "Solvers" Brigade?

  • Award-Winning Excellence: In 2026, Searce was recognized as the Google Cloud Workplace AI Transformation Partner of the Year (APAC). We are a Premier Google Cloud Partner and a top-tier Managed Services Provider (MSP).
  • AI-First Mindset: We specialize in Applied AI (Generative & Conventional), Cloud Modernization, and Location Intelligence, helping industries from FinServ and Healthcare to Retail and Manufacturing reinvent themselves.
  • The "Futurify" DNA: We don't just maintain; we improve. We use our proprietary EVLOS business innovation framework to ensure our clients aren't just moving to the cloud, but are staying ahead of the curve.

Our Culture: The HAPPIER Values

We look for individuals who live and breathe our HAPPIER values:

  • Humble: We learn from everyone.
  • Adaptable: We embrace change as the only constant.
  • Positive: We focus on solutions, not just problems.
  • Passionate: We are obsessed with engineering excellence.
  • Innovative: We challenge the status quo.
  • Excellence: We deliver impactful, futuristic outcomes.
  • Responsible: We take ownership of our work and its impact.


Your Mission: The Role

solving for better.

You are a reliability-owning, hands-on solver. Not just a "break-fix engineer."

As a DRI (directly responsible individual) for our clients' most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact – improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.

You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today's production environments.


Your Responsibilities

what you will wake up to solve.

This isn’t a “manage tickets” role. You are the architect, the executioner and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:

  • Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You'll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
  • Engineer the Blueprint: You'll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
  • Automate with Intelligence: You'll lead the charge from the keyboard to futurify our operations. You'll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
  • Drive FinOps & Impact: You'll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
  • Be the Expert in the Room: You'll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
  • Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you'll help build a high-performing squad that lives the “Always Hardened” mindset.


Experience & Relevance

We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.

  • Cloud Operations Pedigree: 4+ years of experience in Azure cloud infrastructure, with a significant portion in cloud managed services. Hands-on experience in Kubernetes is mandatory.
  • Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
  • Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.


Functional Skills:

  • Service Delivery Mindset: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
  • Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
  • Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
  • Repository builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.
Egnyte

at Egnyte

4 recruiters
Bhavana Kapalganti
Posted by Bhavana Kapalganti
Remote only
5 - 10 yrs
Best in industry
Performance Testing
Kubernetes
Dynatrace
HP LoadRunner
Docker
+2 more

ABOUT EGNYTE


Egnyte is the secure multi-cloud platform for content security and governance that enables organizations to better protect and collaborate on their most valuable content. Established in 2008, Egnyte has democratized cloud content security for more than 23,000 organizations, helping customers improve data security, maintain compliance, prevent and detect ransomware threats, and boost employee productivity on any app, any cloud, anywhere. For more information, visit www.egnyte.com.


Egnyte is looking for a Performance Engineer to join our performance engineering team. As a Performance Engineer, you will drive proactive monitoring and improve automation for regular operational tasks.


WHAT YOU’LL DO:


  • Develop tools to measure & monitor performance bottlenecks within the application
  • Triage reported performance issues and translate them into reproducible test scenarios
  • Collaborate with production Engineering and application engineering teams to design and execute production scenarios that will assess code and 3rd party performance
  • Develop and run PSR tests and measure stats for them
  • Work with various sub-teams to ensure the SLAs of their core APIs are tracked and maintained release over release
  • Application and architecture code profiling
  • Infrastructure and application performance tuning
  • Troubleshooting performance issues
  • Ability to distill volumes of data, analyze performance results, and diagnose performance problems
  • Capacity estimating, modeling, or planning


YOUR QUALIFICATIONS:


  • Bachelor’s degree in computer science or related field. Advanced degree preferred
  • 5+ years of work experience in performance engineering
  • Expert knowledge of and strong experience with load-testing tools such as LoadRunner and JMeter, and an understanding of APM solutions such as Grafana, AppDynamics, and Dynatrace
  • Experience with microservice architecture, Docker, Kubernetes, Jenkins, Azure, GCP and application monitoring tools
  • Strong expertise on monitoring and analyzing application logs, database reports, system metrics like CPU Utilization, Memory usage, Network usage, Garbage Collection and DB Parameters
  • Strong expertise on identifying potential performance issues and providing recommendations to improve performance
  • Proficiency in JVM technology and JVM troubleshooting skills
  • Proficiency in debugging applications in production at a large scale
  • JVM Profiling, GC Analysis and Tuning experience
  • Experience with Performance, Load, Stress, and Scalability Testing
Virtana

at Virtana

3 candid answers
2 recruiters
Krutika Devadiga
Posted by Krutika Devadiga
Pune
5 - 9 yrs
Best in industry
Google Cloud Platform (GCP)
DevOps
Shell Scripting
Python
Kubernetes
+11 more

Role Overview:

Virtana is looking for a Senior DevOps Engineer to join our R&D Infrastructure team. In this role, you won't just follow conventions — you'll help redefine them. You will own the architecture, build, and day-to-day operations of the GCP-based cloud platform that powers Virtana's SaaS products and the AI-driven observability experience our Global 2000 customers depend on. This is a hands-on senior individual contributor role with meaningful technical leadership scope, working alongside engineers and architects on a unified observability platform.


Work Location: Pune


Job Type: Hybrid


Role Responsibilities:

  • GCP Cloud Operations: Develop, deploy, operate, and support production cloud infrastructure primarily on GCP — leveraging GKE, BigTable, BigQuery, Dataflow, Cloud Storage, IAM, and core networking services.
  • Reliability & SLAs: Ensure production systems are running at all times with multiple levels of redundancy to meet committed SLAs; lead incident response, root cause analysis, and post-incident reviews.
  • Build & Release Automation: Design, implement, and continuously improve scalable CI/CD pipelines and test frameworks leveraged by QA and development teams across the company.
  • Infrastructure as Code: Manage large-scale, repeatable deployments using Terraform, Ansible, Puppet, or SaltStack; champion Git-based workflows and version control standards for distributed engineering teams.
  • Security & Availability: Own the ongoing maintenance, security, patching, and availability of services in line with tight operations, security, and procedural models.
  • Monitoring & Alerting: Plan and deliver high-value monitoring and alerting features to support operations, support, and customer-facing reliability — eating our own dog food with the Virtana Platform wherever possible.
  • Capacity & Cost: Forecast capacity, plan upgrades, patches, and migrations, and drive cloud cost efficiency across hybrid and multi-cloud environments.
  • Cross-Functional Partnership: Work with development, operations, and support personnel to identify, isolate, and diagnose issues; handle support escalations and drive permanent fixes.


Required Qualifications:

  • Bachelor's degree in Computer Science / Engineering or equivalent relevant experience.
  • 5–7 years of professional hands-on DevOps / SRE experience supporting production cloud environments.
  • Strong, demonstrable production experience on GCP — including GKE, BigTable, BigQuery, Dataflow, IAM, and core GCP networking services.
  • Deep, hands-on expertise with container orchestration (Kubernetes) and Docker in production.
  • Advanced proficiency with at least one infrastructure-as-code / configuration management tool: Terraform, Ansible, Puppet, or SaltStack.
  • Solid understanding of networking, firewalls, load balancers, DNS, and database operations.
  • Strong working knowledge of Git-based workflows and version control standards for distributed engineering teams.
  • Comfort operating hybrid environments that include both Linux and Windows ecosystems.
  • Excellent verbal and written communication skills, with the ability to explain highly technical topics to both technical and non-technical audiences.
  • Self-motivated, detail-oriented, and able to work both independently and within a globally distributed team.


Good to Have:

  • Strong scripting skills and a demonstrated ability to automate operational toil — Python preferred; Bash, Go, or Groovy a plus.
  • Hands-on experience designing and operating CI/CD pipelines with Jenkins (Spinnaker, GitHub Actions, or GitLab CI also welcome).
  • Exposure to AWS or other public clouds in addition to GCP.
  • Experience operating SaaS platforms built on microservices architectures.
Service based company

Service based company

Agency job
via Codemind Staffing Solutions by Krishna kumar
Chennai
4 - 7 yrs
₹10L - ₹18L / yr
DevOps
Microsoft Windows Azure
Windows Azure
Docker
Kubernetes
+4 more

Key responsibilities

• Design, build, and maintain robust CI/CD pipelines using Azure DevOps Services (Azure Pipelines) and Git-based workflows.

• Implement and manage infrastructure as code (IaC) using ARM templates, Bicep, and/or Terraform for repeatable environment provisioning.

• Containerize applications (Docker) and manage container orchestration platforms such as AKS (Azure Kubernetes Service).

• Automate build, test, release, and rollback processes; integrate automated testing and quality gates into pipelines.

• Monitor and improve platform reliability and observability using logging and monitoring tools (e.g., Azure Monitor, Application Insights, Prometheus, Grafana).

• Drive platform security and compliance through pipeline controls, secrets management (Key Vault / Vault), and secure configuration practices.

• Implement cost-optimization and governance for Azure resources (tags, policies, budgets).

• Troubleshoot build/release failures, production incidents, and performance bottlenecks; perform root-cause analysis and implement permanent fixes.

• Mentor developers in Git workflows, pipeline authoring, best practices for IaC, and cloud-native design.

• Maintain clear documentation: runbooks, deployment playbooks, architecture diagrams, and pipeline templates. 

Required skills & experience

• 4+ years hands-on experience working with Azure and cloud-native application delivery.

• Deep experience with Azure DevOps (Repos, Pipelines, Artifacts, Boards).

• Strong IaC skills with Terraform, ARM templates, or Bicep.

• Solid experience with CI/CD design and YAML pipeline authoring.

• Practical knowledge of containerization (Docker) and Kubernetes — preferably AKS.

• Scripting skills: PowerShell, Bash, and/or Python for automation.

• Experience with Git workflows (branching strategies, PRs, code reviews).

• Familiarity with configuration management and secrets management (Azure Key Vault, HashiCorp Vault).

• Understanding of networking, identity (Azure AD), and security fundamentals in Azure.

• Strong troubleshooting, debugging, and incident response skills.

• Good collaboration and communication skills; ability to work across teams.

Certification

AZ-400 (Microsoft Certified: DevOps Engineer Expert), AZ-104, AZ-305, or Terraform Associate.

 


Blitzy

at Blitzy

2 candid answers
1 product
Bisman Gill
Posted by Bisman Gill
Pune
5yrs+
₹65L - ₹90L / yr
Amazon Web Services (AWS)
Kubernetes
Terraform
Python
Distributed Systems

The Role

As a Senior Site Reliability Engineer at Blitzy's Pune headquarters, you will be the backbone of our platform's reliability, scalability, and operational excellence. You'll work at the intersection of software engineering and infrastructure, ensuring our AI-powered development platform remains highly available and performant as we scale rapidly. This is a high-impact, hands-on role for an engineer who thrives in a fast-moving environment and takes deep ownership of the systems they build.


What Success Looks Like

  • In 30 days: You have a deep understanding of Blitzy's infrastructure architecture, have identified key reliability risks, and are actively contributing to on-call rotations.
  • In 90 days: You have shipped meaningful improvements to observability, incident response workflows, and deployment pipelines that measurably reduce MTTR and increase system uptime.
  • In 6 months: You have driven at least one major reliability initiative from inception to production, established SLO/SLA frameworks for critical services, and are a trusted technical voice shaping our infrastructure roadmap.


Areas of Ownership

  • Design, build, and operate scalable, fault-tolerant infrastructure across cloud environments (AWS, GCP, or Azure).
  • Define and enforce SLOs, SLAs, and error budgets; lead blameless postmortems and drive systemic improvements.
  • Build and maintain robust CI/CD pipelines, release automation, and deployment infrastructure.
  • Own observability: design and maintain logging, metrics, tracing, and alerting stacks (e.g., Prometheus, Grafana, Datadog, OpenTelemetry).
  • Partner closely with software engineering teams to embed reliability practices into the development lifecycle.
  • Drive capacity planning, performance benchmarking, and cost optimization across our infrastructure.
  • Champion security best practices within the infrastructure and deployment layers.


Required Experience

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.
  • Strong proficiency in at least one major cloud platform (AWS preferred); experience with Kubernetes and container orchestration at scale.
  • Hands-on experience with infrastructure-as-code tools (Terraform, Pulumi, or equivalent).
  • Proven track record designing and maintaining high-availability, distributed systems.
  • Deep expertise in observability tooling, incident management, and on-call practices.
  • Strong scripting and automation skills (Python, Go, Bash, or similar).
  • Excellent communication skills with the ability to collaborate across engineering teams and present technical findings to leadership.


What Makes You Stand Out

  • Experience supporting AI/ML workloads or GPU-accelerated infrastructure.
  • Prior experience in a high-growth startup environment where you wore multiple hats.
  • Familiarity with eBPF, service mesh technologies (Istio, Linkerd), or advanced networking.
  • Contributions to open-source SRE/DevOps tooling or communities.
  • Experience building global, multi-region infrastructure with strict latency and availability requirements.


What Makes This Role Different

You won't be maintaining legacy systems or fighting fires in a sprawling monolith. At Blitzy, you're building reliability into a greenfield AI platform that is redefining how the world creates software. You'll have direct influence over architectural decisions, work side-by-side with world-class engineers, and see the tangible impact of your work as we scale to serve Fortune 500 customers. As a founding member of the Pune SRE team, you'll help shape the culture and technical standards of a team that will grow with the company.

Blitzy

at Blitzy

2 candid answers
1 product
Bisman Gill
Posted by Bisman Gill
Pune
5yrs+
Up to ₹85L / yr (Varies)
Kubernetes
Google Cloud Platform (GCP)
Linux/Unix
Docker
Terraform
+1 more

The Role

As a DevOps Engineer at Blitzy's Pune headquarters, you'll build and operate the infrastructure that powers our AI agents and the applications they produce. You'll work at the intersection of cloud infrastructure, developer tooling, and AI-native systems — designing the pipelines, clusters, and automation that allow Blitzy to ship production-ready software at machine speed. This is a hands-on, high-ownership role for an engineer who moves fast, automates everything, and cares deeply about developer experience and system reliability.


What Success Looks Like

  • Kubernetes clusters are running reliably at scale, with clear deployment standards, Helm-managed releases, and minimal manual intervention required from engineering teams.
  • CI/CD pipelines are fast, consistent, and trusted — developers ship confidently knowing the automation handles the rest.
  • Observability is comprehensive: alerts are actionable, dashboards are meaningful, and incidents are resolved faster because the right data is always available.
  • Infrastructure provisioning is fully automated — no snowflake environments, no manual setup, everything reproducible through code.
  • AI agent orchestration infrastructure is stable and scalable, directly enabling Blitzy's core product to deliver for enterprise customers.
  • Engineering teams notice the difference — developer productivity is measurably higher and infrastructure is no longer a bottleneck to shipping.


Areas of Ownership

  • Build and manage Kubernetes clusters supporting AI agent workloads and application deployment at scale.
  • Design, implement, and maintain CI/CD pipelines for application and AI service delivery — ensuring speed, reliability, and repeatability.
  • Automate infrastructure provisioning and dynamic scaling using Python scripts and Terraform IaC.
  • Deploy and manage applications using Helm charts; own packaging standards and release automation.
  • Build and maintain comprehensive observability stacks — alerting, distributed tracing, metrics, and logging (e.g., Prometheus, Grafana, Datadog, OpenTelemetry).
  • Monitor and maintain production services and APIs; own incident response and drive blameless postmortems.
  • Build dedicated infrastructure for AI agent orchestration and management, enabling Blitzy's core autonomous development capabilities.
  • Collaborate with engineering teams on deployment strategies and continuously improve developer experience through tooling and automation.


Required Experience

  • 5–8 years of DevOps, infrastructure, or platform engineering experience.
  • Python proficiency for scripting, automation, and infrastructure tooling.
  • Deep Kubernetes expertise — cluster management, workload deployment, scaling, and troubleshooting.
  • Hands-on Helm experience for application packaging and release management.
  • Proven ability to design and implement CI/CD pipelines across complex, multi-service environments.
  • Practical experience with at least one major cloud platform (AWS, GCP, or Azure).
  • Terraform proficiency for infrastructure-as-code provisioning and state management.
  • Strong Linux administration and containerization fundamentals (Docker, OCI).


What Makes You Stand Out

  • CKA (Certified Kubernetes Administrator) certification.
  • Familiarity with MLOps tooling such as MLflow, Kubeflow, or similar platforms for AI/ML workload management.
  • Experience with microservices architecture and distributed systems design.
  • Knowledge of API gateways and service mesh technologies (Istio, Linkerd, or equivalent).
  • Prior experience in a high-growth AI or software startup where you moved fast and owned broadly.
  • Track record of meaningfully improving developer productivity through platform and tooling investments.


What Makes This Role Different

Most DevOps roles have you maintaining existing systems. At Blitzy, you're building the infrastructure layer for a platform that autonomously writes enterprise software — a genuinely new category of product. You'll work on AI agent orchestration, Kubernetes at scale, and developer tooling that is directly responsible for how fast Blitzy delivers value to Fortune 500 customers. As an early member of the Pune engineering team, you'll have outsized influence over our infrastructure culture and technical direction. High performers are eligible for company equity — giving you real ownership in what you build.

Ampera Technologies
Faisal AshrafNomani
Posted by Faisal AshrafNomani
Gurugram
3 - 10 yrs
Best in industry
Linux/Unix
Microsoft Windows
DNS
Docker
Kubernetes
+9 more

About the Role

We are seeking a proactive and detail-oriented Site Reliability Engineer (SRE) with 3+ years of experience to ensure high availability, reliability, and performance of production systems.

This role focuses on automation, observability, incident management, and cross-team coordination to drive operational excellence.


Key Responsibilities

· Maintain reliable, scalable, and secure production environments.

· Implement and manage monitoring, alerting, and logging solutions.

· Contribute to defining and tracking SLIs/SLOs and support error budget practices.

· Automate operational tasks to improve efficiency and reduce manual effort.

· Perform troubleshooting and Root Cause Analysis (RCA) for production incidents.

· Optimize system performance, availability, and capacity.

· Maintain runbooks, SOPs, and incident documentation in Confluence.

· Adhere to change management, deployment governance, and disaster recovery standards.

· Support incident response for critical production services.


Collaboration & Tools

· Coordinate with external vendors and internal cross-functional teams.

· Work closely with Engineering, Product Owners, and Operations teams.

· Manage incidents and changes using ServiceNow & JIRA.

· Collaborate through Slack and structured communication channels.


Technical Skills

Systems & Cloud

· Strong knowledge of Windows and Linux/Unix systems.

· Solid understanding of networking fundamentals (DNS, TCP/IP, Load Balancing, Firewalls).

· Experience with at least one cloud platform (AWS, Azure, or GCP).

Automation & CI/CD

· Proficiency in one scripting/programming language (Python, Go, Bash, PowerShell, or Java).

· Understanding of CI/CD pipelines and automation practices.

Containers & Observability

· Hands-on experience with Docker and Kubernetes.

· Experience with monitoring tools such as Grafana or Power BI.

· Ability to analyze logs, metrics, and traces for troubleshooting.

ITSM & Documentation

· Experience with ServiceNow & JIRA (incident/change/problem workflows).

· Working knowledge of Confluence for technical documentation and knowledge management.


Additional Experience (Preferred)

· Background in DevOps, Cloud Engineering, or Platform Engineering.

· Understanding of security best practices and compliance standards.

· Familiarity with AI-assisted engineering tools (Claude Code, Jellyfish, GitHub Copilot).

· Exposure to large-scale or production-grade systems.


Soft Skills

· Strong analytical and troubleshooting mindset.

· Excellent written and verbal communication skills.

· Effective stakeholder and vendor coordination.

· Ownership-driven and composed during high-severity incidents.



Applying to jobs at Ampera is completely free. We never ask candidates for any payment.

Simbian AI
Akanksha Sharan
Posted by Akanksha Sharan
Remote only
4 - 8 yrs
₹15L - ₹40L / yr
Kubernetes
Amazon Web Services (AWS)
Helm
Python
Shell Scripting

About Simbian


Simbian is at the forefront of cybersecurity innovation, leveraging purpose-built AI Agents to deliver 10x security outcomes for global enterprises and MSSPs. Our platform autonomously investigates and responds to alerts, freeing security teams from repetitive tasks. Simbian combines privacy-first technology, proven integration with 70+ enterprise tools, and rapid deployment for measurable value.


Role Overview

We are seeking a collaborative, innovative DevOps Engineer passionate about enabling secure, scalable operations for cutting-edge cybersecurity products. Join our team during a period of high growth and help architect the future of agentic AI security platforms.


Key Responsibilities

• Kubernetes Management:

o Manage and maintain production-grade Kubernetes clusters across multiple cloud providers (AWS is essential, Azure is valuable, GCP is a plus).

o Deploy, upgrade, troubleshoot, and scale stateful and stateless workloads (NGINX, Postgres, MongoDB, OpenCTI, OpenSearch, Kafka, Hadoop, Fluentd) in Kubernetes.


• Cloud Operations:

o Operate and optimize cloud environments, with strong expertise in AWS (AWS Certified Solutions Architect Professional or equivalent Azure cert preferred).

o Design, deploy, and manage infrastructure on AWS and Azure (GCP optional).


• SQL Database Management:

o Administer SQL databases, ideally Postgres, on Kubernetes clusters or cloud VMs.

o Perform routine maintenance, backups, upgrades, monitoring, and optimization.


• Infrastructure as Code:

o Build, install, upgrade, and maintain Helm charts with expertise.

o Use and understand Ansible for cloud automation (AWS/Azure), and Terraform for infrastructure provisioning.


• Monitoring, Logging, Observability:

o Implement and manage logging and metrics stacks using OpenSearch/Elasticsearch, Prometheus, Grafana, Thanos or similar open source tools.


• Programming & Scripting:

o Develop automation scripts in Bash (proficient with control structures).

o Produce scripts or microservices in Node.js (preferred) or Python/Django (bonus).


• CI/CD:

o Build and maintain CI/CD pipelines preferably using GitHub Actions (Jenkins or equivalent is acceptable).


• Containerization:

o Create, manage, and troubleshoot Docker/Podman containers, images, volumes, and use Docker Compose for local development.


• Customer-Facing On-Prem Deployments (Bonus):

o Install, configure, and support Kubernetes on customer premises.

o Demonstrate ownership, initiative, and strong customer communication skills.

o Solid knowledge of Linux administration, networking, and cloud environments.


What You’ll Bring:

• 4+ years’ experience in DevOps, SRE, or Production Engineering.

• Mastery of Kubernetes, AWS, infrastructure automation, and database management.

• Strong collaborative, curious, and growth-driven mindset.

• Ability to challenge ideas, drive innovation, and embrace rapid change.

• Excellent communication for technical customer interactions.


Why Join Simbian?

• Work with pioneering agentic AI security—impact global security teams.

• Shape infrastructure for privacy-first technology in a high-growth startup.

• Enjoy a dynamic remote-first work culture with opportunities for ownership and advancement. 

appscrip

at appscrip

2 recruiters
Nilam Surati
Posted by Nilam Surati
Surat
0.6 - 1.5 yrs
₹3L - ₹5L / yr
DevOps
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
Windows Azure
Docker
+2 more

We are currently hiring for a Junior DevOps Developer.


Please see the job description below.

 

Job Description: Junior DevOps Developer (6 Months – 1.5 Years Experience)

Job Title: Junior DevOps Developer

Experience: 6 months to 1.5 years

Employment Type: Full-time

About the Role:

We are looking for a motivated Junior DevOps Developer to support our development and operations teams. You will assist in managing cloud infrastructure, improving deployment processes, and maintaining system reliability.

Key Responsibilities:

  • Assist in managing and maintaining cloud infrastructure (AWS/GCP/Azure)
  • Support CI/CD pipeline setup and maintenance
  • Help automate deployment processes and routine tasks
  • Monitor system performance and troubleshoot issues
  • Assist in containerization using Docker and Kubernetes
  • Perform root cause analysis for production issues
  • Collaborate with developers to improve system performance and scalability
  • Maintain documentation for infrastructure and processes
  • Work with cloud platforms and infrastructure that include Hetzner

Required Skills:

  • Basic understanding of DevOps concepts and workflows
  • Knowledge of cloud platforms like AWS, GCP, or Azure
  • Familiarity with Docker and Kubernetes
  • Basic understanding of Infrastructure as Code tools (Terraform is a plus)
  • Knowledge of Git and version control systems
  • Basic scripting knowledge (Bash/Python preferred)

Good to Have:

  • Exposure to CI/CD tools (Jenkins, GitHub Actions, GitLab CI/CD)
  • Understanding of monitoring tools (Grafana, Prometheus) 


You can contact me on this WhatsApp number: Nine three one six one two zero one three two

Improving
Aayushi Vats
Posted by Aayushi Vats
Pune, Bengaluru (Bangalore)
8 - 12 yrs
₹15L - ₹40L / yr
Kubernetes
Terraform
CI/CD
Docker
Go Programming (Golang)
+3 more

Platform Engineer – Cloud & On-Prem Infrastructure


Location: Pune or Bangalore (work from office, 5 days a week)


Must-Have Skills:

  • 8+ years deploying, upgrading, and maintaining infrastructure across on-premises and public cloud, with Kubernetes and Docker
  • Proficiency in Infrastructure as Code using Terraform or Pulumi
  • Hands-on coding in Golang or Python, plus Bash scripting 

Good-to-Have Skills:

  • Familiarity with Kubernetes management solutions (OpenShift, Rancher, GKE, EKS, AKS, VMware TKG) 
  • Experience with VM management platforms (e.g., Red Hat OpenShift Virtualization, VMware)
  • Kubernetes certifications (CKA, CKAD)
  • Exposure to service mesh technologies (Istio, Linkerd)

Who You Are

  • A platform engineer who builds and maintains the infrastructure backbone for both on-prem and cloud environments
  • Passionate about automating daily operations and eliminating manual toil
  • Comfortable authoring and evolving IaC (Infrastructure as Code) templates to enforce consistency

What You’ll Do & Learn

  • Roll out & maintain on-premises and cloud infrastructure for development and testing environments 
  • Implement & support CI/CD pipelines to drive our software delivery processes
  • Develop automation tools that streamline routine operations and improve reliability 
  • Build & enhance Infrastructure-as-Code templates (Terraform, Pulumi) for rapid, repeatable provisioning 
  • Document system designs, configurations, and processes to enable an asynchronous, distributed team culture
Chennai
0 - 2 yrs
₹2.5L - ₹3L / yr
Microsoft Windows
Linux/Unix
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
Python
+8 more

Location: Chennai (Hybrid)

Commitment: Minimum 2 Years (Excluding 3 months of Probation)

Experience Level: Fresher / Entry Level


Job Overview:

We are looking for a skilled and versatile System Administrator with strong expertise in Windows and Linux environments, along with working knowledge of cloud infrastructure, cybersecurity, automation, and AI/ML systems.

The ideal candidate should be capable of handling enterprise IT infrastructure, supporting multi-cloud environments, and contributing to AI/ML deployment and integration activities. Strong communication skills and the ability to collaborate with technical and client-facing teams are essential.

 

Key Responsibilities:

  • Manage and maintain Windows and Linux server environments ensuring stability, performance, and security.
  • Support deployment, configuration, and administration of IT infrastructure components across on-prem and cloud environments.
  • Monitor system health, troubleshoot issues, and ensure high availability of services.
  • Work with cloud platforms such as AWS, Microsoft Azure, and Google Cloud.
  • Assist in implementation of security solutions including IAM, firewalls, endpoint protection, and SIEM tools.
  • Develop and maintain automation scripts using Python, PowerShell, or JavaScript.
  • Support deployment and integration of AI/ML models into production environments.
  • Collaborate with engineering and development teams to optimize infrastructure and application performance.
  • Participate in technical discussions, documentation, and client support activities when required.

 

Required Skills & Qualifications:

  • Strong knowledge of Windows and Linux system administration.
  • Good understanding of networking, servers, and cloud fundamentals.
  • Experience or exposure to AWS, Azure, or GCP.
  • Proficiency in scripting languages such as Python, PowerShell, or JavaScript.
  • Basic understanding of cybersecurity principles and system hardening.
  • Familiarity with AI/ML concepts and deployment workflows is an advantage.
  • Strong analytical and troubleshooting skills.
  • Excellent verbal and written communication skills.

 

Preferred Qualifications:

  • Experience with virtualization and containerization (VMware, Docker, Kubernetes).
  • Knowledge of CI/CD pipelines and DevOps practices.
  • Exposure to MLOps concepts and model deployment workflows.
  • Understanding of monitoring tools and logging systems.
  • Experience working in hybrid or enterprise IT environments.

 

What We Offer:

  • Exposure to enterprise-level infrastructure and cloud environments.
  • Opportunity to work on real-world AI/ML integration projects.
  • Structured career growth into Cloud, DevOps, Security, or AI/ML engineering roles.
  • Collaborative work environment with hands-on learning opportunities.
  • Competitive compensation and long-term growth path.

 

Who Should Apply:

  • Freshers or candidates with up to 2 years of experience.
  • Candidates passionate about system administration, cloud computing, and AI/ML.
  • Individuals eager to work in infrastructure-heavy, production environments.
  • Strong communicators who can work in team-oriented and client-facing roles.
Incubyte

at Incubyte

4 recruiters
Titiksha Singh
Posted by Titiksha Singh
Remote only
3 - 6 yrs
Best in industry
DevOps
Kubernetes
NodeJS (Node.js)
Python

About Us

We believe the future of software development is AI-native — where engineers operate at a higher level of abstraction and quality remains non-negotiable. 

Incubyte is a software craft consultancy where the “how” of building software matters as much as the “what”.  

We partner with companies of all sizes, from helping enterprises build, scale, and modernize to early-stage founders bring their ideas to life. 

Our engineers operate in an AI-native development model, using AI as a collaborator across the SDLC to accelerate development while upholding the discipline of software craftsmanship. Guided by Software Craftsmanship and Extreme Programming practices, we build reliable, maintainable, and scalable systems with speed, without compromising quality. If this way of building software resonates with you, we’d like to talk. 

Our Guiding Principles 

These principles define how we work at Incubyte. They are non-negotiable. 

Relentless Pursuit of Quality with Pragmatism 

  We build high-quality systems without losing sight of delivery. 

Extreme Ownership 

  We take responsibility end-to-end for decisions, execution, and outcomes. 

Proactive Collaboration 

  We collaborate closely, challenge each other, and solve problems together. 

Active Pursuit of Mastery 

  We continuously improve our craft and raise our bar. 

Invite, Give, and Act on Feedback 

  We seek, give, and act on feedback to get better every day. 

Ensuring Client Success 

  We act as trusted partners and focus on real outcomes, not just output. 


Job Description

This is a remote position.

Experience Level

This role is ideal for engineers with 3-5 years of experience and a strong background in building secure, scalable platforms.

We are looking for hands-on DevOps and Backend Engineers with real-world experience in application/feature development, system design, testing practices such as TDD, full-stack development, handling production incidents, distributed systems, and modern infrastructure challenges. 


What You’ll Do as a Software Craftsperson

  • Design and document real-world DevOps and backend scenarios based on production incidents such as outages, scaling challenges, and secure deployments
  • Translate real engineering experiences into benchmark tasks that contribute to training next-generation AI systems
  • Contribute to building secure, scalable, Kubernetes-native architectures across modern infrastructure environments
  • Work across critical engineering domains including CI/CD pipelines, observability, identity & access management, infrastructure-as-code, and backend services
  • Collaborate with internal teams to design and simulate realistic engineering workflows and system behaviors
  • Apply practical engineering judgment to model distributed systems challenges and improve system resilience and reliability



Requirements

What You’ll Bring


3-5 years of experience in DevOps and Backend Engineering with a strong foundation in building secure, scalable systems.

Strong hands-on expertise in DevOps and backend technologies (Node.js/Java/Go/Python) including:

  • Kubernetes, Terraform, and CI/CD pipelines
  • Tools such as k9s, k3s (GitLab CI preferred)
  • Backend technologies such as Go, Python, or Java
  • Experience with Docker, gRPC, and Kubernetes-native services

Demonstrated experience working with secure, offline or air-gapped deployments (highly preferred)

Familiarity with distributed systems and backend architecture, with exposure to ML or distributed pipelines being a plus.

Hands-on experience across multiple core functional areas, with exposure to at least five of the following:

  • Identity & Access Management
  • Observability (Prometheus + Grafana)
  • CI/CD Pipelines
  • Keycloak
  • GitLab CI
  • Terraform OSS
  • Kubernetes ecosystem tools

Strong problem-solving ability with real-world experience in handling production systems, incidents, and infrastructure challenges

Ability to work across multiple layers of the stack, from infrastructure to backend services, while ensuring scalability, reliability, and security


Benefits

Life at Incubyte


We are a remote-first company with structured flexibility. Teams commit to shared rhythms during core hours, ensuring smooth collaboration while maintaining autonomy. Twice a year, we come together in person for a co-working sprint and once a year for a retreat - with all travel expenses covered.

Our environment is built for crafters: experimenting with real-world systems, solving complex infrastructure challenges, and contributing to cutting-edge AI initiatives. We are all lifelong learners, and our work is our passion.

Perks

  • Dedicated learning & development budget
  • Sponsorship for conference talks
  • Comprehensive medical & term insurance
  • Employee-friendly leave policies
  • Home Office fund
  • Medical Insurance
Bengaluru (Bangalore)
5 - 8 yrs
₹38L - ₹45L / yr
Node.js
Python
Field Engineer
Forward Deployed
Docker
+1 more

Role & Responsibilities

Own the Client’s Outcome:

  • Embed with enterprise customers – on-site and remotely – to understand their supply chain operations, data estate, and what success actually looks like for their business.
  • Scope and design technical solutions for messy, real-world logistics problems – with a clear line to measurable impact: cost per delivery, SLA performance, empty kilometres.
  • Own the full deployment lifecycle: architecture through go-live through steady-state. You’re accountable for the outcome, not just the code.

Build and Ship:

  • Design, build, and maintain backend services in Node.js or Python that power routing, planning, and execution at enterprise scale.
  • Build and own the integrations connecting Locus to client ERPs, TMS, WMS, and OMS platforms – these integrations are often the riskiest part of a deployment.
  • Write production code that runs under real load. If it isn’t in production, it hasn’t shipped.

Be the Technical Interface with the Client:

  • Run architecture reviews, lead integration workshops, and represent Locus in executive steering meetings. You need to be credible at every level of the client organisation.
  • Bring field learnings back into the product and platform teams. Some of Locus’s best features started as a client workaround.
  • Push back when a client request would compromise platform integrity – and propose a better alternative.

Show Up On-Site:

  • Travel to client sites – domestic and international, up to ~30% of the time – for kick-offs, integration sprints, go-lives, and post-live reviews.
  • Build the kind of relationship where the client’s ops lead calls you directly when something goes wrong at 2am instead of raising a support ticket.
  • Be comfortable wherever the work is: a warehouse floor, a logistics control tower, a C-suite boardroom.

Make the Next Deployment Easier:

  • Document architecture decisions, integration patterns, and deployment playbooks – every engagement should make the next one faster.
  • Work closely with Product, Customer Success, and Platform Engineering. Share what you’re seeing in the field; don’t wait to be asked.
  • Mentor junior FDEs and raise the technical bar across the team.

Ideal Candidate

  • Strong Forward Deployed / Field Engineer
  • Mandatory (Experience 1): Must have 5+ years of backend engineering experience with hands-on coding in Node.js or Python, building production-grade systems
  • Mandatory (Experience 2): Must have 2+ years in client-facing / deployment-heavy roles, working directly with enterprise customers
  • Mandatory (Experience 3): Must have experience shipping and owning production systems end-to-end: From design → build → deployment → post-production support
  • Mandatory (Tech Skills 1 - Backend & Systems): Strong in: Node.js or Python (must-have), Building scalable backend services
  • Mandatory (Tech Skills 2 - Integrations): Must have experience with: Enterprise integrations (APIs, third-party systems), Systems like ERP / TMS / WMS / OMS
  • Mandatory (Tech Skills 3 - Data & Messaging): Hands-on with: Relational + NoSQL databases, Event streaming / queues (Kafka / RabbitMQ or similar)
  • Mandatory (Tech Skills 4 - Cloud & Deployment): Experience with: Cloud platforms (AWS / GCP / Azure), Docker + Kubernetes (or containerised deployments)
  • Mandatory (Company): Top Product companies / Startups / SaaS / platform companies


Hyderabad
7 - 15 yrs
₹25L - ₹50L / yr
Microsoft Windows Azure
Azure OpenAI
Kubernetes
Microservices
ASP.NET
+1 more

Role: Senior Software Developer (Backend) - .Net with Microservices & Cloud

YOE: 6.5+ years

Skills: C#, ASP.NET, Microservices Architecture, ASP.NET Core, Web API development, Azure Kubernetes Service (AKS), API Gateway / Azure API, Entra (Authentication), Azure Service Bus, Azure Functions, Azure Blob storage, Caching, NoSQL Databases

About the role:

The Senior Software Developer designs, builds, tests, and, most importantly, ships high-value software that solves real problems. They strive for security, performance, simplicity, usability, and maintainability, and they mentor and guide less experienced software engineers.

Responsibilities:

1. Team Contribution

● Works within established agile methods, promoting an atmosphere of continuous improvement.

● Continuously learns new technologies and patterns and practices.

● Documents knowledge for the benefit of the team.

● Reports to the team on obstacles and roadblocks.

● Participates in, and occasionally leads, sprint planning, standups, retrospectives, and other team meetings.

● Promotes patterns and best practices on the team.

● Mentors and guides the less experienced software engineers.

2. Planning and Design

● Works with the product team and stakeholders to refine and document requirements.

● Estimates effort for planning purposes.

● Designs and documents enterprise-level software architecture, consulting with Enterprise Architecture when appropriate.

3. Development

● Writes code to develop software that meets requirements and specifications.

● Follows the established software development life cycle (SDLC).

● Writes code with readability and future maintenance in mind.

● Follows established source control standards and best practices.

● Adheres to established secure coding practices.

● Reviews code for other developers.

● Leads team-based development efforts.

4. Quality Assurance

● Validates QA findings and fixes defects.

● Develops integration and testing points in the software that allow for QA testing.

● Assists QA in running performance and load tests.

5. Release

● Assists with release planning and releases.

6. Support

● Assists the support team as needed, including root cause analysis.

● Writes maintenance and metric statistics scripts and entry points for measuring and monitoring.

Requirements:

Solid Understanding of The Following:

● Microservices Architecture:


Confidential. © Foxsense Innovations.


● Microservices design principles (bounded contexts, loose coupling)

● API-first design and contract management

● Event-driven design principles

● Asynchronous messaging patterns

● Eventual consistency concepts

● Idempotency and message replay handling

● ASP.NET Core Web API development

● Web Apps

● Azure Kubernetes Service (AKS)

● Azure Blob Storage usage and lifecycle management

● API Gateway / Azure API Management concepts

● Entra (Authentication)

● Azure Service Bus

● Azure Functions

● Caching

● NoSQL Databases

Processes & Standards: Git, GitFlow, OO Programming, Kanban, Secure Coding, & Agile Methodologies

Bonus Skills:

● Excellent written and verbal communication

● Excellent documentation

● Continuous learning

● Collaboration across team and functional boundaries

● Troubleshooting and creative problem solving

● Design simple architecture that supports complex applications and APIs

● Architect extensible databases

● Author complex component-based client applications and RESTful APIs

● Perform advanced CRUD operations against multiple data sources

● Manipulate enterprise level data structures

● Mentor less experienced team members

● Take ownership of team processes and legacy applications

● Perform business analysis tasks, such as requirements gathering and wireframing

Searce Inc
Pune
3 - 5 yrs
Best in industry
Amazon Web Services (AWS)
Kubernetes

Your Mission: The Role

solving for better.

You are a reliability-owning, hands-on solver. Not just a "break-fix engineer."

As a DRI (directly responsible individual) for our clients' most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact – improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.

You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today's production environments.


Your Responsibilities

what you will wake up to solve.

This isn’t a “manage tickets” role. You are the architect, the executioner and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:

  • Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You'll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
  • Engineer the Blueprint: You'll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
  • Automate with Intelligence: You'll lead the charge from the keyboard to futurify our operations. You'll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
  • Drive FinOps & Impact: You'll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
  • Be the Expert in the Room: You'll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
  • Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you'll help build a high-performing squad that lives the “Always Hardened” mindset.


Experience & Relevance

We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.

  • Cloud Operations Pedigree: 3+ years of experience in AWS cloud infrastructure, with a significant portion in cloud managed services.
  • Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
  • Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.


Functional Skills:

  • Service Delivery Mindset: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
  • Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
  • Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
  • Repository builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.


Join the ‘real solvers’

ready to futurify?

If you are excited by the possibilities of what an AI-native, engineering-led, modern tech consultancy can do to futurify businesses, apply here and experience the ‘Art of the possible’. Don’t Just Send a Resume. Send a Statement.

Searce Inc

at Searce Inc

3 recruiters
Vaivashhya VN
Posted by Vaivashhya VN
Bengaluru (Bangalore), Coimbatore
3 - 5 yrs
Best in industry
Google Cloud Platform (GCP)
Kubernetes

Your Mission: The Role

solving for better.

You are a reliability-owning, hands-on solver. Not just a "break-fix engineer."

As a DRI (directly responsible individual) for our clients' most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact – improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.

You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today's production environments.


Your Responsibilities

what you will wake up to solve.

This isn’t a “manage tickets” role. You are the architect, the executioner and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:

  • Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You'll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
  • Engineer the Blueprint: You'll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
  • Automate with Intelligence: You'll lead the charge from the keyboard to futurify our operations. You'll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
  • Drive FinOps & Impact: You'll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
  • Be the Expert in the Room: You'll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
  • Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you'll help build a high-performing squad that lives the “Always Hardened” mindset.


Experience & Relevance

We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.

  • Cloud Operations Pedigree: 3+ years of experience in GCP cloud infrastructure, with a significant portion in cloud managed services.
  • Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
  • Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.


Functional Skills:

  • Service Delivery Mindset: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
  • Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
  • Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
  • Repository builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.


Join the ‘real solvers’

ready to futurify?

If you are excited by the possibilities of what an AI-native, engineering-led, modern tech consultancy can do to futurify businesses, apply here and experience the ‘Art of the possible’. Don’t Just Send a Resume. Send a Statement.

FrontM Limited
Pradeep Chandkiran
Posted by Pradeep Chandkiran
Bengaluru (Bangalore)
3 - 5 yrs
₹8L - ₹14L / yr
Kubernetes
Terraform
Amazon Web Services (AWS)

Location: Bangalore preferred / Hybrid as applicable

Experience: 3+ years

Education: B.E/B.Tech in Computer Science, Engineering or a related technical discipline

Salary: Above market standards, flexible for the right candidate

Career growth: Long-term opportunity with potential to lead DevOps architecture and cloud platform operations


About FrontM

FrontM builds software platforms for frontline workforces operating in remote and low-connectivity environments, with a strong focus on the maritime industry. The platform supports communication, collaboration, healthcare, learning, welfare and operational workflows across mobile, web, kiosk and connected device environments.

The platform runs across cloud infrastructure, constrained networks and specialised customer environments, requiring reliable DevOps practices, strong observability, secure architecture and careful operational discipline.


Role Summary

As a Senior DevOps Engineer, you will take ownership of FrontM’s AWS cloud infrastructure, CI/CD pipelines, platform reliability and technical operations. You will work closely with the VP of Delivery, CTO and CEO to maintain secure, scalable and high-availability infrastructure for FrontM’s production systems.

This role requires strong hands-on DevOps experience, broad AWS knowledge, Kubernetes experience and the ability to troubleshoot complex networking and production issues across multi-domain SaaS environments.


Key Responsibilities

Cloud Infrastructure & DevOps Architecture (≈45%)

· Own, maintain and improve AWS cloud infrastructure for FrontM platforms

· Create and maintain Terraform scripts for infrastructure deployment and management

· Manage Kubernetes workloads deployed within AWS EKS

· Support multi-zone AWS infrastructure design for availability, resilience and scale

· Maintain AWS services including Route 53, EC2, API Gateway, VPC, VPN, AWS Cognito, ElastiCache, DynamoDB and Lambda

· Contribute to DevOps architecture planning in line with FrontM’s platform roadmap

CI/CD, Operations & Platform Reliability (≈35%)

· Build, maintain and improve CI/CD pipelines for backend and platform services

· Oversee technical operations with hands-on administration, monitoring and release support

· Ensure continuous server uptime, stability, performance and maintainability

· Debug, respond to and restore system outages in production and staging environments

· Improve observability across infrastructure and applications, including migration from Elastic stack to logz.io

· Support backend stability, scale and performance across Node.js, Java and related services

Security, Networking & Production Support (≈20%)

· Maintain AWS security configurations, access controls and monitoring practices

· Support complex networking requirements across multi-domain SaaS implementations

· Troubleshoot network, infrastructure and access issues with internal teams and customer-side users

· Work with backend teams to support API integrations and infrastructure abstractions for complex requirements

· Document operational procedures, incident findings and technical support steps clearly


Required Technical Skills

Cloud Infrastructure & AWS

· Strong hands-on experience with AWS infrastructure and cloud operations

· Experience with Route 53, EC2, API Gateway, VPC, VPN, AWS Cognito, ElastiCache, DynamoDB and Lambda

· Experience with AWS security setup, monitoring and multi-zone infrastructure

· Ability to manage infrastructure using Terraform

Kubernetes, CI/CD & Observability

· Strong experience with Kubernetes, preferably AWS EKS

· Extensive CI/CD and DevOps experience

· Experience with infrastructure observability and application monitoring tools

· Ability to diagnose production bottlenecks, server failures and performance issues

Backend, Networking & SaaS Operations

· Experience supporting Node.js, Java and backend system procedures for stability and scale

· Good understanding of APIs, integrations and backend service dependencies

· Experience with complex networking and multi-domain SaaS implementations

· Ability to troubleshoot technical issues with non-technical end users

Nice to Have

· Experience with MongoDB clusters in MongoDB Atlas

Personal Attributes

· Strong ownership mindset for uptime, reliability and production stability

· Practical problem-solving approach with the ability to act quickly during incidents

· Clear written and spoken communication in English

· Ability to work independently and coordinate with senior management when required

· Comfortable working in fast-moving engineering teams

· Attention to detail in security, monitoring, documentation and operational processes


Why join FrontM?

Long-Term Career Growth

Opportunity to work on cloud infrastructure used by global maritime and remote workforce customers, with scope to grow into DevOps architecture and platform leadership roles.

Engineering Challenges That Matter

Work on infrastructure that supports applications used in remote, low-bandwidth and operationally demanding environments.

Broad Technical Ownership

Take responsibility across cloud infrastructure, Kubernetes, CI/CD, observability, networking, security and production reliability.


Apply now

Join a team focused on building reliable software infrastructure for real-world use cases and contribute to systems used across the global maritime workforce.

Searce Inc

at Searce Inc

3 recruiters
Mohammed Rabidheen
Posted by Mohammed Rabidheen
Bengaluru (Bangalore)
3 - 5 yrs
Best in industry
Kubernetes
Google Cloud Platform (GCP)

Your Mission: The Role

solving for better.

You are a reliability-owning, hands-on solver. Not just a "break-fix engineer."

As a DRI (directly responsible individual) for our clients' most critical systems, you’ll be the go-to expert within the squad that ensures their environments are secure, reliable, and optimized 24/7. You will deliver measurable impact – improved uptime, faster response times, and real cost savings. Not just closed tickets. Not just alerts. Real outcomes you engineer yourself.

You will lead the charge on technical execution, from complex troubleshooting and root cause analysis to engineering proactive, automated solutions. This role is about building the future of reliable cloud operations and shipping it into today's production environments.


Your Responsibilities

what you will wake up to solve.

This isn’t a “manage tickets” role. You are the architect, the executioner and the DRI for our Cloud Managed Services GTM, deploying solutions that turn operational noise into hardened outcomes. Here’s how you’ll make your mark:

  • Own Service Reliability: You will be the go-to technical expert for 24/7 cloud operations and incident management. You'll ensure strict adherence to SLOs by getting your hands dirty, leading high-stakes troubleshooting to deliver a superior client experience.
  • Engineer the Blueprint: You'll translate client needs into scalable, automated, and secure cloud architectures. You will write and maintain the operational playbooks and Infrastructure as Code (IaC) that your squad uses every day.
  • Automate with Intelligence: You'll lead the charge from the keyboard to futurify our operations. You'll embed AI-driven automation, predictive monitoring, and AIOps into core processes to eliminate toil and preempt incidents.
  • Drive FinOps & Impact: You'll own the technical execution of the FinOps framework. You will continuously analyze, configure, and optimize cloud spend for clients through hands-on engineering.
  • Be the Expert in the Room: You'll share your knowledge through internal demos, documentation, and technical deep dives, representing the deep expertise that turns operational complexity into business resilience.
  • Mentor & Elevate: You will be a technical mentor for your peers. Through code reviews and collaborative problem-solving, you'll help build a high-performing squad that lives the “Always Hardened” mindset.


Experience & Relevance

We are looking for future technology leaders, not just coders. We value raw intelligence, analytical rigor, and an obsessive passion for technology over any prior experience.

  • Cloud Operations Pedigree: 3+ years of experience in GCP cloud infrastructure, with a significant portion in cloud managed services.
  • Commercial Acumen: Proven track record of building and scaling a net-new managed services business.
  • Client-Facing Tech Acumen: 2+ years of experience in a client-facing technical role, acting as the trusted advisor for cloud operations, security, and reliability.


Functional Skills:

  • Service Delivery Mindset: A deep understanding of MSP business models, SLAs, and the importance of client satisfaction in an operational context.
  • Client Engagement: Ability to ask appropriate questions to get to the heart of an operational issue and win trust with stakeholders.
  • Cross-Functional Catalyst: Thrive in multi-disciplinary teams, bringing together operations, security, and development teams.
  • Repository builder: Creates reusable frameworks, IaC modules, and operational playbooks for scale.


Join the ‘real solvers’

ready to futurify?

If you are excited by the possibilities of what an AI-native, engineering-led, modern tech consultancy can do to futurify businesses, apply here and experience the ‘Art of the possible’. Don’t Just Send a Resume. Send a Statement.

Bengaluru (Bangalore), Mumbai, Delhi, Gurugram, Noida, Ghaziabad, Faridabad, Pune, Hyderabad, Chennai, Ahmedabad
4 - 6 yrs
₹8L - ₹15L / yr
ASP.NET
.net core
mvc
C#
SQL
+13 more

Position: Microsoft .NET Full Stack Developer

Experience: 4–6 Years

Open Positions: 10

Location: PAN India (Final Round – Face-to-Face Interview)

Budget: Up to 15 LPA

Notice Period: Immediate joiners preferred

Key Responsibilities:

· Work on highly distributed and scalable system architecture

· Design, develop, test, and maintain high-quality software solutions

· Ensure performance, security, and maintainability of applications

· Collaborate with cross-functional teams and stakeholders

· Perform system testing and resolve technical issues


Required Skills:

· Strong experience in ASP.NET, C#, .NET Core, MVC

· Hands-on experience with SQL Server / PostgreSQL

· Experience in Angular / React (Frontend technologies)

· Knowledge of microservices architecture & RESTful APIs

· Familiarity with CQRS pattern

· Exposure to AWS / Docker / Kubernetes

· Experience with CI/CD pipelines (Azure DevOps, Jenkins)

· Knowledge of Node.js is an added advantage

· Understanding of Agile methodology

· Good exposure to cybersecurity and compliance


Technology Stack:

· Microsoft .NET technologies (primary)

· Cloud platforms: AWS (SaaS/PaaS/IaaS)

· Databases: MSSQL, MongoDB, PostgreSQL

· Caching: Redis, Memcached

· Messaging queues: RabbitMQ, Kafka, SQS

 

Leading provider of Capital Market solutions in India


Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
4 - 7 yrs
₹12L - ₹18L / yr
Python
Go Programming (Golang)
Docker
Kubernetes
Object Oriented Programming (OOPs)
+2 more

Core Responsibilities:

  • Design & Development: Architect and implement scalable backend services and APIs using Python or Golang, ensuring high performance, resilience, and extensibility.
  • System Ownership: Take end-to-end ownership of critical modules, from design and development to deployment and support.
  • Technical Leadership: Conduct design and code reviews, enforce best practices, and mentor junior engineers to raise the team’s technical bar.
  • Collaboration: Work closely with product managers, architects, and other engineers to translate business requirements into technical solutions.
  • Performance & Reliability: Troubleshoot complex issues in production systems, identify root causes, and design sustainable long-term solutions.
  • Innovation: Evaluate new technologies, contribute to proof-of-concepts, and recommend tools that can improve developer productivity.
  • Process Improvement: Drive initiatives to improve coding standards, CI/CD pipelines, and automated testing practices.
  • Knowledge Sharing: Document designs, create technical guides, and share insights with the broader engineering team.


Experience and Expertise:

  • 4–7 years of backend development experience with Python or Golang.
  • Strong expertise in designing, developing, and scaling microservices and distributed systems.
  • Solid understanding of concurrency, multi-threading, and performance optimization.
  • Proficiency with databases (SQL/NoSQL), caching systems (Redis, Memcached), and messaging systems (Kafka, RabbitMQ, etc.).
  • Hands-on experience with Linux development, Docker, and Kubernetes.
  • Familiarity with cloud platforms (AWS/GCP/Azure) and related services.
  • Strong debugging, profiling, and optimization skills for production-grade systems.
  • Experience with AI-powered development tools is a strong plus; familiarity with concepts like 'agentic coding' for workflow automation or 'context engineering' for leveraging LLMs in system design is highly desirable.


Skills:

  • Strong problem-solving ability, with experience handling complex technical challenges.
  • Ability to lead technical initiatives and mentor junior engineers.
  • Excellent communication skills to collaborate with cross-functional teams and articulate trade-offs.
  • Self-motivated, proactive, and able to operate independently while aligning with team goals.
  • Passionate about engineering culture, quality, and developer productivity.


Leading provider of Capital Market solutions in India


Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
2 - 4 yrs
₹8L - ₹12L / yr
Python
Go Programming (Golang)
Linux/Unix
Docker
Kubernetes
+3 more

Core Responsibilities:

  • Design, develop, and maintain backend services and APIs using Python or Golang.
  • Write high-quality, testable, and maintainable code with a focus on performance and scalability.
  • Implement automated tests and contribute to CI/CD pipelines.
  • Collaborate with product, QA, and DevOps teams for end-to-end feature delivery.
  • Troubleshoot production issues and provide timely resolutions.
  • Participate in design and architecture discussions to improve system efficiency.
  • Contribute to improving development processes, coding standards, and best practices.


Experience and Expertise:

  • 2–4 years of experience in backend development with Python or Golang.
  • Solid understanding of RESTful APIs, microservices, and distributed systems.
  • Strong knowledge of data structures, algorithms, and OOP principles.
  • Hands-on experience with relational and/or NoSQL databases.
  • Familiarity with Linux development, Docker, and basic cloud concepts (AWS/GCP/Azure).
  • Proficiency with Git and version control workflows.
  • Familiarity with AI-powered development tools or exposure to projects involving large language models (LLMs) is a plus.


Skills:

  • Strong analytical and debugging skills with the ability to solve complex problems.
  • Good communication and collaboration skills across teams.
  • Ability to work independently with minimal supervision while being a strong team player.
  • Growth mindset – eagerness to learn new technologies and improve continuously.


Leading provider of Capital Market solutions in India

Agency job
via HyrHub by Neha Koshy
Bengaluru (Bangalore)
1 - 2 yrs
₹2L - ₹7L / yr
Python
Go Programming (Golang)
Docker
Kubernetes
Linux/Unix
+3 more

Core Responsibilities:

  • Design, develop, and maintain backend services using Python or Golang.
  • Write clean, efficient, and well-documented code following best practices.
  • Build and consume RESTful APIs and microservices.
  • Collaborate with QA, DevOps, and product teams for smooth feature delivery.
  • Participate in peer code reviews and technical discussions.
  • Debug and fix issues, ensuring system stability and performance.
  • Continuously learn and apply new technologies and tools in backend development.


Experience and Expertise:

  • 0–2 years of software development experience (internships or projects acceptable).
  • Proficiency in at least one backend programming language (Python or Golang).
  • Strong understanding of object-oriented programming and software fundamentals.
  • Knowledge of data structures, algorithms, and database concepts.
  • Familiarity with Linux-based development environments.
  • Exposure to Git and version control workflows.


Skills:

  • Strong analytical and problem-solving ability.
  • Willingness to learn, adapt, and take ownership.
  • Effective communication and teamwork skills.
  • Curiosity for emerging technologies, including AI-driven development, backend technologies, distributed systems, and modern engineering practices.
LearnTube.ai

Posted by Vinayak Sharan
Remote, Mumbai
3 - 6 yrs
₹14L - ₹32L / yr
Python
FastAPI
Docker
Amazon Web Services (AWS)
SQL
+3 more

Role Overview:


As a Backend Developer at LearnTube.ai, you will ship the backbone that powers 2.3 million learners in 64 countries—owning APIs that crunch 1 billion learning events & the AI that supports it with <200 ms latency.


Skip the wait and get noticed faster by completing our AI-powered screening. It takes only a few minutes and could be your shortcut to landing the job: https://bit.ly/LT_Python


What You'll Do:


At LearnTube, we’re pushing the boundaries of Generative AI to revolutionize how the world learns. As a Backend Engineer, your roles and responsibilities will include:

  • Ship Micro-services – Build FastAPI services that handle ≈ 800 req/s today and will triple within a year (sub-200 ms p95).
  • Power Real-Time Learning – Drive the quiz-scoring & AI-tutor engines that crunch millions of events daily.
  • Design for Scale & Safety – Model data (Postgres, Mongo, Redis, SQS) and craft modular, secure back-end components from scratch.
  • Deploy Globally – Roll out Dockerised services behind NGINX on AWS (EC2, S3, SQS) and GCP (GKE) via Kubernetes.
  • Automate Releases – GitLab CI/CD + blue-green / canary = multiple safe prod deploys each week.
  • Own Reliability – Instrument with Prometheus / Grafana, chase 99.9 % uptime, trim infra spend.
  • Expose Gen-AI at Scale – Publish LLM inference & vector-search endpoints in partnership with the AI team.
  • Ship Fast, Learn Fast – Work with founders, PMs, and designers in weekly ship rooms; take a feature from Figma to prod in < 2 weeks.


What makes you a great fit?


Must-Haves:

  • 3+ yrs Python back-end experience (FastAPI)
  • Strong with Docker & container orchestration
  • Hands-on with GitLab CI/CD, AWS (EC2, S3, SQS) or GCP (GKE / Compute) in production
  • SQL/NoSQL (Postgres, MongoDB); you’ve built systems from scratch and have solid system-design fundamentals

Nice-to-Haves:

  • k8s at scale, Terraform
  • Experience with AI/ML inference services (LLMs, vector DBs)
  • Go / Rust for high-perf services
  • Observability: Prometheus, Grafana, OpenTelemetry


About Us: 


At LearnTube, we’re on a mission to make learning accessible, affordable, and engaging for millions of learners globally. Using Generative AI, we transform scattered internet content into dynamic, goal-driven courses with:

  • AI-powered tutors that teach live, solve doubts in real time, and provide instant feedback.
  • Seamless delivery through WhatsApp, mobile apps, and the web, with over 1.4 million learners across 64 countries.


Meet the Founders: 


LearnTube was founded by Shronit Ladhani and Gargi Ruparelia, who bring deep expertise in product development and ed-tech innovation. Shronit, a TEDx speaker, is an advocate for disrupting traditional learning, while Gargi’s focus on scalable AI solutions drives our mission to build an AI-first company that empowers learners to achieve career outcomes. We’re proud to be recognised by Google as a Top 20 AI Startup and are part of their 2024 Startups Accelerator: AI First Program, giving us access to cutting-edge technology, credits, and mentorship from industry leaders.


Why Work With Us? 


At LearnTube, we believe in creating a work environment that’s as transformative as the products we build. Here’s why this role is an incredible opportunity:

  • Cutting-Edge Technology: You’ll work on state-of-the-art generative AI applications, leveraging the latest advancements in LLMs, multimodal AI, and real-time systems.
  • Autonomy and Ownership: Experience unparalleled flexibility and independence in a role where you’ll own high-impact projects from ideation to deployment.
  • Rapid Growth: Accelerate your career by working on impactful projects that pack three years of learning and growth into one.
  • Founder and Advisor Access: Collaborate directly with founders and industry experts, including the CTO of Inflection AI, to build transformative solutions.
  • Team Culture: Join a close-knit team of high-performing engineers and innovators, where every voice matters, and Monday morning meetings are something to look forward to.
  • Mission-Driven Impact: Be part of a company that’s redefining education for millions of learners and making AI accessible to everyone.
NeoGenCode Technologies Pvt Ltd
Gurugram
2.5 - 6 yrs
₹6L - ₹10L / yr
Java
NodeJS (Node.js)
Spring Boot
Systems design
High-level design
+12 more

Job Title : Backend Engineer (AI-First | FinTech/Crypto)

Experience : 3 to 6 Years

Location : Gurugram (Sector 49)

Working Hours : 10:00 AM – 6:00 PM

Work Mode : On-site | 6 Days Working

Employment Type : Full-time


Role Overview :

This is not a typical ticket-based engineering role. You will take end-to-end ownership of systems—designing architecture, building scalable solutions, and solving real-world performance challenges.

We operate in an AI-first engineering environment, leveraging advanced tools and automation workflows to build high-performance distributed systems.


Mandatory Skills :

Java/Spring Boot or Node.js, System Design (HLD/LLD), Distributed Systems, Event-Driven Architecture (Kafka/RabbitMQ), Low-Latency APIs, PostgreSQL/MongoDB, CI/CD, Docker/Kubernetes, AI-assisted development (Copilot/Cursor/Claude)


Key Responsibilities :

  • Design and build scalable backend systems (Java/Spring Boot, Node.js, or similar).
  • Architect and implement event-driven systems (Kafka, RabbitMQ, pub/sub).
  • Develop secure and reliable financial systems with strong data integrity.
  • Solve scalability and performance challenges in fintech/crypto environments.
  • Own features end-to-end: design → development → deployment → monitoring.
  • Work with real-time data pipelines (WebSockets, streaming, event sourcing).
  • Define service contracts and optimize system architecture.


AI-First Engineering (Must-Have Mindset) :

You will :

  • Use tools like GitHub Copilot, Cursor, and Claude in daily development
  • Follow spec-driven development using structured instructions
  • Review, validate, and ship AI-generated code with strong engineering judgment


Core Requirements :

  • 3+ years of backend development experience.
  • Strong expertise in Java (Spring Boot) or Node.js.
  • Solid understanding of System Design (HLD/LLD, Distributed Systems).
  • Experience with event-driven architectures (Kafka, RabbitMQ, async pipelines).
  • Hands-on experience building low-latency, high-throughput systems.
  • Strong database knowledge (PostgreSQL, MongoDB, etc.).
  • Understanding of security, performance optimization, and reliability.
  • Experience with CI/CD, Git, Docker, Kubernetes.
  • Exposure to React / React Native is a plus.


Good to Have (Differentiators) :

  • Experience in FinTech / Crypto / Web3 / Blockchain.
  • Built systems for trading, payments, or real-time financial data.
  • Experience with AI agents, automation pipelines, or agent-based systems.
  • Exposure to parallel AI workflows (coding / testing / refactoring).
  • Contributions to open source or technical blogs.
  • Experience handling production-scale systems.
Fonada
Noida
10 - 20 yrs
₹20L - ₹35L / yr
Asterisk
IVR
Spring Boot
RabbitMQ
Microservices
+6 more

Role Overview

We are looking for a hands-on Senior Telephony Engineer who actively writes production-grade code and has deep experience with Asterisk-based systems, Java backend development, and high-scale dialler platforms.


Key Responsibilities

This is NOT an architecture-only role; we need someone who can:

  • Write code
  • Debug real-time call issues
  • Build and optimize telephony flows end-to-end


Key Responsibilities (Hands-on Coding Focus)

  • Develop and maintain Asterisk dialplans, AGI scripts, and call flows
  • Build Java-based backend services for telephony control and orchestration
  • Implement and optimize predictive / preview / progressive diallers
  • Integrate the telephony stack with Kafka and RabbitMQ
  • Write scalable code for call routing, retry logic, and queue handling
  • Work directly on SIP signalling, RTP flows, and debugging call issues
  • Handle real-time call events, CDR processing, and logging pipelines
  • Optimize systems for high concurrency (thousands of parallel calls)
  • Debug production issues such as call drops, latency, one-way audio, and SIP failures


Qualifications & Skills

  • Bachelor's degree in Computer Engineering; Master's is a plus.


Telephony (Core Requirement)

  • Strong hands-on experience with Asterisk
  • Deep knowledge of SIP / RTP / VoIP, dialplans, and AGI / AMI
  • Experience building or maintaining dialers (very important)


Backend Development

  • Strong coding skills in Java (Spring Boot preferred)
  • Experience building microservices / APIs
  • Comfortable writing high-performance, low-latency code


Messaging & Event Systems

  • Hands-on experience with Apache Kafka and RabbitMQ
  • Ability to implement event-driven systems


Scaling & Performance

  • Experience handling high call volumes (1000+ concurrent calls)
  • Understanding of multi-threading, queue management, and load handling


Good to Have

  • Experience with predictive dialers
  • Exposure to WebRTC / real-time communication
  • Experience with Docker / Kubernetes
  • Understanding of TRAI / Indian telecom ecosystem
  • Experience with FreeSWITCH (bonus)


What We Are NOT Looking For

  • Pure solution architects who don't code
  • People with only theoretical telecom knowledge
  • Candidates without real dialer / Asterisk production experience


What We Are Looking For

Someone who has:

  • Written real dialplans and backend code
  • Debugged live call issues
  • Worked on production telephony systems
  • A problem solver who can go deep into logs, packets, and code


Impact of the Role

You will directly contribute to building a high-scale telephony + AI voice platform, working on real-time systems that handle thousands of concurrent calls.

NeoGenCode Technologies Pvt Ltd
Posted by Akshay Patil
Bengaluru (Bangalore)
3 - 8 yrs
₹15L - ₹18L / yr
DevOps
Google Cloud Platform (GCP)
Kubernetes
helm
Terraform
+5 more

Job Title : DevOps Engineer

Experience : 3+ Years

Location : Indiranagar, Bengaluru (Work From Office – 5 Days)

Employment Type : Full-Time

Work Timings : 11:00 AM to 7:00 PM IST

Notice Period : Immediate Joiners Preferred


Role Overview :

We are seeking a skilled DevOps Engineer with 3+ years of experience in building and managing scalable cloud-native infrastructure.

The ideal candidate will have strong expertise in Kubernetes and Helm, along with hands-on experience in deploying and maintaining production-grade systems on cloud platforms.

This role offers an opportunity to work in a high-growth startup environment, contributing to both existing systems and new infrastructure development.


Key Responsibilities :

  • Design, deploy, and manage scalable infrastructure using Kubernetes.
  • Build and maintain CI/CD pipelines for efficient and automated deployments.
  • Manage and optimize cloud environments (preferably GCP).
  • Implement Infrastructure as Code using Helm/Terraform.
  • Monitor system performance and ensure high availability and reliability.
  • Handle bug fixes, system improvements, and performance optimization.
  • Collaborate with engineering teams to design scalable microservices architecture.
  • Implement logging, monitoring, and alerting solutions.
  • Ensure security best practices including IAM, secrets management, and network policies.


Mandatory Skills :

  • Strong hands-on experience with Kubernetes.
  • Expertise in Helm Charts.
  • Experience with Google Cloud Platform (GCP).
  • Hands-on experience with ArgoCD or similar CI/CD tools.
  • Knowledge of CI/CD tools like Jenkins, GitHub Actions, GitLab CI.
  • Experience in database hosting and scaling.


Nice to Have :

  • Exposure to other cloud platforms (AWS/Azure).
  • Experience with modern DevOps and automation tools.
  • Ability to quickly learn and adapt to new technologies.


Team & Work Scope :

  • No dedicated DevOps team currently – high ownership role.
  • Work on both existing systems (maintenance & improvements) and new system builds (greenfield projects).
  • Opportunity to shape DevOps practices and infrastructure from scratch.


Preferred Candidate Profile :

  • 3+ years of relevant DevOps experience.
  • Strong problem-solving and debugging skills.
  • Experience working in fast-paced startup environments.
  • Understanding of scalability, security, and performance optimization.
  • Good communication and collaboration skills.

Hiring Process :

  1. Profile Screening
  2. GT Assessment
  3. Technical Interview – Round 1
  4. Technical Interview – Round 2
  5. Final Round (if required, with the US team)
Mumbai, Pune, Bengaluru (Bangalore), Hyderabad, Delhi, Gurugram, Noida, Chennai, Ahmedabad
4 - 8 yrs
₹8L - ₹12L / yr
ASP.NET
.NET Core
MVC
C#
SQL server
+22 more

Key Responsibilities:

• Work on highly distributed and scalable system architecture

• Design, develop, test, and maintain high-quality software solutions

• Ensure performance, security, and maintainability of applications

• Collaborate with cross-functional teams and stakeholders

• Perform system testing and resolve technical issues

Required Skills:

• Strong experience in ASP.NET, C#, .NET Core, MVC

• Hands-on experience with SQL Server / PostgreSQL

• Experience in Angular / React (Frontend technologies)

• Knowledge of microservices architecture & RESTful APIs

• Familiarity with CQRS pattern

• Exposure to AWS / Docker / Kubernetes

• Experience with CI/CD pipelines (Azure DevOps, Jenkins)

• Knowledge of Node.js is an added advantage

• Understanding of Agile methodology

• Good exposure to cybersecurity and compliance

Technology Stack:

• Microsoft .NET technologies (primary)

• Cloud platforms: AWS (SaaS/PaaS/IaaS)

• Databases: MSSQL, MongoDB, PostgreSQL

• Caching: Redis, Memcached

• Messaging queues: RabbitMQ, Kafka, SQS


AI-Powered Platform

Agency job
via Peak Hire Solutions by Dharati Thakkar
Remote only
5 - 10 yrs
₹35L - ₹45L / yr
Machine Learning (ML)
Python
Artificial Intelligence (AI)
Natural Language Processing (NLP)
Scikit-Learn
+10 more

Budget: 35 LPA to 45 LPA

Work schedule is Mon to Fri, 3:30am to 12:30pm IST


Key Responsibilities:

  • Design, develop, and deploy computer vision and machine learning models for analyzing visual and document-based data.
  • Build pipelines that convert unstructured visual inputs into structured and usable information.
  • Develop and evaluate models for tasks such as object detection, segmentation, document parsing, and image understanding.
  • Apply OCR and related techniques to extract meaningful information from complex documents and imagery.
  • Work with large datasets and build efficient training and evaluation pipelines.
  • Handle real-world visual datasets that may contain noise, inconsistencies, incomplete information, or varying formats.
  • Experiment with different approaches to solve challenging computer vision problems and evaluate tradeoffs between accuracy, performance, and complexity.
  • Collaborate with product and engineering teams to integrate machine learning models into scalable production systems.
  • Continuously improve model performance, accuracy, and robustness in real-world environments.
  • Stay up to date with the latest developments in AI and computer vision and apply relevant techniques where appropriate.
  • Actively leverage modern AI tools and frameworks to accelerate experimentation, development, and engineering workflows.


Requirements:

  • 5+ years of hands-on experience building and deploying machine learning models, particularly in Computer Vision or document understanding.
  • Strong proficiency in Python for machine learning and data processing.
  • Hands-on experience with modern ML frameworks such as PyTorch and libraries in the Hugging Face ecosystem.
  • Experience with computer vision tooling such as OpenCV.
  • Experience with common ML and data science libraries such as scikit-learn, NumPy, and Pandas.
  • Experience developing models for tasks such as segmentation, object detection, or document analysis.
  • Experience working with large image datasets and building training pipelines.
  • Solid understanding of model evaluation, data preprocessing, and performance optimization.
  • Strong problem-solving skills and ability to work in a fast-paced product environment.
  • Ability to collaborate effectively with cross-functional engineering and product teams.
  • The candidate should be based in India
  • Willing to work remotely full-time
  • Work schedule is Mon to Fri, 3:30am to 12:30pm IST


Preferred Qualifications:

  • Experience with TensorFlow or other deep learning frameworks.
  • Experience working with OCR pipelines or document analysis systems.
  • Experience deploying machine learning models in production environments.
  • Experience with containerized deployments such as Docker or Kubernetes.
  • Experience working with complex technical documents, diagrams, or structured visual data.
  • Familiarity with spatial or geometry-related data problems.
  • Experience with libraries such as Detectron2, MMDetection, or similar.
  • Familiarity with frameworks used to integrate modern AI models into applications (e.g., LangChain or similar tooling).
  • Contributions to open-source ML or computer vision projects are a plus.


Additional Information:

  • The problems we work on involve complex visual and document-based data, so we value engineers who enjoy tackling challenging technical problems and experimenting with different approaches to reach practical solutions.
  • Candidates are required to include links to relevant projects, GitHub repositories, research work, or examples of machine learning systems they have built.


Benefits:

  • Flexible remote work opportunities with career development opportunities
  • Engagement with a supportive and collaborative global team
  • Competitive market-based salary
Bengaluru (Bangalore)
4 - 10 yrs
₹1L - ₹10L / yr
.NET
SSO
ASP.NET
ASP.NET MVC
MySQL
+16 more

Dear Candidates,


We have an urgent requirement for a Technical Lead – Full Stack role based in Bangalore. Please find the details below:


Work Location (WFO):

Nagar, Bengaluru, Karnataka


Interview Process:

L1 Interview – Face-to-Face at Office


Experience Required:

4-6 Years (minimum 1+ years in a Technical Leadership role)


Role Overview:

The candidate will lead the technical vision and architecture of a compliance platform by designing scalable, secure, and high-performance systems. The role involves driving full-stack development across .NET and open-source technologies, enabling unified AI Agent capabilities, Single Sign-On (SSO), and a One-UI experience.

Key Responsibilities:

  • Define and own end-to-end architecture including micro-frontends, .NET services, FastAPI APIs, and microservices
  • Lead full-stack development using .NET and modern open-source technologies
  • Modernize legacy systems (ASP.NET, .NET Core, MS SQL Server) to cloud-native architecture
  • Design and implement AI Agents, SSO, and unified UI experiences
  • Manage sprint planning, backlogs, and collaborate with Product Owners
  • Implement CI/CD pipelines using Jenkins and GitHub Actions
  • Drive containerization and orchestration using Docker & Kubernetes
  • Ensure secure deployments and cloud infrastructure management
  • Establish engineering best practices, code reviews, and architecture governance
  • Mentor teams on Clean Architecture, SOLID principles, and DevOps practices

Required Skills:

  • ReactJS, FastAPI, Python, REST/GraphQL
  • ASP.NET, MVC, .NET Core, Entity Framework, MS SQL Server
  • Strong experience in Microservices Architecture
  • DevOps: CI/CD, Jenkins, GitOps, Docker, Kubernetes
  • Cloud Platforms: AWS / Azure / GCP
  • AI/ML & LLM tools: OpenAI, Llama, LangChain, etc.
  • Security: RBAC, API security, secrets management

Qualifications:

  • BE / BTech in Computer Science
Searce Inc

Posted by Reena Bandekar
Pune, Gurugram, Bengaluru (Bangalore), Hyderabad
5 - 12 yrs
₹15L - ₹28L / yr
DevOps
Kubernetes
Incident management
Observability
Reliability engineering
+4 more

Lead Cloud Reliability Engineer


Job Responsibilities

● Lead and manage the Cloud Reliability teams to provide strong Managed Services support to end-customers.

● Isolate, troubleshoot and resolve issues reported by CMS clients in their cloud environment

● Drive the communication with the customer providing details about the issue, current steps, next plan of action, ETA

● Gather client's requirements related to use of specific cloud services and provide assistance in setting them up and resolving issues

● Create SOPs and knowledge articles for use by the L1 teams to resolve common issues

● Identify recurring issues, perform root cause analysis and propose/implement preventive actions

● Follow change management procedure to identify, record and implement changes

● Plan and deploy OS, security patches in Windows/Linux environment and upgrade k8s clusters

● Identify the recurring manual activities and contribute to automation

● Provide technical guidance and educate team members on development and operations. Monitor metrics and develop ways to improve.

● System troubleshooting and problem-solving across platform and application domains. Ability to use a wide variety of open-source technologies and cloud services.

● Build, maintain, and monitor configuration standards.

● Ensuring critical system security through using best-in-class cloud security solutions.


Qualifications

● 4-7 years experience in Cloud Infrastructure and Operations domains and IT operational experience preferably in a global enterprise environment.

● Specialize in one or two cloud deployment platforms: AWS, GCP

● Hands on experience with AWS/GCP services (EKS, ECS, EC2, VPC, RDS, Lambda, GKE, Compute Engine)

● Understanding of one or more programming languages (Python, JavaScript, Ruby, Java, .Net)

● Logging and Monitoring tools (ELK, Stackdriver, CloudWatch)

● Knowledge of Configuration Management tools such as Ansible, Terraform, Puppet, Chef

● Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Mesos)

● Good analytical, communication, problem solving, and learning skills.

● Knowledge of programming against cloud platforms such as Google Cloud Platform and lean development methodologies.

● Strong service attitude and a commitment to quality.

● Willingness to work in shifts.
