4+ Reliability engineering Jobs in Mumbai | Reliability engineering Job openings in Mumbai
Apply to 4+ Reliability engineering Jobs in Mumbai on CutShort.io. Explore the latest Reliability engineering Job opportunities across top companies like Google, Amazon & Adobe.

We are hiring a Site Reliability Engineer (SRE) to join our high-performance engineering team. In this role, you'll be responsible for driving reliability, performance, scalability, and security across cloud-native systems while bridging the gap between development and operations.
Key Responsibilities
- Design and implement scalable, resilient infrastructure on AWS
- Take ownership of the SRE function – availability, latency, performance, monitoring, incident response, and capacity planning
- Partner with product and engineering teams to improve system reliability, observability, and release velocity
- Set up, maintain, and enhance CI/CD pipelines using Jenkins, GitHub Actions, or AWS CodePipeline
- Conduct load and stress testing, identify performance bottlenecks, and implement optimization strategies
Required Skills & Qualifications
- Proven hands-on experience in cloud infrastructure design (AWS strongly preferred)
- Strong background in DevOps and SRE principles
- Proficiency with performance testing tools like JMeter, Gatling, k6, or Locust
- Deep understanding of cloud security and best practices for reliability engineering
- AWS Solution Architect Certification – Associate or Professional (preferred)
- Solid problem-solving skills and a proactive approach to systems improvement
Why Join Us?
- Work with cutting-edge technologies in a cloud-native, fast-paced environment
- Collaborate with cross-functional teams driving meaningful impact
- Hybrid work culture with flexibility and autonomy
- Open, inclusive work environment focused on innovation and excellence

• Run the production environment by monitoring availability and taking a holistic view of
system health
• Build software and systems to manage platform infrastructure and applications
• Improve reliability, quality, and time-to-market of our suite of software solutions
• Measure and optimize system performance, with an eye toward pushing our capabilities
forward, getting ahead of customer needs, and innovating to continually improve
• Provide primary operational support and engineering for multiple large distributed
software applications
• Drive cross-team alignment across development teams around reliability initiatives
The ideal candidate must -
• Bachelor’s degree in computer science or other highly technical, scientific discipline
• Ability to program (structured and OO) with one or more high level languages, such as
Python, Java, C/C++, Ruby, and JavaScript
• Good experience with microservices architecture and serverless technologies
• Exposure to event driven architecture and state machines
• A proactive approach to spotting problems, areas for improvement, and performance
bottlenecks

My client is a leader in the Ed Tech space
Job Title: Sr. Reliability Engineer/ Engineering Manager
Location: Powai (Mumbai)/ Bangaluru/ Delhi NCR
- System maintenance and administration of Applications and Software
- Integration with different systems and apps
- Must have knowledge on Microservices
- Should be proficient in scripting language Python, Good to have knowledge on Java
- Excellent over Root Cause Analysis
- Should be a technical as well as process-oriented candidate
- Excellent Debugging and Monitoring skills
- Responsible for daily trouble ticket resolution, client interaction and customer support via email and video calls
- Responsible for the setup for the Continuous Integration build and deployment for DEV and UAT environments
- Responsible for operational support and problem resolution for application users and internal operations team
- Identifying, tracking, managing and resolving project issues effectively and efficiently
- Supporting projects during the normalization phase after Go-Live and ensuring the smooth transitions to operations
- Review and implement the processes related to IT Support and Operations before handing over to Support Services
- Strong ability to work independently on complex issues
- Collaborate efficiently with internal experts to resolve customer issues quickly
- Both Proactive & reactive in work approach
- Should be willing to work in 24/7 work environment
Ideal candidate has:
- 6-10 years
- Product Background
Requirements
Technical Skills
- Ability to solution & deliver all of Operations/SRE services & processes including managing L2 Environment Support
- 5-12 years of overall environment support experience with 5+ years of experience as support / SRE engineer
- Experience in implementing Monitoring solutions using APM tools( Example: AppDynamics, Graylog, Dynatrace, Datadog etc.) set up and test proactive monitoring alerts
- Have a broad knowledge profile and really excel in some areas, such as HTTP/TLS, DNS, networking or containerization
- Comfortable with large scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.
Process Skills
- Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- Interest in designing, analyzing and troubleshooting large-scale distributed systems.
Behavioral Skills
- Practice sustainable incident response and blameless postmortems.
- Proven ability in developing relationships with stakeholders, communicating project/program status, and understanding detailed business requirements across multiple project initiatives
- This role requires candidates to work in rotational shifts. 24*7 support
Benefits
LOCATION: Mumbai
COMPENSATION: Competitive
WHY ZYCUS? :
- Be a part of one of the fastest growing product Company in India
- Come join a young, dynamic & enterprising team
- Work on the latest technologies
- Flexible working hours (As per business requirement).
Zycus Global Leader Procurement: https://www.zycus.com/newsroom/press-releases.html" target="_blank">https://www.zycus.com/newsroom/press-releases.html