2+ Incident management Jobs in Hyderabad | Incident management Job openings in Hyderabad
Apply to 2+ Incident management Jobs in Hyderabad on CutShort.io. Explore the latest Incident management Job opportunities across top companies like Google, Amazon & Adobe.
🚀 RECRUITING BOND HIRING
Role: CLOUD OPERATIONS & MONITORING ENGINEER - (THE GUARDIAN OF UPTIME)
⚡ THIS IS NOT A MONITORING ROLE
THIS IS A COMMAND ROLE
You don’t watch dashboards.
You control outcomes.
You don’t react to incidents.
You eliminate them before they escalate.
This role powers an AI-driven SaaS + IoT platform where:
---> Uptime is non-negotiable
---> Latency is hunted
---> Failures are never allowed to repeat
Incidents don’t grow.
Problems don’t hide.
Uptime is enforced.
🧠 WHAT YOU’LL OWN
(Real Work. Real Impact.)
🔍 Total Observability
---> Real-time visibility across cloud, application, database & infrastructure
---> High-signal dashboards (Grafana + cloud-native tools)
---> Performance trends tracked before growth breaks systems
🚨 Smart Alerting (No Noise)
---> Alerts that fire only when action is required
---> Zero false positives. Zero alert fatigue
Right signal → right person → right time
⚙ Automation as a Weapon
---> End-to-end automation of operational tasks
---> Standardized logging, metrics & alerting
---> Systems that scale without human friction
🧯 Incident Command & Reliability
---> First responder for critical incidents (on-call rotation)
---> Root cause analysis across network, app, DB & storage
Fix fast — then harden so it never breaks the same way again
📘 Operational Excellence
---> Battle-tested runbooks
---> Documentation that actually works under pressure
Every incident → a stronger platform
🛠️ TECHNOLOGIES YOU’LL MASTER
☁ Cloud: AWS | Azure | Google Cloud
📊 Monitoring: Grafana | Metrics | Traces | Logs
📡 Alerting: Production-grade alerting systems
🌐 Networking: DNS | Routing | Load Balancers | Security
🗄 Databases: Production systems under real pressure
⚙ DevOps: Automation | Reliability Engineering
🎯 WHO WE’RE LOOKING FOR
Engineers who take uptime personally.
You bring:
---> 3+ years in Cloud Ops / DevOps / SRE
---> Live production SaaS experience
---> Deep AWS / Azure / GCP expertise
---> Strong monitoring & alerting experience
---> Solid networking fundamentals
---> Calm, methodical incident response
---> Bonus (Highly Preferred):
---> B2B SaaS + IoT / hybrid platforms
---> Strong automation mindset
---> Engineers who think in systems, not tickets
💼 JOB DETAILS
📍 Bengaluru
🏢 Hybrid (WFH)
💰 (Final CTC depends on experience & interviews)
🌟 WHY THIS ROLE?
Most cloud teams manage uptime. We weaponize it.
Your work won’t just keep systems running — it will keep customers confident, operations flawless, and competitors wondering how it all works so smoothly.
📩 APPLY / REFER : 🔗 Know someone who lives for reliability, observability & cloud excellence?
JOB SCOPE
o Lead the incident management process and team involved in resolving the
incident.
o Responding to Sev1 incident, identifying the cause, and initiating the
incident management process.
o Working with delivery teams to prioritizing incidents according to their
urgency and influence on the business.
o Creating knowledgebase that outline incident protocols such as how to
handle cybersecurity threats or how to correct server failures.
o Collaborating with the various teams to ensure that all protocols are
diligently followed.
o Reporting on incident, problem, change, service request issues and
escalating to ensure they are closed ON TIME while ensuring recurring ones
are addressed.
o Adjusting the incident management process as required to ensure its
effectiveness.
o Creating the RCA with help of the delivery teams and ensure that it’s
presented within said time and also ensuring continuous improvement in
SLA, TAT, count etc
o Communicating with upper management if major issues are found in the IT
system.
o Will be the owner of the Unified Helpdesk application and should have the
capability to enhance the process, tool further as the need arises.
• REQUIREMENTS
o Bachelor's degree in information technology, engineering, or a related field.
o At least 5+ years of experience working in IT service management, or a
similar role.
o Strong knowledge of IT service management software including ITIL and
COBIT.
o Experience working with IT systems and software such as Manage Now,
Fresh Service, Tivoli, SolarWinds, Nagios XI, etc
o Solid scripting knowledge in languages, such as Shell, SQL, Java, C++ etc.
o Excellent managerial skills and ability to collaborate with team members.
o Ability to analyse a high volume of technical data and work in a fast-paced
environment.
o Strong problem solving, analytical, and time management skills.

