5+ Incident management Jobs in Hyderabad | Incident management Job openings in Hyderabad
Apply to 5+ Incident management Jobs in Hyderabad on CutShort.io. Explore the latest Incident management Job opportunities across top companies like Google, Amazon & Adobe.
🔑 Core Responsibilities
- Troubleshoot Windows OS & Microsoft 365 (Outlook, Teams, OneDrive)
- Manage user accounts & access (Active Directory)
- Handle L1 support issues (hardware, software, network basics)
- Track incidents in ServiceNow/Jira and meet SLAs
- Support user onboarding/offboarding
🧠 Key Skillsets (Must-Have)
- 2–3 years IT Support / Helpdesk experience
- Strong in Windows 10/11 & Office 365
- Basic AWS exposure (mandatory)
- Knowledge of Active Directory & networking (DNS, VPN, Wi-Fi)
- Experience with ticketing tools (ServiceNow/Jira)
- Basic Linux/Unix knowledge
- Good communication & problem-solving skills
We are hiring an L1 IT Support Engineer with 2–3 years of experience in desktop/helpdesk support to provide first-level technical assistance across end-user systems, cloud, and enterprise IT environments.
Key Responsibilities
- Troubleshoot Windows OS and Office 365 issues (Outlook, Teams, OneDrive)
- Manage Active Directory tasks: password resets, access/user management
- Install/configure laptops, desktops, printers, and software
- Perform basic network troubleshooting (Wi-Fi, VPN, DNS, DHCP)
- Support AWS CloudWatch alerts and basic Linux troubleshooting
- Handle patching, RCA, documentation, and SOP updates
- Manage tickets in ServiceNow/Jira and meet SLA timelines
- Support onboarding/offboarding and escalate complex issues to L2
Required Skills
- 2–3 years in IT Support / Helpdesk / Desktop Support
- Strong in Windows 10/11, Office 365, Active Directory
- Basic exposure to AWS / CloudWatch and Linux/Unix
- Familiarity with ServiceNow/Jira, ITIL/SLA processes
- Knowledge of SIP/VoIP basics is a plus
- Strong communication and troubleshooting skills
Lead Cloud Reliability Engineer
Job Responsibilities
● Lead and manage the Cloud Reliability teams to provide strong Managed Services support to end-customers.
● Isolate, troubleshoot and resolve issues reported by CMS clients in their cloud environment
● Drive the communication with the customer providing details about the issue, current steps, next plan of action, ETA
● Gather client's requirements related to use of specic cloud services and provide assistance in seing them up and resolving issues
● Create SOPs and knowledge articles for use by the L1 teams to resolve common issues
● Identify recurring issues, perform root cause analysis and propose/implement preventive actions
● Follow change management procedure to identify, record and implement changes
● Plan and deploy OS, security patches in Windows/Linux environment and upgrade k8s clusters
● Identify the recurring manual activities and contribute to automation
● Provide technical guidance and educate team members on development and operations. Monitor metrics and develop ways to improve.
● System troubleshooting and problem-solving across plaorm and application domains. Ability to use a wide variety of open-source technologies and cloud services.
● Build, maintain, and monitor conguration standards.
● Ensuring critical system security through using best-in-class cloud security solutions.
Qualifications
● 4-7 years experience in Cloud Infrastructure and Operations domains and IT operational experience preferably in a global enterprise environment.
● Specialize in one or two cloud deployment platforms: AWS, GCP
● Hands on experience with AWS/GCP services (EKS, ECS, EC2, VPC, RDS, Lambda, GKE, Compute Engine)
● Understanding of one or more programming languages (Python, JavaScript, Ruby, Java, .Net)
● Logging and Monitoring tools (ELK, Stackdriver, CloudWatch)
● Knowledge on Conguration Management tools such as Ansible, Terraform, Puppet, Chef
● Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Mesos)
● Good analytical, communication, problem solving, and learning skills.
● Knowledge on programming against cloud plaorms such as Google Cloud Platform and lean development methodologies.
● Strong service aitude and a commitment to quality.
● Willingness to work in shifts.
REVIEW CRITERIA:
MANDATORY:
- Strong Hands-On AWS Cloud Engineering / DevOps Profile
- Mandatory (Experience 1): Must have 12+ years of experience in AWS Cloud Engineering / Cloud Operations / Application Support
- Mandatory (Experience 2): Must have strong hands-on experience supporting AWS production environments (EC2, VPC, IAM, S3, ALB, CloudWatch)
- Mandatory (Infrastructure as a code): Must have hands-on Infrastructure as Code experience using Terraform in production environments
- Mandatory (AWS Networking): Strong understanding of AWS networking and connectivity (VPC design, routing, NAT, load balancers, hybrid connectivity basics)
- Mandatory (Cost Optimization): Exposure to cost optimization and usage tracking in AWS environments
- Mandatory (Core Skills): Experience handling monitoring, alerts, incident management, and root cause analysis
- Mandatory (Soft Skills): Strong communication skills and stakeholder coordination skills
ROLE & RESPONSIBILITIES:
We are looking for a hands-on AWS Cloud Engineer to support day-to-day cloud operations, automation, and reliability of AWS environments. This role works closely with the Cloud Operations Lead, DevOps, Security, and Application teams to ensure stable, secure, and cost-effective cloud platforms.
KEY RESPONSIBILITIES:
- Operate and support AWS production environments across multiple accounts
- Manage infrastructure using Terraform and support CI/CD pipelines
- Support Amazon EKS clusters, upgrades, scaling, and troubleshooting
- Build and manage Docker images and push to Amazon ECR
- Monitor systems using CloudWatch and third-party tools; respond to incidents
- Support AWS networking (VPCs, NAT, Transit Gateway, VPN/DX)
- Assist with cost optimization, tagging, and governance standards
- Automate operational tasks using Python, Lambda, and Systems Manager
IDEAL CANDIDATE:
- Strong hands-on AWS experience (EC2, VPC, IAM, S3, ALB, CloudWatch)
- Experience with Terraform and Git-based workflows
- Hands-on experience with Kubernetes / EKS
- Experience with CI/CD tools (GitHub Actions, Jenkins, etc.)
- Scripting experience in Python or Bash
- Understanding of monitoring, incident management, and cloud security basics
NICE TO HAVE:
- AWS Associate-level certifications
- Experience with Karpenter, Prometheus, New Relic
- Exposure to FinOps and cost optimization practices
JOB SCOPE
o Lead the incident management process and team involved in resolving the
incident.
o Responding to Sev1 incident, identifying the cause, and initiating the
incident management process.
o Working with delivery teams to prioritizing incidents according to their
urgency and influence on the business.
o Creating knowledgebase that outline incident protocols such as how to
handle cybersecurity threats or how to correct server failures.
o Collaborating with the various teams to ensure that all protocols are
diligently followed.
o Reporting on incident, problem, change, service request issues and
escalating to ensure they are closed ON TIME while ensuring recurring ones
are addressed.
o Adjusting the incident management process as required to ensure its
effectiveness.
o Creating the RCA with help of the delivery teams and ensure that it’s
presented within said time and also ensuring continuous improvement in
SLA, TAT, count etc
o Communicating with upper management if major issues are found in the IT
system.
o Will be the owner of the Unified Helpdesk application and should have the
capability to enhance the process, tool further as the need arises.
• REQUIREMENTS
o Bachelor's degree in information technology, engineering, or a related field.
o At least 5+ years of experience working in IT service management, or a
similar role.
o Strong knowledge of IT service management software including ITIL and
COBIT.
o Experience working with IT systems and software such as Manage Now,
Fresh Service, Tivoli, SolarWinds, Nagios XI, etc
o Solid scripting knowledge in languages, such as Shell, SQL, Java, C++ etc.
o Excellent managerial skills and ability to collaborate with team members.
o Ability to analyse a high volume of technical data and work in a fast-paced
environment.
o Strong problem solving, analytical, and time management skills.



