Senior DevOps Engineer (SRE)
Posted by HR Infra360
3 - 8 yrs
₹10L - ₹15L / yr
Gurugram
Skills
Docker
Kubernetes
DevOps
Amazon Web Services (AWS)
Windows Azure
Google Cloud Platform (GCP)
Helm

Please Apply - https://zrec.in/7EYKe?source=CareerSite


About Us

Infra360 Solutions is a services company specializing in Cloud, DevSecOps, Security, and Observability solutions. We help technology companies adopt a DevOps culture by focusing on a long-term DevOps roadmap. We identify the technical and cultural issues that stand in the way of implementing DevOps practices and work with the respective teams to fix them and increase overall productivity. We also run training sessions for developers on the importance of DevOps. Our services include DevOps, DevSecOps, FinOps, Cost Optimization, CI/CD, Observability, Cloud Security, Containerization, Cloud Migration, Site Reliability, Performance Optimization, SIEM and SecOps, Serverless Automation, Well-Architected Review, MLOps, and Governance, Risk & Compliance. We assess technology architecture, security, governance, compliance, and DevOps maturity for technology companies and help them optimize cloud costs, streamline their architecture, and set up processes that improve the availability and reliability of their websites and applications. We set up tools for monitoring, logging, and observability. We focus on bringing the DevOps culture to the organization to improve its efficiency and delivery.


Job Description

Job Title: Senior DevOps Engineer / SRE

Department: Technology

Location: Gurgaon

Work Mode: On-site

Working Hours: 10 AM - 7 PM

Terms: Permanent

Experience: 4-6 years

Education: B.Tech/MCA

Notice Period: Immediately

About Us

At Infra360.io, we are a next-generation cloud consulting and services company committed to delivering comprehensive, 360-degree solutions for cloud, infrastructure, DevOps, and security. We partner with clients to transform and optimize their technology landscape, ensuring resilience, scalability, cost efficiency and innovation.

Our core services include Cloud Strategy, Site Reliability Engineering (SRE), DevOps, Cloud Security Posture Management (CSPM), and related Managed Services. We specialize in driving operational excellence across multi-cloud environments, helping businesses achieve their goals with agility and reliability.

We thrive on ownership, collaboration, problem-solving, and excellence, fostering an environment where innovation and continuous learning are at the forefront. Join us as we expand and redefine what’s possible in cloud technology and infrastructure.


Role Summary

We are seeking a Senior DevOps Engineer (SRE) to manage and optimize large-scale, mission-critical production systems. The ideal candidate will have a strong problem-solving mindset, extensive experience in troubleshooting, and expertise in scaling, automating, and enhancing system reliability. This role requires hands-on proficiency in tools like Kubernetes, Terraform, CI/CD, and cloud platforms (AWS, GCP, Azure), along with scripting skills in Python or Go. The candidate will drive observability and monitoring initiatives using tools like Prometheus, Grafana, and APM solutions (Datadog, New Relic, OpenTelemetry).

Strong communication, incident management skills, and a collaborative approach are essential. Experience in team leadership and multi-client engagement is a plus.


Ideal Candidate Profile


  • 4-6 years of solid experience as an SRE/DevOps engineer, with a proven track record of handling large-scale production environments
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • Strong hands-on experience managing large-scale production systems
  • Strong production troubleshooting skills and the ability to handle high-pressure situations
  • Strong experience with databases and data platforms (PostgreSQL, MongoDB, Elasticsearch, Kafka)
  • Experience making production systems more scalable, highly available, and fault-tolerant
  • Hands-on experience with ELK or other logging and observability tools
  • Hands-on experience with Prometheus, Grafana, and Alertmanager, and with on-call tooling such as PagerDuty
  • Problem-solving mindset
  • Strong skills in K8s, Terraform, Helm, ArgoCD, AWS/GCP/Azure, etc.
  • Good with Python/Go scripting and automation
  • Strong fundamentals in DNS, networking, and Linux
  • Experience with APM tools such as New Relic, Datadog, and OpenTelemetry
  • Good experience with incident response, incident management, and writing detailed RCAs
  • Experience with application best practices for making apps more reliable and fault-tolerant
  • Strong leadership skills and the ability to mentor team members and provide guidance on best practices
  • Able to manage multiple clients and take ownership of client issues
  • Experience with Git and coding best practices


Good to have

  • Team-leading Experience
  • Multiple Client Handling
  • Requirements gathering from clients
  • Good Communication


Key Responsibilities


  1. Design and Development:
       • Architect, design, and develop high-quality, scalable, and secure cloud-based software solutions.
       • Collaborate with product and engineering teams to translate business requirements into technical specifications.
       • Write clean, maintainable, and efficient code, following best practices and coding standards.
  2. Cloud Infrastructure:
       • Develop and optimise cloud-native applications, leveraging cloud services like AWS, Azure, or Google Cloud Platform (GCP).
       • Implement and manage CI/CD pipelines for automated deployment and testing.
       • Ensure the security, reliability, and performance of cloud infrastructure.
  3. Technical Leadership:
       • Mentor and guide junior engineers, providing technical leadership and fostering a collaborative team environment.
       • Participate in code reviews, ensuring adherence to best practices and high-quality code delivery.
       • Lead technical discussions and contribute to architectural decisions.
  4. Problem Solving and Troubleshooting:
       • Identify, diagnose, and resolve complex software and infrastructure issues.
       • Perform root cause analysis for production incidents and implement preventative measures.
  5. Continuous Improvement:
       • Stay up-to-date with the latest industry trends, tools, and technologies in cloud computing and software engineering.
       • Contribute to the continuous improvement of development processes, tools, and methodologies.
       • Drive innovation by experimenting with new technologies and solutions to enhance the platform.
  6. Collaboration:
       • Work closely with DevOps, QA, and other teams to ensure smooth integration and delivery of software releases.
       • Communicate effectively with stakeholders, including technical and non-technical team members.
  7. Client Interaction & Management:
       • Serve as a direct point of contact for multiple clients.
       • Handle the unique technical needs and challenges of two or more clients concurrently.
       • Combine direct interaction with clients and internal team coordination.
  8. Production Systems Management:
       • Bring extensive experience in managing, monitoring, and debugging production environments.
       • Troubleshoot complex issues and ensure that production systems run smoothly with minimal downtime.

About Infra360 Solutions Pvt Ltd

Founded: 2022
Type: Services
Size: 0-20
Stage: Bootstrapped

About

At Infra360.io, we are a next-generation cloud consulting and services company committed to delivering comprehensive, 360-degree solutions for Cloud, Infrastructure, DevOps, MLOps and Security. We partner with clients to modernize and optimize their cloud, ensuring resilience, scalability, cost efficiency and innovation.



Similar jobs

Virtana
Posted by Krutika Devadiga
Pune
5 - 9 yrs
Best in industry
Google Cloud Platform (GCP)
DevOps
Shell Scripting
Python
Kubernetes

Role Overview:

Virtana is looking for a Senior DevOps Engineer to join our R&D Infrastructure team. In this role, you won't just follow conventions — you'll help redefine them. You will own the architecture, build, and day-to-day operations of the GCP-based cloud platform that powers Virtana's SaaS products and the AI-driven observability experience our Global 2000 customers depend on. This is a hands-on senior individual contributor role with meaningful technical leadership scope, working alongside engineers and architects on a unified observability platform.


Work Location: Pune


Job Type: Hybrid


Role Responsibilities:

  • GCP Cloud Operations: Develop, deploy, operate, and support production cloud infrastructure primarily on GCP — leveraging GKE, BigTable, BigQuery, Dataflow, Cloud Storage, IAM, and core networking services.
  • Reliability & SLAs: Ensure production systems are running at all times with multiple levels of redundancy to meet committed SLAs; lead incident response, root cause analysis, and post-incident reviews.
  • Build & Release Automation: Design, implement, and continuously improve scalable CI/CD pipelines and test frameworks leveraged by QA and development teams across the company.
  • Infrastructure as Code: Manage large-scale, repeatable deployments using Terraform, Ansible, Puppet, or SaltStack; champion Git-based workflows and version control standards for distributed engineering teams.
  • Security & Availability: Own the ongoing maintenance, security, patching, and availability of services in line with tight operations, security, and procedural models.
  • Monitoring & Alerting: Plan and deliver high-value monitoring and alerting features to support operations, support, and customer-facing reliability — eating our own dog food with the Virtana Platform wherever possible.
  • Capacity & Cost: Forecast capacity, plan upgrades, patches, and migrations, and drive cloud cost efficiency across hybrid and multi-cloud environments.
  • Cross-Functional Partnership: Work with development, operations, and support personnel to identify, isolate, and diagnose issues; handle support escalations and drive permanent fixes.


Required Qualifications:

  • Bachelor's degree in Computer Science / Engineering or equivalent relevant experience.
  • 5–7 years of professional hands-on DevOps / SRE experience supporting production cloud environments.
  • Strong, demonstrable production experience on GCP — including GKE, BigTable, BigQuery, Dataflow, IAM, and core GCP networking services.
  • Deep, hands-on expertise with container orchestration (Kubernetes) and Docker in production.
  • Advanced proficiency with at least one infrastructure-as-code / configuration management tool: Terraform, Ansible, Puppet, or SaltStack.
  • Solid understanding of networking, firewalls, load balancers, DNS, and database operations.
  • Strong working knowledge of Git-based workflows and version control standards for distributed engineering teams.
  • Comfort operating hybrid environments that include both Linux and Windows ecosystems.
  • Excellent verbal and written communication skills, with the ability to explain highly technical topics to both technical and non-technical audiences.
  • Self-motivated, detail-oriented, and able to work both independently and within a globally distributed team.


Good to Have:

  • Strong scripting skills and a demonstrated ability to automate operational toil — Python preferred; Bash, Go, or Groovy a plus.
  • Hands-on experience designing and operating CI/CD pipelines with Jenkins (Spinnaker, GitHub Actions, or GitLab CI also welcome).
  • Exposure to AWS or other public clouds in addition to GCP.
  • Experience operating SaaS platforms built on microservices architectures.

Fonada
Posted by Karandeep Singh
Noida
5 - 8 yrs
₹15L - ₹20L / yr
DevOps
Amazon Web Services (AWS)
Microsoft Windows Azure
Google Cloud Platform (GCP)
VMware vSphere


About the Role 

We are looking for a Senior DevOps Engineer to lead the design, automation, and scaling of our hybrid cloud infrastructure spanning public cloud and private/on-premises environments. You will partner closely with software engineering, security, and product teams to build reliable, secure, and high-performance systems that support rapid product delivery. This is a hands-on role with significant influence over our infrastructure strategy, deployment workflows, and engineering culture. 


Key Responsibilities 

  • Architect, deploy, and maintain scalable, highly available infrastructure across both public cloud (AWS, Azure, GCP) and private cloud platforms (OpenStack, VMware vSphere/Tanzu, Nutanix, or similar). 
  • Operate and maintain on-premises infrastructure: hypervisors, compute, storage (Ceph, NetApp, SAN/NAS), networking (SDN, VLANs, BGP, MPLS), and hardware capacity planning, alongside their public cloud equivalents. 
  • Design and own CI/CD pipelines that deploy seamlessly across public and private environments. 
  • Implement and manage Infrastructure as Code (Terraform, Ansible, Pulumi) with strong version control and review practices, using providers for both public and private cloud platforms. 
  • Manage container orchestration (Kubernetes, ECS, OpenShift, Rancher) across managed cloud services and self-managed/bare-metal clusters, including upgrades, autoscaling, and workload reliability. 
  • Build observability into all systems through logging, metrics, tracing, and alerting (Prometheus, Grafana, Datadog, ELK, or similar) with unified visibility across hybrid environments. 
  • Champion security best practices: secrets management, IAM hardening, network segmentation, vulnerability scanning, and compliance (SOC 2, ISO 27001, HIPAA, or data-sovereignty requirements). 
  • Lead incident response, root-cause analysis, and post-mortems; drive long-term reliability improvements and SLO/SLA adherence. 
  • Optimize cost, capacity, and resource utilization across public cloud spend and on-premises hardware without compromising performance or availability. 
  • Partner with data center operations and network providers on hardware provisioning, firmware management, MPLS circuit management, and lifecycle planning. 
  • Mentor junior DevOps and software engineers; promote DevOps culture, automation-first thinking, and shared ownership of production. 
  • Evaluate and introduce new tools, platforms, and processes that improve developer productivity and system reliability. 

Required Qualifications 

  • 5+ years of experience in DevOps, SRE, or Platform Engineering roles, with at least 2 years at a senior level. 
  • Deep expertise with at least one major public cloud provider (AWS, Azure, or GCP) in production. 
  • Hands-on experience operating private cloud or virtualization platforms (OpenStack, VMware, Nutanix, or equivalent) in production. 
  • Strong experience with virtualization, storage systems, and enterprise networking in on-premises environments. 
  • Strong hands-on experience with Kubernetes in production, including both managed cloud and self-managed/bare-metal clusters. 
  • Proficiency in Infrastructure as Code (Terraform and Ansible strongly preferred). 
  • Solid scripting and programming skills in Python, Go, Bash, or similar. 
  • Experience designing and operating CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, CircleCI, or ArgoCD. 
  • Strong Linux systems administration and networking fundamentals (TCP/IP, DNS, load balancing, VPNs, firewalls, routing, MPLS). 
  • Experience with monitoring and observability stacks (Prometheus, Grafana, Datadog, New Relic, ELK, or OpenTelemetry). 
  • Proven track record of leading incident response and improving system reliability. 
  • Excellent communication skills and the ability to collaborate across engineering, security, infrastructure, and product teams. 

Preferred Qualifications 

  • Experience designing hybrid and multi-cloud architectures, including secure connectivity (Direct Connect, ExpressRoute, MPLS, VPN, SD-WAN) between public and private environments. 
  • Familiarity with service meshes (Istio, Linkerd), API gateways, and GitOps workflows (ArgoCD, Flux). 
  • Background in security-focused or regulated environments and exposure to compliance frameworks. 
  • Experience with database administration (PostgreSQL, MySQL, Redis, MongoDB) in cloud-managed and self-hosted setups. 
  • Contributions to open-source DevOps or cloud infrastructure tooling. 
  • Relevant certifications (AWS Solutions Architect / DevOps Engineer, Azure Administrator, CKA, CKAD, RHCE, VMware VCP, OpenStack Certified Administrator, HashiCorp Terraform Associate). 



Deqode
Posted by Shraddha Katare
Bengaluru (Bangalore), Pune, Chennai, Hyderabad, Gurugram
5 - 7 yrs
₹7L - ₹15L / yr
Amazon Web Services (AWS)
DevOps
Terraform

Job Title: AWS DevOps Engineer

Experience Level: 5+ Years

Location: Bangalore, Pune, Hyderabad, Chennai and Gurgaon

Summary:

We are looking for a hands-on Platform Engineer with strong execution skills to provision and manage cloud infrastructure. The ideal candidate will have experience with Linux, AWS services, Kubernetes, and Terraform, and should be capable of troubleshooting complex issues in cloud and container environments.

Key Responsibilities:

  • Provision AWS infrastructure using Terraform (IaC).
  • Manage and troubleshoot Kubernetes and container clusters (EKS/ECS).
  • Work with core AWS services: VPC, EC2, S3, RDS, Lambda, ALB, WAF, and CloudFront.
  • Support CI/CD pipelines using Jenkins and GitHub.
  • Collaborate with teams to resolve infrastructure and deployment issues.
  • Maintain documentation of infrastructure and operational procedures.

Required Skills:

  • 3+ years of hands-on experience in AWS infrastructure provisioning using Terraform.
  • Strong Linux administration and troubleshooting skills.
  • Experience managing Kubernetes clusters.
  • Basic experience with CI/CD tools like Jenkins and GitHub.
  • Good communication skills and a positive, team-oriented attitude.

Preferred:

  • AWS Certification (e.g., Solutions Architect, DevOps Engineer).
  • Exposure to Agile and DevOps practices.
  • Experience with monitoring and logging tools.



Cygen Host
Posted by Cygen Host
Bengaluru (Bangalore), Mumbai, Delhi
3 - 7 yrs
₹12L - ₹30L / yr
Microsoft Azure
Amazon Web Services (AWS)

As a DevOps Engineer, you’ll play a key role in managing our cloud infrastructure, automating deployments, and ensuring high availability across our global server network. You’ll work closely with our technical team to optimize performance and scalability.


Responsibilities

✅ Design, implement, and manage cloud infrastructure (primarily Azure)

✅ Automate deployments using CI/CD pipelines (GitHub Actions, Jenkins, or equivalent)

✅ Monitor and optimize server performance & uptime (100% uptime goal)

✅ Work with cPanel-based hosting environments and ensure seamless operation

✅ Implement security best practices & compliance measures

✅ Troubleshoot system issues, scale infrastructure, and enhance reliability


Requirements

🔹 3-7 years of DevOps experience in cloud environments (Azure preferred)

🔹 Hands-on expertise in CI/CD tools (GitHub Actions, Jenkins, etc.)

🔹 Proficiency in Terraform, Ansible, Docker, Kubernetes

🔹 Strong knowledge of Linux system administration & networking

🔹 Experience with monitoring tools (Prometheus, Grafana, ELK, etc.)

🔹 Security-first mindset & automation-driven approach


Why Join Us?

🚀 Work at a fast-growing startup backed by Microsoft

💡 Lead high-impact DevOps projects in a cloud-native environment

🌍 Hybrid work model with flexibility in Bangalore, Delhi, or Mumbai

💰 Competitive salary ₹12-30 LPA based on experience


How to Apply?

📩 Apply now & follow us for future updates:

🔗 X (Twitter): https://x.com/CygenHost

🔗 LinkedIn: https://www.linkedin.com/company/cygen-host/

🔗 Instagram: https://www.instagram.com/cygenhost


Vume Interactive
Posted by Shweta Jaiswal
Bengaluru (Bangalore), Hyderabad
5 - 7 yrs
₹3L - ₹20L / yr
Docker
Kubernetes
DevOps
Amazon Web Services (AWS)
Windows Azure

Key Responsibilities:

  • Work with the development team to plan, execute and monitor deployments
  • Capacity planning for product deployments
  • Adopt best practices for deployment and monitoring systems
  • Ensure the SLAs for performance and uptime are met
  • Continuously monitor systems and suggest changes to improve performance and decrease costs
  • Ensure the highest standards of security



Key Competencies (Functional):

 

  • Proficiency in coding in at least one scripting language (Bash, Python, etc.)
  • Has personally managed a fleet of servers (> 15)
  • Understands the different environments: production, deployment, and staging
  • Has worked in microservice / service-oriented architecture systems
  • Has worked with automated deployment systems such as Ansible, Chef, or Puppet
  • Can write MySQL queries

makeO
Posted by Jagruti Surve
Remote only
3 - 6 yrs
₹10L - ₹15L / yr
Docker
Kubernetes
DevOps
Amazon Web Services (AWS)
Windows Azure
● Comfortable deploying applications on AWS, with a strong working knowledge of EC2, RDS, and S3
● Good command of the Linux environment
● Experience with tools such as Docker, Kubernetes, Redis, Kafka, Elasticsearch, Ansible, and Terraform, and with Node.js and Nginx server configuration and deployment
● Bonus: AWS certification
● Bonus: Basic understanding of database queries for relational databases such as MySQL
● Bonus: Experience with CI servers such as Jenkins, Travis, or similar
● Bonus: Demonstrated programming capability in a high-level programming language such as Python, Go, or similar
● Develop, maintain, and administer tools that automate operational activities and improve engineering productivity
● Automate continuous delivery and on-demand capacity management solutions
● Develop configuration and infrastructure solutions for internal deployments
● Troubleshoot, diagnose, and fix software issues
● Update, track, and resolve technical issues
● Suggest architecture improvements and recommend process improvements
● Evaluate new technology options and vendor products, and ensure critical system security through best-in-class security solutions
● Technical experience in a similar role supporting large-scale production distributed systems
● Must understand the overall system architecture, improve design, and implement new processes

GoodWorker
Posted by Sunder E
Bengaluru (Bangalore)
7 - 10 yrs
₹12L - ₹15L / yr
DevOps
Kubernetes
Docker
Amazon Web Services (AWS)
Terraform

Why you should join us

 

- You will join the mission to create a positive impact on millions of people's lives

- You get to work on the latest technologies in a culture which encourages experimentation

- You get to work with super humans (Psst: Look up these super human1, super human2, super human3, super human4)

- You get to work in an accelerated learning environment

 

 

What you will do

 

- You will provide deep technical expertise to your team in building future ready systems.

- You will help develop a robust roadmap for ensuring operational excellence

- You will set up infrastructure on AWS that will be represented as code

- You will work on several automation projects that provide great developer experience

- You will set up secure, fault-tolerant, reliable, and performant systems

- You will establish clean and optimised coding standards for your team that are well documented

- You will set up systems that are easy to maintain and provide a great developer experience

- You will actively mentor and participate in knowledge sharing forums

- You will work in an exciting startup environment where you can be ambitious and try new things :)

 

 

You should apply if

 

- You have a strong foundation in Computer Science concepts and programming fundamentals

- You have been working on cloud infrastructure setup, especially on AWS, for 8+ years

- You have set up and maintained reliable systems that operate at high scale

- You have experience in hardening and securing cloud infrastructures

- You have a solid understanding of computer networking, network security and CDNs

- You have extensive experience in AWS and Kubernetes, and optionally Terraform

- You have experience in building automation tools for code build and deployment (preferably in JS)

- You understand the hustle of a startup and are good with handling ambiguity

- You are curious, a quick learner and someone who loves to experiment

- You insist on highest standards of quality, maintainability and performance

- You work well in a team to enhance your impact


Olacabs.com
Posted by Roshni Pillai
Bengaluru (Bangalore)
5 - 9 yrs
₹8L - ₹21L / yr
DevOps
Amazon Web Services (AWS)
Kubernetes
Linux/Unix
We are looking for a Site Reliability Engineer/Sr. Site Reliability Engineer to help us build and enhance platforms to achieve availability, scalability and operational effectiveness. The right individual will embrace the opportunity to tackle challenging problems and use their influence to drive continual improvement. You will also work on the cutting edge of technology, leveraging Kong, Repose, Docker, Mesos/Kubernetes, Jenkins, Chef, HAProxy, Nginx, GitLab, MySQL, Scylla, Aerospike, Service Mesh (Istio/Linkerd), Prometheus, etc.

Roles and Responsibilities
● Managing Availability, Performance, Capacity of infrastructure and applications.
● Building and implementing observability for applications health/performance/capacity.
● Optimizing On-call rotations and processes.
● Documenting “tribal” knowledge.
● Managing Infra-platforms like
- Mesos/Kubernetes
- CICD
- Observability(Prometheus/New Relic/ELK)
- Cloud Platforms (AWS/Azure)
- Databases
- Data Platforms Infrastructure
● Providing help in onboarding new services with the production readiness review process.
● Providing reports on services SLO/Error Budgets/Alerts and Operational Overhead.
● Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
● Working with the Dev team to have an in-depth understanding of the application architecture and its bottlenecks.
● Identifying observability gaps in product services and infrastructure, and working with stakeholders to fix them.
● Managing outages, doing detailed RCAs with developers, and identifying ways to avoid a recurrence.
● Managing/automating upgrades of the infrastructure services.
● Automating toil.

Experience & Skills
● 3+ Years of experience as an SRE/DevOps/Infrastructure Engineer on large scale microservices and infrastructure.
● A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
● A deep understanding of computer science, software development, and networking principles.
● Demonstrated experience with languages, such as Python, Java, Golang etc.
● Extensive experience with Linux administration and good understanding of the various linux kernel subsystems (memory, storage, network etc).
● Extensive experience in DNS, TCP/IP, UDP, GRPC, Routing and Load Balancing.
● Expertise in GitOps, Infrastructure as Code tools such as Terraform, and Configuration Management tools such as Chef, Puppet, SaltStack, and Ansible.
● Expertise in Amazon Web Services (AWS) and/or other relevant Cloud Infrastructure solutions like Microsoft Azure or Google Cloud.
● Experience in building CI/CD solutions with tools such as Jenkins, GitLab, Spinnaker, Argo etc.
● Experience in managing and deploying containerized environments using Docker, Mesos/Kubernetes is a plus.
● Experience with multiple datastores is a plus (MySQL, PostgreSQL, Aerospike, Couchbase, Scylla, Cassandra, Elasticsearch).
● Experience with data platform tech stacks like Hadoop, Hive, Presto etc. is a plus.

One Championship
Agency job
via Volks Consulting by Mutahira ahad
Bengaluru (Bangalore)
4.5 - 10 yrs
₹30L - ₹35L / yr
DevOps
CI/CD
Kubernetes
Docker
Microsoft Windows Azure

About the client :

 

Asia’s largest global sports media property in history with a global broadcast to 150+ countries. As the world’s largest martial arts organization, they are a celebration of Asia’s greatest cultural treasure, and its deep-rooted Asian values of integrity, humility, honor, respect, courage, discipline, and compassion. Has achieved some of the highest TV ratings and social media engagement metrics across Asia with its unique brand of Asian values, world-class athletes, and world-class production. Broadcast partners include Turner Sports, Star India, TV Tokyo, Fox Sports, ABS-CBN, Astro, ClaroSports, Bandsports, Startimes, Premier Sports, Thairath TV, Skynet, Mediacorp, OSN, and more. Institutional investors include Sequoia Capital, Temasek Holdings, GIC, Iconiq Capital, Greenoaks Capital, and Mission Holdings. Currently has offices in Singapore, Tokyo, Los Angeles, Shanghai, Milan, Beijing, Bangkok, Manila, Jakarta, and Bangalore.

 

Position: DevOps Engineer – SDE3

 

As part of the engineering team, you would be expected to have deep technology expertise with a passion for building highly scalable products. This is a unique opportunity where you can impact the lives of people across 150+ countries!

 

Responsibilities

• Develop Collaborate in large-scale systems design discussions.

• Deploying and maintaining in-house/customer systems ensuring high availability, performance and optimal cost.

• Automate build pipelines. Ensuring right architecture for CI/CD

• Work with engineering leaders to ensure cloud security

• Develop standard operating procedures for various facets of Infrastructure services (CI/CD, Git Branching, SAST, Quality gates, Auto Scaling)

• Perform and automate regular backups of servers and databases. Ensure rollback and restore capabilities are real-time and zero-downtime.

• Lead the entire DevOps charter for ONE Championship. Mentor other DevOps engineers. Ensure industry standards are followed.

 

Requirements

• Overall 5+ years of experience as a DevOps Engineer/Site Reliability Engineer

• B.E./B.Tech. in CS or an equivalent stream from an institute of repute

• Experience in Azure is a must. AWS experience is a plus

• Experience in Kubernetes, Docker, and containers

• Proficiency in developing and deploying fully automated environments using Puppet/Ansible and Terraform

• Experience with monitoring tools like Nagios/Icinga, Prometheus, Alertmanager, New Relic

• Good knowledge of source code control (Git)

• Expertise in Continuous Integration and Continuous Deployment setup using Azure Pipelines or Jenkins

• Strong experience in programming languages. Python is preferred

• Experience in scripting and unit testing

• Basic knowledge of SQL & NoSQL databases

• Strong Linux fundamentals

• Experience in SonarQube, Locust & BrowserStack is a plus

Yulu Bikes
Posted by Keerthana k
Bengaluru (Bangalore)
3 - 7 yrs
₹7L - ₹15L / yr
DevOps
Kubernetes
Jenkins
Docker
Linux/Unix
  • Mandatory: Docker, AWS, Linux, Kubernetes or ECS
  • Prior experience provisioning and spinning up AWS Clusters / Kubernetes
  • Production experience building scalable systems (load balancers, memcached, master/slave architectures)
  • Experience supporting a managed cloud services infrastructure
  • Ability to maintain, monitor and optimise production database servers
  • Prior work with Cloud Monitoring tools (Nagios, Cacti, CloudWatch etc.)
  • Experience with Docker, Kubernetes, Mesos, NoSQL databases (DynamoDB, Cassandra, MongoDB, etc.)
  • Experience with other open-source tools used in the infrastructure space (Packer, Terraform, Vagrant, etc.)
  • In-depth knowledge of the Linux environment.
  • Prior experience leading technical teams through the design and implementation of systems infrastructure projects.
  • Working knowledge of Configuration Management (Chef, Puppet or Ansible preferred) and Continuous Integration tools (Jenkins preferred)
  • Experience in handling large production deployments and infrastructure.
  • Experience with DevOps-based infrastructure and application deployments.
  • Working knowledge of the AWS network architecture including designing VPN solutions between regions and subnets
  • Hands-on knowledge of the AWS AMI architecture, including the development of machine templates and blueprints
  • Ability to validate that the environment meets all security and compliance controls.
  • Good working knowledge of AWS services such as Messaging, Application Services, Migration Services, Cost Management Platform.
  • Proven written and verbal communication skills.
  • Ability to serve as the technical team lead overseeing the build of the Cloud environment based on customer requirements.
  • Previous NOC experience.
  • Client-facing experience with excellent customer communication and documentation skills