
11+ Backtrack Jobs in India

Apply to 11+ Backtrack Jobs on CutShort.io. Find your next job, effortlessly. Browse Backtrack Jobs and apply today!

Confidential
Agency job
via Petals Careers by Cibi Thomas
Remote only
2 - 4 yrs
₹10L - ₹25L / yr
Penetration testing
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
Kubernetes
Backtrack
+1 more
Job Description
  • You have 2+ years of experience with production GCP/AWS; Experience with Kubernetes is a plus
  • You have conducted penetration testing using different tools like Backtrack and Metasploit
  • You have experience in developing security training and guiding the internal development teams
  • You design and implement best practices concerning information security
  • You can create programs to implement Identity and Access Management
  • You have to develop automated security testing.
  • You have to triage security issues and provide recommended fixes.
  • You have conducted vulnerability assessments using various open-source and commercial tools.
  • You are an excellent collaborator & communicator. You know that start-ups are a team sport.
  • You listen to others, aren’t afraid to speak your mind and always try to ask the right questions.
  • You are excited by the prospect of working in a distributed team and company.

Role: Security Engineer

Title: Security Engineer SDE1

Location: We are open to candidates working from anywhere in India/across the globe. We are fully remote.

About Us
Requirements / Responsibilities

What do we offer?
What’s in it for you?
The ability to make an impact and lay a foundation for the upcoming fin-tech innovations.
A multicultural and diverse team of colleagues from all over the globe
Mission-driven and fast-paced, entrepreneurial environment
Competitive salary and flexible leave policy
A collaborative and flat company culture
Do you truly want to make a difference and revolutionize the lives of millions of business
owners? Do you thrive in an environment where moving at light speed and embracing new
challenges every day is essential? If yes, our client is the perfect place for you!

Regards,
TA
DeepIntent

at DeepIntent

2 candid answers
17 recruiters
Indrajeet Deshmukh
Posted by Indrajeet Deshmukh
Pune
3 - 6 yrs
Best in industry
Kubernetes
Git
MySQL
Amazon Web Services (AWS)
CI/CD
+3 more

With a core belief that advertising technology can measurably improve the lives of patients, DeepIntent is leading the healthcare advertising industry into the future. Built purposefully for the healthcare industry, the DeepIntent Healthcare Advertising Platform is proven to drive higher audience quality and script performance with patented technology and the industry’s most comprehensive health data. DeepIntent is trusted by 600+ pharmaceutical brands and all the leading healthcare agencies to reach the most relevant healthcare provider and patient audiences across all channels and devices. For more information, visit DeepIntent.com or find us on LinkedIn.


We are seeking a skilled and experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a minimum of 3 years of hands-on experience in managing and maintaining production systems, with a focus on reliability, scalability, and performance. As an SRE at DeepIntent, you will play a crucial role in ensuring the stability and efficiency of our infrastructure, as well as contributing to the development of automation and monitoring tools.


Responsibilities:

  • Deploy, configure, and maintain Kubernetes clusters for our microservices architecture.
  • Utilize Git and Helm for version control and deployment management.
  • Implement and manage monitoring solutions using Prometheus and Grafana.
  • Work on continuous integration and continuous deployment (CI/CD) pipelines.
  • Containerize applications using Docker and manage orchestration.
  • Manage and optimize AWS services, including but not limited to EC2, S3, RDS, and AWS CDN.
  • Maintain and optimize MySQL databases, Airflow, and Redis instances.
  • Write automation scripts in Bash or Python for system administration tasks.
  • Perform Linux administration tasks and troubleshoot system issues.
  • Utilize Ansible and Terraform for configuration management and infrastructure as code.
  • Demonstrate knowledge of networking and load-balancing principles.
  • Collaborate with development teams to ensure applications meet reliability and performance standards.


Additional Skills (Good to Know):

  • Familiarity with ClickHouse and Druid for data storage and analytics.
  • Experience with Jenkins for continuous integration.
  • Basic understanding of Google Cloud Platform (GCP) and data center operations.


Qualifications:

  • Minimum 3 years of experience in a Site Reliability Engineer role or similar.
  • Proven experience with Kubernetes, Git, Helm, Prometheus, Grafana, CI/CD, Docker, and microservices architecture.
  • Strong knowledge of AWS services, MySQL, Airflow, Redis, AWS CDN.
  • Proficient in scripting languages such as Bash or Python.
  • Hands-on experience with Linux administration.
  • Familiarity with Ansible and Terraform for infrastructure management.
  • Understanding of networking principles and load balancing.


Education:

Bachelor's degree in Computer Science, Information Technology, or a related field.


DeepIntent is committed to bringing together individuals from different backgrounds and perspectives. We strive to create an inclusive environment where everyone can thrive, feel a sense of belonging, and do great work together.

DeepIntent is an Equal Opportunity Employer, providing equal employment and advancement opportunities to all individuals. We recruit, hire and promote into all job levels the most qualified applicants without regard to race, color, creed, national origin, religion, sex (including pregnancy, childbirth and related medical conditions), parental status, age, disability, genetic information, citizenship status, veteran status, gender identity or expression, transgender status, sexual orientation, marital, family or partnership status, political affiliation or activities, military service, immigration status, or any other status protected under applicable federal, state and local laws. If you have a disability or special need that requires accommodation, please let us know in advance.

DeepIntent’s commitment to providing equal employment opportunities extends to all aspects of employment, including job assignment, compensation, discipline and access to benefits and training.

Bengaluru (Bangalore)
4 - 8 yrs
₹25L - ₹60L / yr
Python
DevOps
Amazon Web Services (AWS)
Ansible
Terraform
+4 more
We are a digital B2B platform that offers loans, working capital, and payment services to small businesses.

Candidate MUST HAVE product-based company experience and a minimum of 3 years of experience in DevOps.

What you will do (or learn) : 

1. Build our application stack on AWS using infrastructure as code (read: Terraform).
2. Build state-of-the-art CI/CD pipelines.
3. Manage data warehouses and data pipelines.
4. Work on infrastructure and data security.
5. Build a state-of-the-art log management system and the tooling around it.
6. Build monitoring and alerting systems.

What do we expect from you?
1. 3 to 10 years of experience with DevOps or SRE principles.
2. Good fundamentals of database management and other distributed systems management.
3. Experience in infrastructure as code or other configuration management systems.
4. Experience in scripting languages (like Bash, Python, Go, etc.)
5. Good understanding of Linux systems
6. Strong debugging and troubleshooting skills
7. Experience in tooling around monitoring, CI/CD, log management systems. 
APIwiz
Balaji Vijayan
Posted by Balaji Vijayan
Bengaluru (Bangalore)
3 - 7 yrs
Best in industry
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
Linux/Unix
Docker
Kubernetes
+1 more

Overview

Apiwiz (Itorix Inc) is looking for software engineers to join our team, grow with us, introduce us to new ideas and develop products that empower our users. Every day, you’ll work with team members across disciplines developing products for Apiwiz (Itorix Inc). You’ll interact daily with our product managers to understand our domain and create technical solutions that push us forward. We want to work with other engineers who bring knowledge and excitement about our opportunities.

You will impact major features and new product decisions as part of our remarkably high performing, collaborative team of engineers who thrive on the business impact of their work. With strong team support and significant freedom and self direction, you will experience the wealth of interesting, challenging problems that only a high growth startup can provide.


Roles & Responsibilities

  • Build, configure, and manage cloud compute and data storage infrastructure for multiple instances of AWS and Google Cloud Platform.
  • Manage VPCs, security groups, and user access to our various public cloud systems and services.
  • Develop processes and procedures for using cloud-based infrastructures, including access key rotation, disaster recovery, and building new services.
  • Help the business control costs by categorizing and tagging assets running in the cloud.
  • Develop scripts and workflows to manage cloud computing systems
  • Provide oversight on log aggregation and application performance monitoring surrounding our production environments.

What we’re looking for

  • 2-3 years of experience in provisioning, configuring, administering, automating, monitoring, and supporting enterprise Cloud services
  • Strong experience in designing, building, maintaining and securing AWS resources for high-availability and production level systems and services
  • Familiar with Cloud concepts with practical hands-on experience on any Cloud Platform.
  • Hands-on experience with AWS services like Elastic Compute Cloud (EC2), Elastic Load-balancers, S3, Elastic File system, VPC, Route53, and IAM.
  • Providing 24/7 application and infrastructure support
  • Prior experience using an infrastructure-as-code tool like Terraform.
  • Knowledge in software provisioning, configuration management, and application-deployment tools like Ansible.
  • Working knowledge of container technologies like Docker & Kubernetes cluster operations.
  • Familiarity with software automation tools such as Git, Jenkins, CodePipeline, and SonarQube


Upswing Financial Technologies Private Limited

at Upswing Financial Technologies Private Limited

2 candid answers
4 recruiters
Simran Bindra
Posted by Simran Bindra
Bengaluru (Bangalore)
3 - 6 yrs
Best in industry
Linux/Unix
Linux administration
Information security
Network Security
Docker
+4 more

At Upswing, we are committed to building a robust, scalable & secure API platform to power the world of Open Finance.

We are a passionate and self-driven team of thinkers who aspire to build the rails to connect the legacy financial sector with financial innovators through a simple and powerful banking-as-a-service (BaaS) platform.

We are looking for motivated engineers who will be working in a highly creative and cutting-edge technology environment to build a world-class financial services suite.

 

About the role

As part of the DevSecOps team at Upswing, you will get to work on building state-of-the-art infrastructure for the future. You will also be –

  • Managing security aspects of the Cloud Infrastructure 
  • Designing and Implementing Security measures, Incident Response guidelines 
  • Conducting Security Awareness Training
  • Developing SIEM tooling and pipelines end to end for vulnerability/security/incident reporting 
  • Developing automation and performing routine VAPT for Network and Applications
  • Integrating with 3rd party vendors for the services required to improve security posture 
  • Mentoring people across the teams to enable best practices 

What will you do if you join us?

  • Engage in a lot of cross-team collaboration to independently drive forward DevSecOps practices across the org 
  • Take Ownership of existing, ongoing, and future DevSecOps initiatives 
  • Plan and Engage in Architecture discussions to bring in different angles (especially security angles) to the table
  • Build Automation stack and tools for security pipeline 
  • Integrate different security measures and pipelines with the SIEM tool
  • Conducting routine VAPT using manual and automated workflows, generating and maintaining the report for the same
  • Introduce and Implement best practices across teams for a great security posture in the org

 

You should have

  • Curiosity for on-the-job learning and experimenting with new technologies and ideas
  • A strong background in Linux environment
  • Proven experience in Architecting networks with security first implementation
  • Experience with VAPT tooling for Networks and Applications is required 
  • Strong experience in Cloud technologies, multi-cloud environments, and best practices in Cloud 
  • Experience with at least one scripting language (Ruby/Python/Groovy)
  • Experience in Terraform is highly desirable but not mandatory
  • Some experience with Kubernetes, and Docker is required 
  • Understanding Java web applications and monitoring them for security vulnerabilities would be a plus 
  • Any other DevSecOps-related experience will be considered


Markowate
Gauri Parashar
Posted by Gauri Parashar
Gurugram
9 - 15 yrs
Best in industry
Microservices
Architecture
NodeJS (Node.js)
Apache Kafka
Amazon Web Services (AWS)
+5 more

About Us :

Markowate is a digital product development company building digital products on AI, Blockchain, Mobile, and Web3 digital. We work with tech startups as their technical partner and help them with their digital transformation.

Role Overview: As a Solution Architect, you will collaborate with stakeholders, including business executives, project managers, and software development teams, to understand the organization's objectives and requirements. You will then design scalable and efficient software solutions that align with these requirements. Your role involves assessing technologies, creating architecture designs, overseeing the development process, and ensuring the successful implementation of the solutions.


"Note: Should have 9+ years of relevant experience. Must have worked with Node.js technology."


Responsibilities:

  • Collaborate with stakeholders to understand and analyze business and technical requirements, and translate them into scalable and feasible solution designs.
  • Develop end-to-end solution architectures, considering factors such as system integration, scalability, performance, security, and reliability.
  • Research and evaluate new technologies, frameworks, and platforms to determine their suitability for the organization's needs.
  • Provide technical guidance and support to development teams throughout the software development life cycle (SDLC) to ensure adherence to the architectural vision and best practices.
  • Effectively communicate complex technical concepts to non-technical stakeholders, such as executives and project managers, and provide recommendations on technology-related decisions.
  • Identify and mitigate technical risks by proactively identifying potential issues and developing contingency plans.
  • Collaborate with quality assurance teams to define and implement testing strategies that validate the solution's functionality, performance, and security.
  • Create and maintain architectural documentation, including diagrams, technical specifications, and design guidelines, to facilitate efficient development and future enhancements.
  • Stay up-to-date with industry trends, best practices, and emerging technologies to drive innovation and continuous improvement within the organization.

Requirements:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Should have 9+ years of relevant experience as a solution architect.
  • Must have experience in Node.js and should have a deep understanding of AI/ML and data pipelines.
  • Proven experience as a Solution Architect or a similar role, with a strong background in software development.
  • In-depth knowledge of software architecture patterns, design principles, and development methodologies.
  • Proficiency in various programming languages and frameworks.
  • Strong problem-solving and analytical skills, with the ability to think strategically and propose innovative solutions.
  • Excellent communication and presentation skills, with the ability to convey complex technical concepts to both technical and non-technical stakeholders.
  • Experience in cloud computing platforms, such as AWS, Azure, or Google Cloud, and understanding of their services and deployment models.
  • Familiarity with DevOps practices, continuous integration/continuous deployment (CI/CD) pipelines, and containerization technologies like Docker and Kubernetes.


Pune
4 - 8 yrs
₹15L - ₹15L / yr
Amazon Web Services (AWS)
Kubernetes
Ansible
Prometheus
Grafana
+2 more

Position: Site Reliability Engineer

Location: Pune (currently WFH; relocation required post-pandemic)

 

About the Organization:

A funded product development company, headquartered in Singapore with offices in Australia, United States, Germany, United Kingdom, and India. You will gain work experience in a global environment.

 

Job Description:

We are looking for an experienced DevOps / Site Reliability engineer to join our team and be instrumental in taking our products to the next level.

 

In this role, you will be working on bleeding-edge hybrid cloud / on-premise infrastructure handling billions of events and terabytes of data a day.

 

You will be responsible for working closely with various engineering teams to design, build and maintain a globally distributed infrastructure footprint.

As part of the role, you will be responsible for researching new technologies, managing a large fleet of active services and their underlying servers, automating the deployment, monitoring and scaling of components, and optimizing the infrastructure for cost and performance.

 

Day-to-day responsibilities

 

  • Ensure the operational integrity of the global infrastructure
  • Design repeatable continuous integration and delivery systems
  • Test and measure new methods, applications and frameworks
  • Analyze and leverage various AWS-native functionality
  • Support and build out an on-premise data center footprint
  • Support other teams and diagnose issues related to our infrastructure
  • Participate in 24/7 on-call rotation (If Required)

 

Candidate's Profile:

 

 

  • Expert-level administrator of Linux-based systems
  • Experience managing distributed data platforms (Kafka, Spark, Cassandra, etc.). Aerospike experience is a plus.
  • Experience with production deployments of Kubernetes Cluster
  • Experience in automating provisioning and managing Hybrid-Cloud infrastructure (AWS, GCP and On-Prem) at scale.
  • Knowledge of monitoring platform (Prometheus, Grafana, Graphite).
  • Experience in Distributed storage systems such as Ceph or GlusterFS.
  • Experience in virtualisation with KVM, oVirt and OpenStack.
  • Hands-on experience with configuration management systems such as Terraform and Ansible
  • Bash and Python Scripting Expertise
  • Network troubleshooting experience (TCP, DNS, IPv6 and tcpdump)
  • Experience with continuous delivery systems (Jenkins, GitLab, Bitbucket, Docker)
  • Experience managing hundreds to thousands of servers globally
  • Enjoy automating tasks, rather than repeating them
  • Capable of estimating costs of various approaches, and finding simple and inexpensive solutions to complex problems
  • Strong verbal and written communication skills
  • Ability to adapt to a rapidly changing environment
  • Comfortable collaborating and supporting a diverse team of engineers
  • Ability to troubleshoot problems in complex systems
  • Flexible working hours and ability to participate in 24/7 on call support with other team members whenever required.
***** Looking for people from product organizations, who can join at the earliest.
Smarsh

at Smarsh

1 recruiter
Nichell Dsouza
Posted by Nichell Dsouza
Bengaluru (Bangalore)
9 - 15 yrs
₹40L - ₹50L / yr
Reliability engineering
Kubernetes
IT infrastructure

Company Description

Smarsh is the leader in communications compliance, archiving, and analytics. We provide compliance across the broadest set of communications channels with insights on what’s being captured. Smarsh customers manage over 500 million daily conversations across 80 channels and growing. Customers include the top 10 U.S., top 8 European, top 5 Canadian, and top 3 Asian banks. The Smarsh advantage is customers stay ahead of compliance and uncover patterns and relationships hidden within their data.

At Smarsh, we’ve been helping our customers manage new forms of communication since 1998. We work closely with regulators including the SEC, FINRA, IIROC, and the PRA and FCA, and with our customers, to ensure that they understand the capabilities of today’s technology and that our platform meets their most stringent requirements. Our products include Connected Capture, Connected Archive, Web Archive & Business Solutions.

 

About the team

Are you an SRE with excellent Observability, Containerization and Orchestration skills? As a Site Reliability Engineer (SRE) in the Smarsh SaaS Operations team, you'll be part of a team who measures and improves production performance reliability through sustainable engineering practices for our suite of applications. Toil will be your number one enemy, observability your closest friend and your mission will be to drive operational burden as close to zero as you can.

Responsibilities

  • Responsible for technical direction at the platform solutions level. Is able to weigh the pros and cons of various solutions and credibly argue for the best path
  • Work closely with Product Management and the rest of the engineering team to define features and their implementations with careful attention to quality, scalability, and maintainability
  • Can break down complex technical solutions into abstractions that the rest of the team can understand
  • Can investigate and solve complex bugs, performance, and scalability issues
  • Collaborates with multiple agile teams to ensure their solutions integrate effectively
  • Track work in ticketing system (JIRA)
  • Participate in Pull Request reviews. Provide and receive feedback to continuously improve.
  • Other duties as assigned.

Desired skills & experience

  • A minimum of 10 years of industry experience
  • Master's in CS or equivalent
  • Must have experience in Azure or AWS, either running some large-scale app there or migrating to Azure/AWS. 
  • Experience operating Cloud Foundry in production environments 
  • Experience managing CI/CD systems (Concourse, Jenkins, TravisCI etc.) 
  • Experience deploying and/or operating ELK stack 
  • Experience with container technologies and orchestration platforms (Docker, Kubernetes, Cloud Foundry) 
  • Experience working with monitoring and observability tools (We use Datadog and New Relic) 
  • Familiarity with working with PostgreSQL and MongoDB 
  • Background working in a multi-platform environment (Linux, Windows) 
  • Experience with running on a cloud platform, AWS preferred (S3, RDS, SQS) 
  • Familiarity with Agile/Scrum/Kanban methodologies 
  • Familiarity with programming/scripting languages (e.g., Python, Bash, PowerShell, Go)

Additional Skills

  • Expert programming skills in relevant languages
  • Exceptional analytical and problem-solving skills
  • Strong communication and collaboration skills
  • Deep understanding of modern software architecture
  • Deep domain knowledge of the industry, platform, and existing processes
  • Fault-tolerant design & maintenance
  • Knowledge and understanding of modern software programming/engineering.
  • Product delivery lifecycle - requirement refinement through ops

 

Why Smarsh?

Ready to join a thriving tech company that’s redefining digital archiving and business intelligence?

Smarsh is the leading comprehensive archiving platform. Recognized as one of today’s fastest growing companies in the U.S., Smarsh delivers innovative cloud-based solutions that help organizations manage and enforce flexible and secure records retention and compliance strategies for electronic communications, including social media and enterprise social networks (Yammer, Chatter, Facebook, LinkedIn and more).

Our motto is ‘People First. Inspire Confidence. Embrace the Impossible.’ We hire lifelong learners who have a passion for their discipline and a track record of excellence. To learn more about us, visit www.smarsh.com/careers

 


Bengaluru (Bangalore)
5 - 9 yrs
₹6L - ₹15.2L / yr
Kubernetes
CI/CD
DevOps
Docker
Splunk
+8 more
Skills: Kubernetes, security tools, security processes, DevSecOps, three-tier architecture, DevOps, GitOps, Docker, Kustomize, Helm, SAST, DAST, Splunk, Grafana, Azure, Unix shell, Linux shell.

Years: 5-9 Years

Job Responsibilities

 

Primary:

  • Responsible for security road map for EPDM application
  • Train the CI/CD team on security adoption for the required technologies
  • Lead the upskill program within the team
  • Support Application architect with right inputs on security processes and tools
  • Help setup DevSecOps for EPDM.
  • Find security vulnerabilities in the development process and in sealed secrets
  • Support in defining the Three-tier architecture.

 

 

Secondary:

  • Coordination with different IT stakeholders as and when needed
  • Suggestion and Implementation of further tool chains towards DevOps and GitOps
  • Responsible to train the peer colleagues

 

 

 

Skills:

Mandatory skill:

  • Expert knowledge of container solutions. Must have >3 years of experience working with networking & debugging within Docker and Kubernetes.
  • Hands-on experience with Kubernetes workload deployments using Kustomize & Helm.
  • Good understanding of Bitnami, HashiCorp, and other secret management tools
  • SAST/DAST integration in the CI/CD pipeline: design and implementation.
  • Expert knowledge of source control systems and build & integration tools (e.g., Git, Jenkins & Maven).
  • Hands-on experience with designing the CI/CD architecture & building pipelines (on On-prem, Cloud & Hybrid infrastructure services).
  • Experience with security log management tools (e.g., Splunk, ELK/EFK stack, Azure Monitor or similar).
  • Experience with monitoring tools like Prometheus-Grafana & Dynatrace.
  • Experience with Infrastructure as a Service / Cloud computing (preferably Azure).
  • Expert in writing automation scripts in YAML, Unix shell, and Linux shell.
  • Pulumi experience would be an added advantage.

 

Srijan Technologies

at Srijan Technologies

6 recruiters
Adyasha Satpathy
Posted by Adyasha Satpathy
Remote only
5 - 12 yrs
₹20L - ₹32L / yr
Kubernetes
Docker
Ansible
Terraform
Amazon Web Services (AWS)
+6 more

SRE - Tech Lead (DevOps):

Location: Permanent Work From Home Option
Notice: Candidates with a notice period of 30 days or less are preferred

SRE-DevOps- Tech Lead - JD:

 

Srijan is hiring for Site Reliability Engineering (SRE). We are looking for an SRE/DevOps Tech Lead or Sr. Tech Lead with strong automation skills and a good understanding of how to build & run secure & reliable platforms for cloud-native applications. Please find the detailed job description below for reference:



Minimum Experience: 6+ years in DevOps/SRE

Permanent WFH option

Job Description:-

The focus of this role is to build scalable, resilient, secure infrastructure for cloud-native applications whilst automating every mundane task you could think of and build observability dashboards, set up alerts, etc to provide optics to relevant stakeholders. In a nutshell: “You are keepers of Production environments”. You must be a problem solver with the ability to multitask and come with strong collaboration and communication skills.



Key Responsibilities:-

  • Proactively monitor and review application performance

  • Handle on-call and emergency support

  • Ensure software has good logging and diagnostics

  • Create and maintain operational runbooks

  • Contribute to solution design and evaluate technical debt

  • Set right practices for Well-Defined Architecture & to minimize toil.

  • Own SLI, SLO configuration as per Error Budget

  • Maintain production services through measuring and monitoring availability, latency, and overall system health.

  • Practice sustainable incident response and blameless postmortems.

  • Not be afraid to contribute changes back to the Software engineering team to improve the systems.

  • Managing the delivery pipeline into production.

  • Able to mentor junior members on regular basis

  • Troubleshooting issues with web applications

  • Understanding of security principles and best practices

  • Ensuring that critical data is backed up

  • Configuration of monitoring systems including infrastructure monitoring and Application Performance Monitoring systems such as New Relic.

  • Ensuring that web application infrastructure is built

  • Ability to act as Customer Technical Advocate and negotiate well with peers on technical fronts.

  • Flexible enough to work in different shifts when business requirements demand it

  • Ability to handle multiple global clients on tech front and generate desired reports to represent health of SRE Delivery.



Skills/Experience:-

  • A key skill of an SRE Tech Lead is deep knowledge of the application, the code, and how it runs, is configured, and scales. That knowledge is what makes them so valuable for monitoring and supporting it as site reliability engineers.

  • System administration, security, and networking

  • The SRE Tech Lead is expected to have a good understanding of system administration (Linux or Windows) and networking.

  • Essential commands

  • User and Group Management

  • Knowledge of networking concepts (DNS, TCP/IP, and Firewalls)

  • Service Configuration

  • Storage Management

  • Good grasp of fundamental security concepts

  • Good understanding of infrastructure as code principles.

  • Knowledge of a scripting language such as Bash

  • Ability to configure infrastructure using a Configuration Management technology such as Puppet, Chef, or Ansible.

  • Familiarity with Jenkins or any other CI/CD tool

  • Proficiency in a high-level programming language such as Python or Go.

  • Understanding of container technologies such as Docker, Kubernetes

  • 2 yrs+ hands on experience with container orchestration technologies such as ECS, EKS, AKS or Kubernetes would be beneficial.

  • Use Terraform and other IaC to deploy cloud infrastructure.







Cloud technologies:-

  • Experience designing available, cost-efficient, fault-tolerant, and scalable distributed systems on AWS/Azure

  • Hands-on experience using compute, networking, storage, and database AWS/Azure services

  • Hands-on experience of 4 yrs+ with AWS/Azure deployment and management services

  • Ability to identify and define technical requirements for an AWS/AZURE-based application

  • Ability to identify which AWS/AZURE services meet a given technical requirement

  • Knowledge of recommended best practices for building secure and reliable applications on the AWS/AZURE platform

  • An understanding of the AWS/AZURE global infrastructure

  • An understanding of network technologies as they relate to AWS/AZURE

  • An understanding of security features and tools that AWS/AZURE provides and how they relate to traditional services







 

Olacabs.com

at Olacabs.com

6 recruiters
Agency job
via zyoin by RAKESH RANJAN
Bengaluru (Bangalore)
6 - 11 yrs
₹20L - ₹38L / yr
DevOps
Terraform
Ansible
CI/CD
Linux administration
+7 more

 

Roles and Responsibilities

  • Managing Availability, Performance, Capacity of infrastructure and applications.
  • Building and implementing observability for applications health/performance/capacity.
  • Optimizing On-call rotations and processes.
  • Documenting “tribal” knowledge.
  • Managing infra platforms like Mesos/Kubernetes, CI/CD, Observability (Prometheus/New Relic/ELK), Cloud Platforms (AWS/Azure), Databases, and Data Platforms Infrastructure
  • Providing help in onboarding new services with production readiness review process.
  • Providing reports on services SLO/Error Budgets/Alerts and Operational Overhead.
  • Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
  • Working with the Dev team to gain an in-depth understanding of the application architecture and its bottlenecks.
  • Identifying observability gaps in product services and infrastructure, and working with stakeholders to fix them.
  • Managing outages, doing detailed RCA with developers, and identifying ways to avoid a recurrence.

  • Managing/Automating upgrades of the infrastructure services.
  • Automate toil work.

Experience & Skills

  • 6+ years of total experience
  • Experience as an SRE/DevOps/Infrastructure Engineer on large scale microservices and infrastructure.
  • A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
  • A deep understanding of computer science, software development, and networking principles.
  • Demonstrated experience with languages such as Python, Java, and Golang.
  • Extensive experience with Linux administration and a good understanding of the various Linux kernel subsystems (memory, storage, network, etc.).
  • Extensive experience in DNS, TCP/IP, UDP, gRPC, routing, and load balancing.
  • Expertise in GitOps, infrastructure-as-code tools such as Terraform, and configuration management tools such as Chef, Puppet, SaltStack, and Ansible.
  • Expertise in Amazon Web Services (AWS) and/or other relevant cloud infrastructure solutions like Microsoft Azure or Google Cloud.
  • Experience in building CI/CD solutions with tools such as Jenkins, GitLab, Spinnaker, Argo, etc.
  • Experience in managing and deploying containerized environments using Docker; Mesos/Kubernetes is a plus.

Get to hear about interesting companies hiring right now
Why apply via Cutshort?
Connect with actual hiring teams and get their fast response. No spam.
Find more jobs