Cutshort logo
Databook logo
Platform Engineer (SRE/DevOps + Backend engineering)
Platform Engineer (SRE/DevOps + Backend engineering)
Databook's logo

Platform Engineer (SRE/DevOps + Backend engineering)

Nikhil Mohite's profile picture
Posted by Nikhil Mohite
5yrs+
Upto ₹28L / yr (Varies
)
Mumbai
Skills
skill iconPython
skill iconJavascript
DevOps
SQL
skill iconAmazon Web Services (AWS)
Terraform
Ansible
AWS CloudFormation
Monitoring

About Databook

Databook is the world’s first AI-powered enterprise customer intelligence platform, founded in 2017 to empower enterprise sales teams with a distinct advantage. Leading companies like Microsoft, Salesforce, and Databricks rely on Databook to enhance customer engagement and accelerate revenue acquisition. Backed by Bessemer Ventures, DFJ Growth, M12 (Microsoft’s Venture fund), Salesforce Ventures, and Threshold Ventures, we operate as a customer-focused, innovative organization headquartered in Palo Alto, CA, with a global distributed team.


About Our Technology Team

The Engineering team at Databook brings together collaborative and technically passionate individuals to deliver innovative customer intelligence solutions. Led by former Google and Salesforce engineers, this group explores the full engineering lifecycle, driving impactful outcomes and offering opportunities for leadership and growth in a hyper-growth context.


The Opportunity

We're seeking a proactive and skilled Platform Engineer to enhance the reliability, scalability, and performance of our platform. This role offers the chance to collaborate closely with cross-functional teams, integrate new technologies, and advance our DevOps and SRE practices. If you're passionate about driving excellence, building robust systems, and contributing to the evolution of an AI-driven platform, join our dynamic team!


Responsibilities

- Promote best practices and standards across engineering teams to ensure platform reliability and performance.

- Collaborate with product management and engineering to enhance platform scalability and align with business goals.

- Develop and optimize backend systems and infrastructure to support platform growth.

- Implement and enhance CI/CD pipelines, automation, monitoring, and alerting systems.

- Document system performance, incidents, and resolutions, producing detailed technical reports.

- Formulate backend architecture plans and provide guidance on deployment strategies and reliability improvements.

- Participate in an on-call rotation to ensure 24/7 platform reliability and rapid incident response.


Qualifications

- 5+ years of experience in Platform or Infrastructure Engineering, DevOps, SRE, or similar roles.

- Strong backend development experience using Python, JavaScript/Typescript.

- Solid understanding of API design and implementation.

- Proficiency in SQL.

- Experience with CI/CD tools like Jenkins, GitLab CI, CircleCI.

- Hands-on experience with IaC tools such as Terraform, CloudFormation, Ansible.

- Familiarity with monitoring and observability tools like Datadog, Splunk, New Relic, Prometheus.

- Strong analytical and problem-solving skills with a focus on long-term solutions.

- Excellent communication skills for collaboration with technical and non-technical stakeholders.

- Ability to thrive in a fast-paced environment and manage multiple priorities.


Working Arrangements

This position offers a hybrid work mode, combining remote and in-office work as mutually agreed upon.


Ideal Candidates Will Also Have

- Interest or experience in Machine Learning and Generative AI.

- Exposure to performance, load, and stress testing frameworks.

- Familiarity with security best practices and tools in cloud environments.


Join Us and Enjoy These Perks!

- Competitive salary with bonus

- Medical insurance coverage

- Generous leave and public holidays

- Employee referral bonus program

- Annual learning stipend for professional development

- Complimentary subscription to Masterclass


About Databook

Databook is a pioneer in Strategic Relationship Management (SRM), using advanced AI and NLP to empower B2B sales teams globally. Our award-winning SRM platform leverages financial and market data to create actionable sales strategies, achieving significant improvements in customer meetings, pipeline growth, deal size, and cycle time.

Read more
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Subodh Popalwar's profile image

Subodh Popalwar

Software Engineer, Memorres
For 2 years, I had trouble finding a company with good work culture and a role that will help me grow in my career. Soon after I started using Cutshort, I had access to information about the work culture, compensation and what each company was clearly offering.
Companies hiring on Cutshort
companies logos

About Databook

Founded :
2017
Type
Size
Stage :
Raised funding
About

Great salespeople let their customers’ strategies do the talking.


Databook’s award-winning Strategic Relationship Management (SRM) platform uses advanced AI and NLP to empower the world’s largest B2B sales teams to create, manage, and maintain strategic relationships at scale. The platform ingests and interprets billions of financial and market data signals to generate actionable sales strategies that connect the seller’s solutions to a buyer’s financial pain and urgency.


About our Technology Team-

The Engineering team at Databook brings together collaborative and technically passionate individuals to deliver innovative customer intelligence solutions that drive real value for our clients. Led by past Google and Salesforce engineers, this group offers an opportunity to explore all aspects of the engineering lifecycle, to be close to the client, and to make a real difference. We give our engineers a lot of autonomy to own and solve complex, novel problems - from system design to natural language processing (NLP) and machine learning (ML). The Engineering team offers a place where engineers can collect an abundance of experience and practice their leadership skills; all in a hyper-growth context.

Read more
Tech Stack
skill iconNodeJS (Node.js)
skill iconReact.js
skill iconPython
skill iconMongoDB
skill iconPostgreSQL
Company video
Candid answers by the company
What does the company do?
What is the location preference of jobs?
What is the work mode of the company?
How much funding has Databook raised to date?
Benefits at Databook

Databook’s Strategic Relationship Management (SRM) platform uses advanced AI to empower the world’s largest B2B sales teams

Product showcase
Databook Platform's logo
Databook Platform
Visit
Create, manage, and maintain strategic relationships at scale with auto-generated and actionable sales strategies that evolve in real time.
Read more
SRM is a go-to-market process for deepening and expanding your connection with customers by aligning around a clear understanding of customer need.
Read more
Photos
Company featured pictures
Company featured pictures
Company featured pictures
Company featured pictures
Company featured pictures
Company featured pictures
Company featured pictures
Connect with the team
Profile picture
Collin Holzem
Profile picture
Deeplai Tumbre
Company social profiles
angelbloglinkedintwitter

Similar jobs

Bengaluru (Bangalore)
3 - 10 yrs
Best in industry
skill iconPython
skill iconAmazon Web Services (AWS)
Terraform

We have an exciting and rewarding opportunity for you to take your software engineering career to the next level. 

As a Software Engineer III at JPMorgan Chase within the Asset & Wealth Management, you serve as a seasoned member of an agile team to design and deliver trusted market-leading technology products in a secure, stable, and scalable way. You are responsible for carrying out critical technology solutions across multiple technical areas within various business functions in support of the firm’s business objectives.

Job responsibilities

 

  • Executes software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems
  • Creates secure and high-quality production code and maintains algorithms that run synchronously with appropriate systems
  • Produces architecture and design artifacts for complex applications while being accountable for ensuring design constraints are met by software code development
  • Gathers, analyzes, synthesizes, and develops visualizations and reporting from large, diverse data sets in service of continuous improvement of software applications and systems
  • Proactively identifies hidden problems and patterns in data and uses these insights to drive improvements to coding hygiene and system architecture
  • Contributes to software engineering communities of practice and events that explore new and emerging technologies
  • Adds to team culture of diversity, equity, inclusion, and respect

 

 

Required qualifications, capabilities, and skills

 

  • Formal training or certification on software engineering concepts and 3+ years applied experience
  • Expert level in the programming on Python. Experience designing and building APIs using popular frameworks such as Flask, Fast API
  • Familiar with site reliability concepts, principles, and practices
  • Experience maintaining a Cloud-base infrastructure
  • Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
  • Emerging knowledge of software, applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
  • Emerging knowledge of continuous integration and continuous delivery tools (e.g., Jenkins, Jules, Spinnaker, BitBucket, GitLab, Terraform, etc.)
  • Emerging knowledge of common networking technologies

 

Preferred qualifications, capabilities, and skills

 

  • General knowledge of financial services industry
  • Experience working on public cloud environment using wrappers and practices that are in use at JPMC
  • Knowledge on Terraform, containers and container orchestration, especially Kubernetes preferred
Read more
Nvizion Solutions
at Nvizion Solutions
1 recruiter
Anshita Abhilasha
Posted by Anshita Abhilasha
Remote only
3 - 6 yrs
₹6L - ₹15L / yr
DevOps
Google Cloud Platform (GCP)
skill iconAmazon Web Services (AWS)
Linux/Unix
JIRA
+3 more

Nvizion Solutions is looking for the position of Site Reliability Engineer.

 

If interested, kindly share your resume along with contact details.

 

 

Title: Site Reliability Engineer

No. of job openings: 2

Location:Gurgaon/ Hyderabad/ Bengaluru/ Mumbai/Chennai ( Remote location)

Remuneration:Best in the Industry

 

 

·      Experience required: 2 to 4 yrs in the industry

·      Ensuring overall System's reliability

·      Add automation and alerting in the system

·      Providing Troubleshooting support

·      Cross team communications. Working closely with Product team and Customer success team.

·      Proactive support - to ensures the system is back to the healthy state

·      R&D for new tools/technologies to support product and support team

·      Good verbal/written communication to connect with the client.

·      Good team player with a zeal to learn new technologies.

·      The candidate will be part of the team responsible for 24X7 monitoring of distributed global platform.

  • Linux Scripting
  • CI/CD knowledge (Jenkins/ BitBucket Pipelie /GitOps)
  • Version Control
  • Cloud platform knowledge (GCP/AWS/Azure/Digital Ocean)
  • Docker, Kubernetes

 

Read more
A listed product development organization
A listed product development organization
Agency job
via RS Consultants by Biswadeep RS
Pune
4 - 8 yrs
₹15L - ₹15L / yr
skill iconAmazon Web Services (AWS)
skill iconKubernetes
Ansible
Prometheus
Grafana
+2 more

Position: Site Reliability Engineer

Location: Pune (Currently WFH, post pandemic you need to relocate)

 

About the Organization:

A funded product development company, headquarter in Singapore and offices in Australia, United States, Germany, United Kingdom, and India. You will gain work experience in a global environment.

 

Job Description:

We are looking for an experienced DevOps / Site Reliability engineer to join our team and be instrumental in taking our products to the next level.

 

In this role, you will be working on bleeding edge hybrid cloud / on-premise infrastructure handing billions of events and terabytes of data a day.

 

You will be responsible for working closely with various engineering teams to design, build and maintain a globally distributed infrastructure footprint.

As part of role, you will be responsible for researching new technologies, managing a large fleet of active services and their underlying servers, automating the deployment, monitoring and scaling of components and optimizing the infrastructure for cost and performance.

 

Day-to-day responsibilities

 

  • Ensure the operational integrity of the global infrastructure
  • Design repeatable continuous integration and delivery systems
  • Test and measure new methods, applications and frameworks
  • Analyze and leverage various AWS-native functionality
  • Support and build out an on-premise data center footprint
  • Provide support and diagnose issues to other teams related to our infrastructure
  • Participate in 24/7 on-call rotation (If Required)

 

Candidate's Profile:

 

 

  • Expert-level administrator of Linux-based systems
  • Experience managing distributed data platforms (Kafka, Spark, Cassandra, etc) Aerospike experience is a plus.
  • Experience with production deployments of Kubernetes Cluster
  • Experience in automating provisioning and managing Hybrid-Cloud infrastructure (AWS, GCP and On-Prem) at scale.
  • Knowledge of monitoring platform (Prometheus, Grafana, Graphite).
  • Experience in Distributed storage systems such as Ceph or GlusterFS.
  • Experience in virtualisation with KVM, Ovirt and OpenStack.
  • Hands-on experience with configuration management systems such as Terraform and Ansible
  • Bash and Python Scripting Expertise
  • Network troubleshooting experience (TCP, DNS, IPv6 and tcpdump)
  • Experience with continuous delivery systems (Jenkins, Gitlab, BitBucket, Docker)
  • Experience managing hundreds to thousands of servers globally
  • Enjoy automating tasks, rather than repeating them
  • Capable of estimating costs of various approaches, and finding simple and inexpensive solutions to complex problems
  • Strong verbal and written communication skills
  • Ability to adapt to a rapidly changing environment
  • Comfortable collaborating and supporting a diverse team of engineers
  • Ability to troubleshoot problems in complex systems
  • Flexible working hours and ability to participate in 24/7 on call support with other team members whenever required.
***** Looking for people from product organizations, who can join at the earliest.
Read more
fourth dimension technologies
Prathees Kumar
Posted by Prathees Kumar
Chennai
4 - 5 yrs
₹4L - ₹6L / yr
SolarWinds
ServiceNow
Microsoft SCOM
Monitoring

JOB Description

  • Monitoring entire infrastructure of Olam using various monitoring tools like
  • SCOM, SolarWinds, Telegraph, OEM.
  • Monitoring various types of alerts like
  • CPU Utilization
  • Memory Utilization
  • Database related alerts
  • DR Replication issues
  • Backup Failure Alerts
  • Exchange Mail Queue Threshold Alerts
  • Service Mailbox quota breach alert
  • Adobe Experience Manager / Site 24/7 Alerts
  • Application URL Alerting
  • Scheduling Maintenance Mode for planned Activity.
  • Daily repeat CI analysis of events/alerts/incident and raising proactive problem tickets which helps in reduction of major incident.
  • Handling Major Incidents, Driving the major incident bridge, sending communication about major incident to stake holders.
  • CMDB Inventory Management – Onboarding and Offboarding of Device's are commissioned/decommissioned.
  • Coordinating with Service Provider for MPLS related outage
  • Daily follow ups with Regional and internal teams to ensure all the node are up and running fine.


Read more
"A Product Startup"
"A Product Startup"
Agency job
Bengaluru (Bangalore)
5 - 8 yrs
₹5L - ₹20L / yr
Windows Azure
Microsoft Windows Azure
DevOps
Terraform
Solution architecture
+5 more

Senior Cloud Engineer / Jr. Cloud Solutions Architect

 

Roles and Responsibilities

  • Define, implement, deploy and maintain development, QA & production environments for cloud-based Azure architecture.

  • Create a strategy for establishing a secure and well-managed enterprise environment in Azure

  • Define and implement security architecture for production, ensure data security at all levels.

  • Provision Infrastructure as code using Azure CLI Powershell ARM templates and or Terraform with Ansible or other tools.

  • Develop scripts to automate the deployment of resource stacks and associated configurations

  • Extend MLP standard systems management processes into the cloud including change, incident, and problem management

  • Establish and implement monitoring and management infrastructure for both availability and performance management

  • Implement observability patterns using Azure Monitor Azure Application Insights and Log Analytics Workspace.

  • Provide internal training to the team.

 

Primary Skills/Requirements

  • 5+ years of experience in IT and infrastructure

  • 3+ years of experience in Azure design, support and management for a large-scale organization

  • Experience in design and implementation of high availability architecture.

  • Strong experience in Azure CLI Powershell and ARM Templates Terraform.

  • Strong understanding of IT Security and related audits

  • Experience with deploying applications on Linux - Ubuntu

  • Should know Azure offerings (Storage, OS instances, Availability zones, DR, Load balancers, VPN tunnel, Application Gateway, etc.)Cloud monitoring Experience with Azure Log Analytics Azure Monitor.

  • Experience with log collection tools and analysis, as well as infrastructure performance monitoring tools and optimization practices

  • Microsoft Azure Certification MCSE: Cloud Platform and Infrastructure or equivalent certification would be an added advantage

  • Experience with Postgres SQL Database

Behavioural

  • Positive work ethics

  • Ability to adapt to dynamic environment

  • Time Management

  • Team Player

  • Communication skills

  • Ability to work independently

Read more
Srijan Technologies
at Srijan Technologies
6 recruiters
Adyasha Satpathy
Posted by Adyasha Satpathy
Remote only
5 - 12 yrs
₹20L - ₹32L / yr
skill iconKubernetes
skill iconDocker
Ansible
Terraform
skill iconAmazon Web Services (AWS)
+6 more

SRE - Tech Lead (DevOps):

Location: Permanent Work From Home Option
Notice: Candidates with a notice period of 30 days and less and preferred

SRE-DevOps- Tech Lead - JD:

 

Srijan is hiring for Site Reliability Engineering (SRE), We are looking for SRE/DevOps- Tech Lead or Sr. Tech Lead with strong automation skills and a good understanding of how to build & run secure & reliable platforms for cloud-native applications. Please find below the detailed job description and kindly go through the same for reference:-



Minimum Experience: 6+ years in DevOps/SRE

Permanent WFH option

Job Description:-

The focus of this role is to build scalable, resilient, secure infrastructure for cloud-native applications whilst automating every mundane task you could think of and build observability dashboards, set up alerts, etc to provide optics to relevant stakeholders. In a nutshell: “You are keepers of Production environments”. You must be a problem solver with the ability to multitask and come with strong collaboration and communication skills.



Key Responsibilities:-

  • Proactively monitor and review application performance

  • Handle on-call and emergency support

  • Ensure software has good logging and diagnostics

  • Create and maintain operational runbooks

  • Contribute in Solution Designing and evaluating Technical Debt

  • Set right practices for Well-Defined Architecture & to minimize toil.

  • Own SLI, SLO configuration as per Error Budget

  • Maintain production services through measuring and monitoring availability, latency, and overall system health.

  • Practice sustainable incident response and blameless postmortems.

  • Not be afraid to contribute changes back to the Software engineering team to improve the systems.

  • Managing the delivery pipeline into production.

  • Able to mentor junior members on regular basis

  • Troubleshooting issues with web applications

  • Understanding of security principles and best practices

  • Ensuring that critical data is backed up

  • Configuration of monitoring systems including infrastructure monitoring and Application Performance Monitoring systems such as New Relic.

  • Ensuring that web application infrastructure is built

  • Ability to act as Customer Technical Advocate and negotiate well with peers on technical fronts.

  • Flexible enough to work in different Shifts for hyper business requirement

  • Ability to handle multiple global clients on tech front and generate desired reports to represent health of SRE Delivery.



Skills/Experience:-

  • A key skill of a SRE Tech Lead is that they have a deep knowledge of the application, the code, and how it runs, is configured, and scales. That knowledge is what makes them so valuable at also monitoring and supporting it as site reliability engineers.

  • System administration, security, and networking

  • The SRE Tech Lead expected to have a good understanding of system administration (Linux or Windows) and networking.

  • Essential commands

  • User and Group Management

  • Knowledge of networking concepts (DNS, TCP/IP, and Firewalls)

  • Service Configuration

  • Storage Management

  • Good grasp of fundamental security concepts

  • Good understanding of infrastructure as code principles.

  • Knowledge of a scripting language such as Bash

  • Ability to configure infrastructure using a Configuration Management technology such as Puppet, Chef, or Ansible.

  • Familiarity with Jenkins or any other CI/CD tool

  • Proficiency in a high-level programming language such as Python or Go.

  • Understanding of container technologies such as Docker, Kubernetes

  • 2 yrs+ hands on experience with container orchestration technologies such as ECS, EKS, AKS or Kubernetes would be beneficial.

  • Use Terraform and other IaC to deploy cloud infrastructure.







Cloud technologies:-

  • Experience designing available, cost-efficient, fault-tolerant, and scalable distributed systems on AWS/Azure

  • Hands-on experience using compute, networking, storage, and database AWS/Azure services

  • Hands-on experience of 4 yrs+ with AWS/Azure deployment and management services

  • Ability to identify and define technical requirements for an AWS/AZURE-based application

  • Ability to identify which AWS/AZURE services meet a given technical requirement

  • Knowledge of recommended best practices for building secure and reliable applications on the AWS/AZURE platform

  • An understanding of the AWS/AZURE global infrastructure

  • An understanding of network technologies as they relate to AWS/AZURE

  • An understanding of security features and tools that AWS/AZURE provides and how they relate to traditional services







 

Read more
An US based firm offering permanent WFH
An US based firm offering permanent WFH
Agency job
via Jobdost by Mamatha A
Remote only
3 - 10 yrs
₹5L - ₹15L / yr
skill iconPython
skill iconAmazon Web Services (AWS)
skill iconMongoDB
MySQL
skill iconDjango
+9 more

A network of the world's best developers - full-time, long-term remote software jobs with better compensation and career growth.  We enable our clients to accelerate their Cloud Offering and Capitalize on Cloud.  We have our own IoT/AI platform and we provide professional services on that platform to build custom clouds for their IoT devices.  We also build mobile apps, run 24x7 DevOps/site reliability engineering for our clients.

We are looking for a friendly, very hands-on technical, and dependable professional with plenty of experience as a backend & cloud engineer to provide site reliability services to our internal teams and end customers. We expect you to deliver with TOP quality & high speed. You must have experience developing and designing amazing UI screens.

 

This person MUST have:

  • BE Computer Science or equivalent
  • Cloud app development experience.
  • Strong Troubleshooting and debugging skills
  • A strong passion for writing simple, clean, and efficient code.
  • 3 years of experience with the Django framework and other backend technologies.
  • Knowledge of NodeJS
  • Experience with building, modifying, and extending API endpoints (REST or GraphQL) for data retrieval and persistence.
  • Understand how to use a database like Postgres (preferred choice), SQLite, MongoDB, MySQL.
  • Experience creating high-performance applications.
  • Experience with messaging and broker tools - Rabbitmq, MQTT
  • Experience with SQL and NoSQL databases
  • Experience with the full software development life cycle, including requirements collection, design, implementation, testing, and operational support.
  • Knowledge of web services
  • Proficient understanding of code versioning tools Git.
  • Hands-on experience deploying and managing infrastructure with CloudFormation/Terraform
  • Experience managing AWS infrastructure.
  • Hands-on experience in Linux environment.
  • Basic understanding of Kubernetes/Docker orchestration.
  • Manges existing infrastructure/Pipelines/Engineering tools (On-Prem or  AWS) for the engineering team (Build servers/Jenkins nodes etc.)
  • Experience with scrum or other agile software development methodology.
  • Excellent verbal and written communication, teamwork, decision making and influencing skills.
  • Handle customer calls/emails regarding technical issues for end-users.
  • Strong communication skills
  • Attention to detail.

 

 

Experience:

  • Min 3 year experience

 

Location:

  • Ahmedabad Office Or,
  • Work from home



Timings:

  • 40 hours a week with a rotational shift every month.

Position:

  • Full time/Direct
  • We have great benefits such as PF, medical insurance, 12 annual company holidays, 12 PTO leaves per year, annual increments, Diwali bonus, spot bonuses and other incentives, etc.
  • We don't believe in locking in people with large notice periods.  You will stay here because you love the company.  We have only a 30 days notice period
Read more
Uniphore Software Systems
Sandesh HS
Posted by Sandesh HS
Bengaluru (Bangalore)
5 - 10 yrs
₹25L - ₹40L / yr
SRE
Site Reliability Engineer
Reliability engineering
DevOps
skill iconKubernetes
+5 more
Your Responsibilities
  • We are looking for a Senior SRE with a proven track record of success leading complex cloud-hybrid environments. You will have:
  • Strong sense of Being an Owner, Wearing the Customer Shoes, with the ability to Empower Others demonstrated through clear
  • communication and collaboration.
  • Skills to work independently with multiple global teams, developing, configuring, deploying, and operating our global infrastructure on AWS and on-prem.
  • Operational experience in complex distributed and real-time systems, including experience with SLO/SLAs towards high availability,reliability and DR goals.
  • DevOps experience in building tools and frameworks, with an understanding of continuous deployment processes.
  • Ability to think at scale, bringing a focus on continuous delivery methodologies from design through deployment and operations.
  • Experience building and managing systems with tools including Kubernetes, Chef/Ansible/Puppet, Kafka, Docker, and Terraform.
Required Skill
  • 5+ years experience in a Software and/or Site Reliability Engineering role
  • Experience writing automation code in GoLang, Python or Java
  • Experience developing and operating large scale distributed systems with Kubernetes and Docker
  • Experience in running real time and low latency high available applications (Kafka, gRPC, RTP)
  • Experience running public cloud environments on AWS
  • Experience running hybrid clouds and on-prem infrastructures on Red Hat Enterprise Linux / CentOS
  • Bachelor degree in Engineering, Computer Science or equivalent experience
  • The ability to lead, partner, and collaborate cross functionally across an engineering organization
Read more
Nike
Remote only
5 - 10 yrs
₹20L - ₹30L / yr
Splunk
Site reliability
SRE
DevOps
skill iconAmazon Web Services (AWS)
+5 more
CORE - Site Reliability Engineer with Splunk
 
Within the Site Reliability Engineering our goal is to provide technical
solutions to complex production problems with a focus on reduction of
incident and problem toil, speeding detection and recovery of critical
incidents through observability and continuous improvement through
operational health measurement and sharing.
What You Will Work On
The following are a Site Reliability Engineer’s responsibility for this role but is
not limited to:
Drive reliability throughout the Engineering Organizations through
Observability, informed architectural improvements, and
automation.
Collaborate closely with Engineering teams to build cohesive
service operation solution into the overall service design.
Build and enhance the DevOps process, environment and tool
chains for high service reliability and availability.
Exercise and optimize the service operation process to support the
whole service with all partner teams. Mitigate and recover live site
incident efficiently.
Qualifications
Bachelor’s degree in Computer Science, Engineering, Math,
Science or another technical field
2+ years of working experience in IT industry in supporting large
scale applications/services on platforms like Azure/AWS/GCP.
3+ years of experience in software development automating
business processes using Java, Node or Python on Cloud platform
Experience in supporting high available and scalable systems with
ability to debug/troubleshoot live systems
Adaptive and flexible to manage multiple tasks with changing
priority
Hands on experience with Observability tools like Splunk,
NewRelic, Azure monitor or CloudWatch
2+ years of experience in Incident and problem management
process using tools like Service Now
Read more
SteelEye is a fast growing FinTech company based in London
SteelEye is a fast growing FinTech company based in London
Agency job
via Beiing by Divya R
Remote, Bengaluru (Bangalore)
3 - 8 yrs
₹15L - ₹30L / yr
skill iconPython
skill iconAmazon Web Services (AWS)
Ansible
Terraform
skill iconDocker
What you’ll do

• Develop and Maintain IAC using Terraform and Ansible
• Draft design documents that translate requirements into code.
• Deal with challenges associated with scale.
• Assume responsibilities from technical design through technical client support.
• Manage expectations with internal stakeholders and context-switch in a fast paced environment.
• Thrive in an environment that uses Elasticsearch extensively.
• Keep abreast of technology and contribute to the engineering strategy.
• Champion best development practices and provide mentorship.

What we’re looking for

• An AWS Certified Engineer with strong skills in
o Terraform
o Ansible
o *nix and shell scripting
• Preferably with experience in:
o Elasticsearch
o Circle CI
o CloudFormation
o Python
o Packer
o Docker
o Prometheus and Grafana
o Challenges of scale
o Production support
• Sharp analytical and problem-solving skills.
• Strong sense of ownership.
• Demonstrable desire to learn and grow.
• Excellent written and oral communication skills.
• Mature collaboration and mentoring abilities.
Read more
Why apply to jobs via Cutshort
people_solving_puzzle
Personalized job matches
Stop wasting time. Get matched with jobs that meet your skills, aspirations and preferences.
people_verifying_people
Verified hiring teams
See actual hiring teams, find common social connections or connect with them directly. No 3rd party agencies here.
ai_chip
Move faster with AI
We use AI to get you faster responses, recommendations and unmatched user experience.
21,01,133
Matches delivered
37,12,187
Network size
15,000
Companies hiring
Did not find a job you were looking for?
icon
Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.
companies logo
companies logo
companies logo
companies logo
companies logo
Get to hear about interesting companies hiring right now
Company logo
Company logo
Company logo
Company logo
Company logo
Linkedin iconFollow Cutshort
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Subodh Popalwar's profile image

Subodh Popalwar

Software Engineer, Memorres
For 2 years, I had trouble finding a company with good work culture and a role that will help me grow in my career. Soon after I started using Cutshort, I had access to information about the work culture, compensation and what each company was clearly offering.
Companies hiring on Cutshort
companies logos