Site Reliability Engineer/DevOps

at Digital B2B Platform

Agency job
Bengaluru (Bangalore)
3 - 4 yrs
₹15L - ₹30L / yr
Full time
Skills
DevOps
Python
CI/CD
Linux/Unix
Git
SQL
Amazon Web Services (AWS)
Ansible
MySQL
Kubernetes
Terraform
We are a digital B2B platform that offers loans, working capital, and payment services to small businesses.

Candidates MUST HAVE product-based company experience and a minimum of 3 years of experience in DevOps.

What you will do (or learn):

1. Build our application stack on AWS using infrastructure as code (read: Terraform).
2. Build state-of-the-art CI/CD pipelines.
3. Manage data warehouses and data pipelines.
4. Work on infrastructure and data security.
5. Build a state-of-the-art log management system and the tooling around it.
6. Build a monitoring and alerting system.

What do we expect from you?
1. 3 to 10 years of experience with DevOps or SRE principles.
2. Good fundamentals of database management and other distributed systems management.
3. Experience in infrastructure as code or other configuration management systems.
4. Experience in scripting languages (such as Bash, Python, Go, etc.).
5. Good understanding of Linux systems.
6. Strong debugging and troubleshooting skills.
7. Experience in tooling around monitoring, CI/CD, and log management systems.

Similar jobs

Python
DevOps
Amazon Web Services (AWS)
Ansible
Terraform
Kubernetes
CI/CD
Git
Linux/Unix
Bengaluru (Bangalore)
4 - 8 yrs
₹25L - ₹60L / yr
We are a digital B2B platform that offers loans, working capital, and payment services to small businesses.

Candidates MUST HAVE product-based company experience and a minimum of 3 years of experience in DevOps.

What you will do (or learn):

1. Build our application stack on AWS using infrastructure as code (read: Terraform).
2. Build state-of-the-art CI/CD pipelines.
3. Manage data warehouses and data pipelines.
4. Work on infrastructure and data security.
5. Build a state-of-the-art log management system and the tooling around it.
6. Build a monitoring and alerting system.

What do we expect from you?
1. 3 to 10 years of experience with DevOps or SRE principles.
2. Good fundamentals of database management and other distributed systems management.
3. Experience in infrastructure as code or other configuration management systems.
4. Experience in scripting languages (such as Bash, Python, Go, etc.).
5. Good understanding of Linux systems.
6. Strong debugging and troubleshooting skills.
7. Experience in tooling around monitoring, CI/CD, and log management systems.
Job posted by
Shalaka ZawarRathi
Windows Azure
Microsoft Windows Azure
DevOps
Terraform
Solution architecture
SQL Azure
Linux/Unix
Ansible
ARM TEMPLATES
DESIGN
Bengaluru (Bangalore)
5 - 8 yrs
₹5L - ₹20L / yr

Senior Cloud Engineer / Jr. Cloud Solutions Architect

 

Roles and Responsibilities

  • Define, implement, deploy and maintain development, QA & production environments for cloud-based Azure architecture.

  • Create a strategy for establishing a secure and well-managed enterprise environment in Azure

  • Define and implement security architecture for production, ensure data security at all levels.

  • Provision infrastructure as code using Azure CLI, PowerShell, ARM templates, and/or Terraform with Ansible or other tools.

  • Develop scripts to automate the deployment of resource stacks and associated configurations

  • Extend MLP standard systems management processes into the cloud including change, incident, and problem management

  • Establish and implement monitoring and management infrastructure for both availability and performance management

  • Implement observability patterns using Azure Monitor, Azure Application Insights, and Log Analytics Workspace.

  • Provide internal training to the team.

 

Primary Skills/Requirements

  • 5+ years of experience in IT and infrastructure

  • 3+ years of experience in Azure design, support and management for a large-scale organization

  • Experience in design and implementation of high availability architecture.

  • Strong experience in Azure CLI, PowerShell, ARM templates, and Terraform.

  • Strong understanding of IT Security and related audits

  • Experience with deploying applications on Linux - Ubuntu

  • Should know Azure offerings (Storage, OS instances, Availability zones, DR, Load balancers, VPN tunnel, Application Gateway, etc.)

  • Cloud monitoring experience with Azure Log Analytics and Azure Monitor.

  • Experience with log collection tools and analysis, as well as infrastructure performance monitoring tools and optimization practices

  • Microsoft Azure Certification MCSE: Cloud Platform and Infrastructure or equivalent certification would be an added advantage

  • Experience with PostgreSQL database

Behavioural

  • Positive work ethic

  • Ability to adapt to a dynamic environment

  • Time Management

  • Team Player

  • Communication skills

  • Ability to work independently

Job posted by
Ashwini HC

Site Reliability Engineers

at Sarvaha Systems Private Limited

Founded 2011  •  Products & Services  •  20-100 employees  •  Profitable
Google Cloud Platform (GCP)
Amazon Web Services (AWS)
Microsoft Windows Azure
DevOps
Python
Kubernetes
Jenkins
Cassandra
Terraform
Windows Azure
Java
ELKI
SRE
Grafana
Remote only
3 - 5 yrs
₹12L - ₹20L / yr

JD: Site Reliability Engineers

Location: Pune, Remote

     

Sarvaha would like to welcome experienced SRE specialists with a minimum of 5 years of professional experience in Google Cloud Platform or AWS based deployments and automation. Sarvaha is a niche software development company that works with some of the best-funded startups and established companies across the globe. You will be expected to work with a globally distributed team and contribute independently as well as lead a team of engineers. This is a hands-on position that would require you to be responsible for production software deployments across global availability zones.

 

Key Responsibilities

 

  • Design, write and run services that provide visibility into a leading IoT platform & underlying services
  • Automate deployments, diagnostic and debugging tools
  • Participate in on-call rotations
  • Adhere to industry-standard security best practices  
  • Work with other teams in troubleshooting and keeping the systems up and running

 

Skills Required

 

  • Minimum Bachelor’s Degree in Computer Science or related degree
  • Minimum 5+ years of total experience with at least 4 years of experience in an SRE, DevOps or similar role. More experience is highly desired
  • 4+ years of hands-on experience with one of AWS/Azure/GCP is a must-have for this position
  • 1+ years of experience debugging code written in Python, Java or any strongly typed language
  • 3+ years of experience with Kubernetes, Prometheus, ELK, Grafana, Nagios
  • 2+ years of experience with Jenkins or similar build and deploy orchestration tool
  • 2+ years of experience with RDBMS and NoSQL databases (MySQL, Oracle, Cassandra, CDH)
  • 1+ years of experience writing infrastructure as code using Terraform
  • Excellent verbal and written communication and strong interpersonal skills are requisite for success in this position
  • Strong listening and interpersonal skills and attention to detail are highly desired

 

Position Benefits

 

  • Top-notch remuneration with non-linear growth
  • Work with industry-best cloud architects, DevOps teams and developers
  • Excellent, no-nonsense work environment with the very best people to work with
  • Cutting-edge work with Fortune 500 businesses, with the chance to learn from high-visibility systems that drive public-facing, high-traffic systems
Job posted by
Santosh Maskar

Senior DevOps Engineer

at Biostrap

Founded 2016  •  Products & Services  •  20-100 employees  •  Bootstrapped
Amazon Web Services (AWS)
DevOps
Terraform
Kubernetes
Python
Go Programming (Golang)
Shell Scripting
Javascript
Docker
Ansible
System Administration
Elastic Search
Monitoring
Amazon RDS
MySQL
SQL
Prometheus
ELK
Grafana
Remote only
4 - 10 yrs
₹12L - ₹30L / yr

Hey there!

 

Biostrap is based in Los Angeles, California, with our team working remotely in several countries around the globe. This is a remote position; you’ll need a computer and a high-speed internet connection.

 

We are looking for the tough kind, the warrior kind: always-learning Sr. DevOps Engineers to take care of our infrastructure and site reliability at Biostrap. As an engineer at Biostrap, you will be part of a lean but extremely passionate team of engineers, working towards making and keeping Biostrap the go-to health platform.

 

Responsibilities: What would the job be like?

  • Work closely with the engineering team to deploy and maintain the infrastructure.
  • Add automation at every part of the development and deployment lifecycle.
  • Analyze and help in Infrastructure cost optimizations.
  • Build and work with CI + CD workflows.
  • Build a robust observability system for system monitoring and tracing.
  • Architect scalable logging servers.
  • Add extensive alerting for important issues and events using monitoring and logging services.
  • Work with other engineers in developing architecture that is scalable and resilient to changes in product requirements and usage in an agile environment.
  • Harden the security of cloud infrastructure against known/unknown vulnerabilities.
  • Write Infrastructure as Code for most of the cloud.
  • Suggest and implement pragmatic changes to infrastructure to increase performance, resilience and availability and to fool-proof infrastructure for the future.
  • Build auditing systems for various resource accesses and have a breach detection notification system.
  • Do periodic security reviews and implement improvements.
  • Be in charge of and manage deployments of various services.
  • Work with AWS resources, containers and systems like Ansible/EKS/Kubernetes.

 

Qualifications: Who should apply for this role?

  • You have 3+ years of working in small to medium size teams building and shipping products.
  • Strong grasp of at least one of the scripting or systems languages like Python, Javascript, Golang etc.
  • Good experience managing various AWS resources.
  • Well equipped with Linux and Bash/Shell scripting
  • Working knowledge of Docker or container management.
  • Have some development experience with Kubernetes.
  • You spin out containers as if it's your fantasy war ground. 
  • Understand deployment tools like Ansible or similar.
  • Built and worked with CI+CD systems like GitLab CI, Jenkins, CircleCI, Travis CI, etc.
  • Working knowledge of Git for version control.
  • Experience with database management and security.
  • Experience with Terraform for Infrastructure as Code.
  • Knowledge of configuration management and secrets/keys management services like AWS KMS, Vault etc.
  • Required to be proficient in English (both speaking and writing).

 

 

Brownie Points for (:D):

  • You already use Biostrap and have plenty of feedback to provide.
  • You can lecture developers on scalable infrastructures.
  • You have built or worked with Prometheus, Grafana, ELK systems.
  • You have a story to tell about how you managed a failure or were part of a disaster recovery.
  • You contribute to Open Source projects or have a good GitHub/GitLab presence to showcase your past projects.
  • You have sent your code to Space and it runs “a” Rover on Mars. :P
Job posted by
Anirban Das

DevOps Engineer

at www.sourcewiz.co

Founded 2020  •  Product  •  0-20 employees  •  Raised funding
Docker
Terraform
Amazon Web Services (AWS)
DevOps
Bengaluru (Bangalore)
1 - 5 yrs
₹5L - ₹20L / yr
At Sourcewiz, we are building tools to help exporters grow their businesses. Our first product is a vertical sales software built for exporters, which allows them to market their unique creations to more buyers, generate more inquiries and increase their sales conversion.

Founded by a passionate team of serial entrepreneurs and alumni of IIT Delhi, U.C Berkeley, and well-known tech companies such as Uber and Zomato.

Sourcewiz is on a mission to increase India’s export GDP. This is a unique opportunity to join a funded early-stage startup and have a massive impact on our product, culture, and direction. It's a lot of work and a roller coaster ride. But, if you are up for it, you can join us in replacing the tiresome and slow sales process for importers and exporters and have a significant impact on our customers. We are not a company that believes engineers should be hidden away from decisions, churning out code for features decided from on high. Instead, our Engineers form strong bonds with cross-functional peers in Product Management, Product Design and others to become experts in their product domain.

We’re looking for people who have a strong interest in building successful products or systems, are comfortable dealing with lots of moving pieces, have exquisite attention to detail, and are comfortable learning new technologies and systems.

As a Site Reliability Engineer at Sourcewiz, you will...
• Own and improve the scalability and reliability of our products
• Work directly with the product engineering team
• Work with RDBMS, search, caching and queuing
• Contribute expertise towards architectural planning and ensure the company builds sustainable services that meet our customer expectations while leveraging appropriate tools and frameworks.
• Participate in ongoing review and testing
Job posted by
Saakshi Bhartiya
DevOps
Terraform
Ansible
CI/CD
Linux administration
Kubernetes
Amazon Web Services (AWS)
Puppet
Chef
Python
Java
Go Programming (Golang)
Bengaluru (Bangalore)
6 - 11 yrs
₹20L - ₹38L / yr

 

Roles and Responsibilities

  • Managing Availability, Performance, Capacity of infrastructure and applications.
  • Building and implementing observability for applications health/performance/capacity.
  • Optimizing On-call rotations and processes.
  • Documenting “tribal” knowledge.
  • Managing infra platforms like Mesos/Kubernetes, CI/CD, Observability (Prometheus/New Relic/ELK), Cloud Platforms (AWS/Azure), Databases, and Data Platforms Infrastructure.
  • Providing help in onboarding new services with production readiness review process.
  • Providing reports on services SLO/Error Budgets/Alerts and Operational Overhead.
  • Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
  • Working with Dev team to have an in-depth understanding of the application architecture and its bottlenecks.
  • Identifying observability gaps in product services and infrastructure, and working with stakeholders to fix them.
  • Managing outages, doing detailed RCA with developers, and identifying ways to avoid recurrence.
  • Managing/automating upgrades of the infrastructure services.
  • Automating toil work.

Experience & Skills

  • 6+ years of total experience
  • Experience as an SRE/DevOps/Infrastructure Engineer on large scale microservices and infrastructure.
  • A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
  • A deep understanding of computer science, software development, and networking principles.
  • Demonstrated experience with languages such as Python, Java, Golang, etc.
  • Extensive experience with Linux administration and a good understanding of the various Linux kernel subsystems (memory, storage, network, etc.).
  • Extensive experience in DNS, TCP/IP, UDP, gRPC, routing and load balancing.
  • Expertise in GitOps, infrastructure-as-code tools such as Terraform, and configuration management tools such as Chef, Puppet, SaltStack, and Ansible.
  • Expertise in Amazon Web Services (AWS) and/or other relevant cloud infrastructure solutions like Microsoft Azure or Google Cloud.
  • Experience in building CI/CD solutions with tools such as Jenkins, GitLab, Spinnaker, Argo, etc.
  • Experience in managing and deploying containerized environments using Docker, Mesos/Kubernetes is a plus.

Job posted by
RAKESH RANJAN

Site Reliability Engineer

at Uniphore Software Systems

Founded 2008  •  Product  •  500-1000 employees  •  Raised funding
SRE
Site Reliability Engineer
Reliability engineering
DevOps
Kubernetes
Terraform
Linux/Unix
Amazon Web Services (AWS)
Java
Python
Bengaluru (Bangalore)
5 - 10 yrs
₹25L - ₹40L / yr
Your Responsibilities
We are looking for a Senior SRE with a proven track record of success leading complex cloud-hybrid environments. You will have:
  • Strong sense of Being an Owner, Wearing the Customer Shoes, with the ability to Empower Others demonstrated through clear communication and collaboration.
  • Skills to work independently with multiple global teams, developing, configuring, deploying, and operating our global infrastructure on AWS and on-prem.
  • Operational experience in complex distributed and real-time systems, including experience with SLO/SLAs towards high availability, reliability and DR goals.
  • DevOps experience in building tools and frameworks, with an understanding of continuous deployment processes.
  • Ability to think at scale, bringing a focus on continuous delivery methodologies from design through deployment and operations.
  • Experience building and managing systems with tools including Kubernetes, Chef/Ansible/Puppet, Kafka, Docker, and Terraform.
Required Skills
  • 5+ years experience in a Software and/or Site Reliability Engineering role
  • Experience writing automation code in GoLang, Python or Java
  • Experience developing and operating large scale distributed systems with Kubernetes and Docker
  • Experience in running real-time, low-latency, highly available applications (Kafka, gRPC, RTP)
  • Experience running public cloud environments on AWS
  • Experience running hybrid clouds and on-prem infrastructures on Red Hat Enterprise Linux / CentOS
  • Bachelor's degree in Engineering, Computer Science or equivalent experience
  • The ability to lead, partner, and collaborate cross functionally across an engineering organization
Job posted by
Sandesh HS

Site Reliability Engineer

at SteelEye, a fast-growing FinTech company based in London

Agency job
via Beiing
Python
Amazon Web Services (AWS)
Ansible
Terraform
Docker
Remote, Bengaluru (Bangalore)
3 - 8 yrs
₹15L - ₹30L / yr
What you’ll do

• Develop and maintain IaC using Terraform and Ansible.
• Draft design documents that translate requirements into code.
• Deal with challenges associated with scale.
• Assume responsibilities from technical design through technical client support.
• Manage expectations with internal stakeholders and context-switch in a fast paced environment.
• Thrive in an environment that uses Elasticsearch extensively.
• Keep abreast of technology and contribute to the engineering strategy.
• Champion best development practices and provide mentorship.

What we’re looking for

• An AWS Certified Engineer with strong skills in
o Terraform
o Ansible
o *nix and shell scripting
• Preferably with experience in:
o Elasticsearch
o CircleCI
o CloudFormation
o Python
o Packer
o Docker
o Prometheus and Grafana
o Challenges of scale
o Production support
• Sharp analytical and problem-solving skills.
• Strong sense of ownership.
• Demonstrable desire to learn and grow.
• Excellent written and oral communication skills.
• Mature collaboration and mentoring abilities.
Job posted by
Divya R
site reliability
cloudformation
Terraform
Ansible
Cloud Automation
Software Development
AWS CloudFormation
Algorithms
Data Structures
Python
Powershell
DynamoDB
MySQL
Hyderabad
5 - 11 yrs
₹10L - ₹20L / yr
  • 5+ years of software development or site reliability engineering or equivalent experience
  • Skilled at problem solving, algorithms, and data structures
  • Building tools and scripting frameworks from scratch
  • Working with Cloud Automation tools like CloudFormation, Terraform, CDK, aws-cli
  • Scripting languages like Python, Groovy, PowerShell, Bash, Perl etc.
  • Configuration automation using Ansible or equivalent tools
  • Exposure to Windows and Linux administration
  • Project management tools like Jira, Trello
  • Prior experience in dealing with Datastore technologies like Postgres, MySQL, SQL, DynamoDB is desirable
  • Familiarity with basic networking, security and cloud engineering concepts
  • Team player who is eager to help others to succeed through mentoring and leading by example
  • Highly collaborative with effective written and verbal communication skills
Job posted by
Pradeep Kumar Burra

Site Reliability Engineer

at Shuttl

Founded 2015  •  Product  •  100-500 employees  •  Raised funding
Terraform
Kubernetes
Ansible
NCR (Delhi | Gurgaon | Noida)
3 - 6 yrs
₹10L - ₹21L / yr
WHAT WILL I DO?
You will work as a Site Reliability Engineer responsible for the availability, performance, monitoring, and incident response, among other things, of the platforms and services used and owned by Shuttl. The SRE Team works alongside the Engineering team and owns every aspect of service availability as well as disaster recovery and business continuity plans. You will work with other Site Reliability Engineers and report to the Lead of the Site Reliability Engineering Team.

HOW DO WE WORK?
Our engineering process is a five-step process which consists of phases for planning, developing, testing & profiling, releasing and monitoring. The planning phase consists of documenting the feature/task to be done, followed by various discussions. These discussions cover product, delivery estimates, release plan, monitoring plan, test plans, architecture, code design, technology choices and best practice adoption. The development and testing phases coexist and involve writing code, unit tests, performance tests, profiling, stress testing, code reviews and QA testing. This phase is punctuated with daily scrums and standups. The release phase is largely about managing and communicating the release to customers and internal stakeholders and activating features. The last phase is the monitoring phase, where relevant metrics and exceptions are tracked and any critical refinement for the delivered feature is undertaken. This phase culminates with a retrospective.

SREs get involved in this process as early as possible to provide general guidance and recommendations, and to help with designing the application to be in compliance with community standards such as CNCF and 12 Factor. SRE involvement and influence tend to increase during the mid to final stages of development, where the application is primed for beta evaluation and all the tooling and instrumentation is finalized.

WHAT SKILLS SHOULD I HAVE?
For this role we expect you to have 3+ years of experience working as a DevOps Engineer or SRE. You should have a good grasp of Unix-like systems, access control, networking nuances, process isolation by means of kernel-provided features, distributed applications and algorithms, job schedulers and secret management, among other things. At Shuttl we are big proponents of immutable infrastructure. All our infrastructure is hosted with Amazon Web Services and we use HashiCorp's Terraform to manage the infrastructure as code. A good handle on AWS and Terraform is therefore a definite plus. Since SREs are expected to write a lot of code, you are also expected to be skillful in a programming language, preferably Python or Go.
Job posted by
Tanika Monga