Jobs at Toast
The recruiter has not been active on this job recently. You may apply but please expect a delayed response.
Now, more than ever, the Toast team is committed to our customers. We’re taking steps to help restaurants navigate these unprecedented times with technology, resources, and community. Our focus is on building a restaurant platform that helps restaurants adapt, take control, and get back to what they do best: building the businesses they love. And because our technology is purpose-built for restaurants by restaurant people, restaurants can trust that we’ll deliver on their needs for today while investing in experiences that will power their restaurant of the future.
At Toast, our Site Reliability Engineers (SREs) are responsible for keeping all customer-facing services and other Toast production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound software engineering principles, operational discipline, and mature automation to our environments and our codebase. Our decisions are based on instrumentation and continuous observability, as well as predictions and capacity planning.
About this roll* (Responsibilities)
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through automation and uplift
- Balance feature development speed and reliability with well-defined service level objectives
Troubleshooting and Supporting Escalations:
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Diagnose performance bottlenecks and implement optimizations across infrastructure, databases, web, and mobile applications
- Implement strategies to increase system reliability and performance through on-call rotation and process optimization
- Perform and run blameless RCAs on incidents and outages aggressively, looking for answers that will prevent the incident from ever happening again
Do you have the right ingredients? (Requirements)
- Extensive industry experience with at least 7+ years in SRE and/or DevOps roles
- Polyglot technologist/generalist with a thirst for learning
- Deep understanding of cloud and microservice architecture and the JVM
- Experience with tools such as APM, Terraform, Ansible, GitHub, Jenkins, and Docker
- Experience developing software or software projects in at least four languages, ideally including two of Go, Python, and Java
- Experience with cloud computing technologies ( AWS cloud provider preferred)
Bread puns are encouraged but not required
Similar companies
shoptap
About the company
Jobs
1
LimeTray
About the company
Jobs
6
Posist Technologies
About the company
Jobs
1
Tosh Innovations
About the company
Jobs
1
Lasper Technologies Pvt Ltd
About the company
We are an innovative startup empowering restaurants to excel in the online food delivery business. Our platform aggregates all online food delivery orders into a single consolidated dashboard, improving the efficiency of order management and aiding restaurant growth. With a strong presence in the Indian market and partnerships with major delivery services like Zomato, Swiggy, Dunzo, Thrive, and magicpin, we are making a significant impact in the restaurant industry. Backed by flat6labs Bahrain, our mission is to revolutionize how restaurants conduct their online food delivery business.
For more details visit our website www.foaps.co
Jobs
1
Touch2success
About the company
Jobs
1
Tosall
About the company
Jobs
4
Toobr
About the company
Jobs
0
Turgajo Technologies
About the company
Jobs
1