- Managing availability, performance, and capacity of infrastructure and applications.
- Building and implementing observability for application health, performance, and capacity.
- Optimizing on-call rotations and processes.
- Documenting “tribal” knowledge.
- Managing infrastructure platforms such as Mesos/Kubernetes, CI/CD, observability (Prometheus/New Relic/ELK), databases, and data platform infrastructure.
- Providing help in onboarding new services with a production readiness review process.
- Providing reports on service SLOs, error budgets, alerts, and operational overhead.
- Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
- Working with the Dev team to gain an in-depth understanding of the application architecture and its bottlenecks.
- Identifying observability gaps in product services and infrastructure, and working with stakeholders to fix them.
- Managing outages, performing detailed RCA with developers, and identifying ways to prevent recurrence.
- Managing/Automating upgrades of the infrastructure services.
- Automating toil.
- A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
About
It's a big product-based company
Similar jobs
CoinFantasy is looking for a tech enthusiast working primarily on blockchain technology to be part of the core blockchain team at CoinFantasy. You would be a part of the Roadmap team that is working on the architecture, design, development, and deployment of our decentralised platform.
Your primary responsibilities would be analysing requirements, designing blockchain technology around a certain business model, and writing smart contracts.
Job Responsibilities
- Administer our blockchain, database, and DevOps infrastructure.
- Collaborate across teams to coordinate safe, efficient releases.
- Build complex pipelines for databases, messaging, storage, and compute in AWS.
- Build a deployment pipeline with GitHub CI (Actions).
- Build tools to reduce occurrences of errors and improve our protocols.
- Develop software to integrate with internal back-end systems.
- Perform root cause analysis for production errors.
- Investigate and resolve technical issues.
- Design procedures for system troubleshooting and maintenance.
Requirements
- 8+ years of experience working with DevOps, Infrastructure, Site Reliability, or Cloud Engineering
- Understanding of the entire tech stack of blockchain dApps
- Strong experience working with any configuration management tools
- Languages: Any modern programming language
- Experience working with major public clouds, e.g. AWS, Azure
- Competent with the “basics”, e.g. computer networking
- Self-motivated individual with enthusiasm for learning and building things
- Collaborative, communicative, and confident in their abilities to work well with all team members at all seniority and skill levels
- Hands-on experience with Rust/Substrate and contributions to open-source blockchain projects are an added advantage
About Us
CoinFantasy is a Play to Invest platform that brings the world of investment to users through engaging games. With multiple categories of games, it aims to make investing fun, intuitive, and enjoyable for users.
It features a sandbox environment in which users are exposed to the end-to-end investment journey without risking financial losses.
Website: https://www.coinfantasy.io/
Benefits
- Competitive Salary
- An opportunity to be part of the Core team in a fast-growing company
- A fulfilling, challenging and flexible work experience
- Practically unlimited professional and career growth opportunities