About the team Site Reliability Engineering team in Media.Net is responsible for managing scaling, performance, monitoring, security, availability of the production environment. The focus is to architect, develop, automate and deploy products and infrastructure based on Linux and Linux application stacks. Our environment consists of our own BareMetal and private cloud across co-located datacenter facility and the AWS public cloud. Our engineering teams follow DevOps practices and we rely heavily on open source tools like Jenkins, Selenium, Git, Puppet, Docker, Kubernetes, Open stack, Nagios/Icinga, Kafka, Graphite, Hadoop, Graphite, ELK, Vault etc. We use Python and Go majorly in SRE teams. What is the job like? Engage with product and engineering team to design, build and maintain the system / software for high availability proactively and drive operation best practices Identify and drive opportunities in making resilient systems that help maintain business continuity Proactively perform troubleshooting, RCA and implement permanent resolution of issues across the stacks – hardware, software, database, network and so on Implementation of proactive monitoring, alerting, trend analysis and self-healing systems Develop continuous delivery for multiple platforms in production and staging environments Find areas of existing manual intervention, and replace with automation wherever possible Demonstrate ability to design, implement and manage highly available, scalable and reliable systems Infrastructure and platform security Effectively use and maintain Infrastructure and config management tools like puppet, chef, ansible, terraform to deploy and manage infrastructure Demonstrate technical mentoring and coaching to team members Adaptable to work in a fast-paced environment and alter priorities as per business needs
Work on the toughest problems in stock markets. Lead our efforts to reach 99.999% availability for 25+ services across 200+ servers. Summary We are building the fastest, most reliable & intelligent trading platform. That requires highly available, scalable & performant systems. And you will be playing one of the most crucial roles in making this happen. You will be leading our efforts in designing, automating, deploying, scaling and monitoring all our core products. Tech Facts so Far 1. 8+ services deployed on 50+ servers2. 35K+ concurrent users on average 3. 1M+ algorithms run every min 4. 100M+ messages/min We are a 4-member backend team with 1 Devops Engineer. Yes! this is all done by this incredible lean team. Big Challenges for You 1. Manage 25+ services on 200+ servers2. Achieve 99.999% (5 Nines) availability 3. Make 1-minute automated deployments possible If you like to work on extreme scale, complexity & availability, then you will love it here. Who are we We are on a mission to help retail traders prosper in the stock market. In just 3 years, we have the 3rd most popular app for the stock markets in India. And we are aiming to be the de-facto trading app in the next 2 years. We are a young, lean team of ordinary people that is building exceptional products, that solve real problems. We love to innovate, thrill customers and work with brilliant & humble humans. Key Objectives for You • Spearhead system & network architecture • CI, CD & Automated Deployments• Achieve 99.999% availability • Ensure in-depth & real-time monitoring, alerting & analytics• Enable faster root cause analysis with improved visibility• Ensure a high level of security Possible Growth Paths for You • Be our Lead DevOps Engineer • Be a Performance & Security Expert Perks • Challenges that will push you beyond your limits • A democratic place where everyone is heard & aware
Important to have : 1. Linux experience 2. Nginx or any webtier 3. Any scripting like terraform or ansible 4. Jenkins pipeline 5. Ci/CD 6. Containers and dockers
Looking for someone who breathes and speaks infrastructure and automation. We want to setup an internal PaaS which achieves single click deployments, 100 percent uptime, full application monitoring and ability to do 100 Deployments per day if needed and finally should be cloud agnostic. This is the opportunity and someone who can do this under 6 months is our guy. About Chalo: Chalo is a free app that tracks buses live and tells you what time your bus will reach your stop. Now you never have to wait at a bus stop ever again. Transport is fast becoming a fundamental need - like water or air. More people are travelling more and more every day. Yet, only around 15% of Indians can afford their own car, private taxi or a two-wheeler. The remaining 85% depend on public transport. Moreover, with growing traffic and congestion it’s becoming increasingly clear that private transport is causing more problems that it is solving; and a small but significant (and growing) segment are outright rejecting car ownership, even if they can afford it. At Chalo, our core purpose is to make travel better for everyone, and we believe that our cities, our health and our lives will be better when we improve the way we travel. We have begun our journey by focusing on bus travel - making it easier for those that rely on it, and also for those that are choosing it. Buses, as a mode of transport, form the core architecture of any city's transport system. They also offer the largest improvement areas and the largest opportunity to create an impact, with 2 out of 3 public transport users depending on buses for their travel. We welcome a conversation with anyone who shares our vision and values, and especially those who wish to offer a counter-point :-) Chalo!