Job Description Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Rapido's services—both our internally critical and our externally-visible systems—have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance. SRE is also a mindset and a set of engineering approaches to running better production systems—we build our own creative engineering solutions to operations problems. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. As SREs are responsible for the big picture of how our systems relate to each other, we use a breadth of tools and approaches to solve a broad spectrum of problems. Practices such as limiting time spent on operational work, blameless postmortems and proactive identification of potential outages factor into iterative improvement that is key to both product quality and interesting and dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow. # **Responsibilities** * Engage in and improve the whole lifecycle of services rom inception and design, through deployment, operation and refinement. * Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews. * Maintain services once they are live by measuring and monitoring availability, latency and overall system health. * Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. * Practice sustainable incident response and blameless postmortems. # **Qualifications** ## Minimum qualifications: * Bachelor's degree in Computer Science or equivalent practical experience. * Experience in one or more of the following: C, C++, Java, Python, Go, Perl, Ruby or shell scripting. * Experience with Unix/Linux operating systems internals and administration (e.g., filesystems, inodes, system calls) or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN). * 1 Year of Experience with working with Kubernetes or Dockers on Production. * 3 Years of Minimum experience in Devops (AND or OR Software development) ## Preferred qualifications: * Expertise in designing, analyzing and troubleshooting large-scale distributed systems. * Ability to debug and optimize code and automate routine tasks. * Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
About Company: Tracxn is a Bangalore based product company providing a research and deal sourcing platform for Venture Capital, Private Equity, CorpDevs & professionals working around the startup ecosystem. We are a team of 600+ working professionals serving customers across the globe. Our clients include Funds like Andreessen Horowitz, Sequoia Capital, Accel Partners, NEA; and Large Corporates such as ING, Societe Generale, LG and Royal Bank of Canada. We are backed by prominent investors like Ratan Tata, Nandan Nilekani, and SAIF Partners Founders: - Neha Singh (ex-Sequoia, BCG | MBA - Stanford GSB) - Abhishek Goyal (ex-Accel Partners, Amazon | BTech - IIT Kanpur) Roles and Responsibilities: - Design, Develop and Deliver products and frameworks (like Queuing System, Schedulers, etc) that will be used across all the Engineering teams in Tracxn. - Evaluating, Benchmarking and rolling out platform components like API Gateway, Load Balancers etc. - Driving centralized solutions like Service Discovery, Rate limiting etc for teams across Tracxn. - Extend or develop frameworks on top of docker to solve Tracxn's needs for scaling. - Working with Application Development teams to refactor the apps or build new modules to help onboard new architectures. Skills and Experience: - Must have experience in building fault-tolerant and scalable infrastructure. - Must have good conceptual, architectural & design skills. - Must have experience in at least one of the programming languages such as Java, C#, Python, Shell Script, Bash Script. - Must have experience in any one of the following databases like RDBMS, NoSQL databases. - Must have a deep understanding of how the software works at the systems level, familiarity with low-level aspects of performance, multi-threading, performance analysis, and optimization. - Good to have experience in working with Cloud Platforms like AWS, Google Cloud etc. - Good to have experience in Docker, Kubernetes, Ansible, Chef, Puppet. - Good to have experience in frameworks such as Kafka, HAproxy, Nginx. - Team management experience will be an added bonus. What we have to offer? - Work with a performance oriented team driven by ownership. - Learn to design system for high accuracy, efficiency, and scalability. - Focus on delivering quality work rather than deadlines. - Meritocracy driven, candid culture. - Very high visibility regarding startups ecosystem. Above all, you love to build and ship products that Customers will use every day.
We, Dotball, are developing a comprehensive fan engagement platform for cricket. We launched our fantasy gaming platform @ dotball.com . Cricket is the common language spoken by its 1 billion fans and we’re unifying all of them. We’re on a hiring spree for designers, developers and marketeers. We’re working out of a kick-ass, hi-tech apartment in central Bangalore, off Cunningham Road. All our designers and developers are provided with a Mac Book Pro MPXQ2HN/A. If this isn’t enough to catch your eye, we provide accommodation for a month *, so that you can peacefully do house hunting. And of course, lunch is on us. If working in style is your cup of tea, come join us. Skills required: Knowledge of Node.js and frameworks such as Express, StrongLoop, etc Understanding the nature of asynchronous programming and its quirks and workarounds Good understanding of server-side templating languages like Jade, EJS, etc Experience with micro-services or highly scalable infrastructure Experience working with Docker, Redis, MySQL, MongoDB Responsibilities: Integration of user-facing elements developed by front-end developers with server-side logic Building reusable code and libraries for future use Optimization of the application for maximum speed and scalability Implementation of security and data protection Design and implementation of data storage solutions Test coverage for the written code Troubleshoot and debug applications Job Perks Mac Book Pro MPXQ2HN/A will be provided for work purposes. Flexible work-hour policy Lunch is provided on all working days Short-term accommodation is offered for candidates re-locating to Bangalore.
We're looking for Senior Backend Engineer (2+ years experience) for our company - Spotmentor Technologies. Right now our Technology team has 4 members and this role is for early team member and carries significant ESOPs with it. We need someone who can lead the data science function with both vision and hands-on work and is excited to use this area to develop B2B products for enterprise productivity.RESPONSIBILITIES----------------------- • Collaborate with cross-functional team members to develop software libraries, tools, and methodologies as critical components of our computation platforms. • Use independent judgment to take existing code, understand its function, and change/enhance as needed. • Work as a team leader rather than a member.REQUIREMENTS--------------------• Proficient in Python with sound knowledge in Django/Flask.• Experience in building modular and efficient applications which can run on the scale.• Proficient in writing database queries (NoSQL preferably).• Basic knowledge of working with containers (Docker).• Follow best practices while writing code - PEP8, TDD, SOA etc.• Full understanding of VCS (mainly GIT).• Strong problem-solving skills and analytical thinking.
We are looking for a smart DevOps Engineer for one of our client project who: 1. Needs to be conversant with CI / CD 2. Needs to have ability to code and script (python, shell) 3. Needs to be conversant with open source technologies related to CI /CD < Jenkins, Docker, Docker Compose, Bash, Unix System Commands, Unix shell commands, Cloud Provisioning software like Kubernates and virtual containers) 4. Needs to be conversant with automation tools towards deployment verifications 5. Strong data and asset management 6. Needs to have a strong focus on business outcomes 7. Should be self-driven and creative 8. Needs to collaborate between various functions (Dev, QA, Infra) within the product ecosystem
Site Reliability EngineerJob Description :You will be administering the infrastructure of an indigenous one-of-a-kind artificial intelligence cloud platform. You will be working with the dev teams to deploy, monitor and scale the distributed platform to handle real time AI analysis and loads and loads of visual data (images and videos in various formats). We're looking for people with extensive dev-ops experience and strong system programming skills.Responsibilities :1. You will be responsible for the up time and reliability of infrastructure of SigTuple and help backend teams achieve it by writing reliable software and automation2. Work with other development teams to automate deployment of modules and manage the build and release pipeline.3. Extensive process-level and node-level monitoring and auto healing of entire cluster.4. Managing, provisioning and servicing cloud servers.5. Contribution to back-end services to contribute to its infrastructure system design.Requirements :1. BTech/MTech in any engineering discipline.2. 3-6 years of experience in an Dev-Ops/Software Engineering role.3. Experience in management of cloud computing services. Extensive knowledge of any one cloud platform (Kubernetes, AWS, GCP, Azure etc.)4. Proficiency with any major monitoring framework (Sensu, Nagios etc.).5. Comfortable with any one scripting language (Python, Perl) and a Configuration management or Orchestration Tool (Ansible, Chef etc)6. Proficiency with OS and network fundamentals and strong Linux administrator skills.7. Experience with Container Tools (Docker ecosystem) will be a plus8. Experience of working with issues of scale of a system.9. Experience of working in a startup is a plus.
Required Skills: Strong experience in AWS / Google Cloud. Strong development experience in Perl, Python, Docker, and Postgres. Strong experience in build/release management. Working experience on Linux. Excellent knowledge of shell scripts. Knowledge of Virtualization Platforms VMware. Working experience on Configuration Management tools. Working experience on Test and Build Systems Jenkins/Maven Should have strong communication skills, a passion to learn, and an ability to work well with people at all levels of an organization. Roles and Responsibilities: Create Deployment Unit consisting build, documents and installation artifacts. Preparing Delivery definition / Release Note / Production turn-over Note documents. Establish DevOps Policies. Communicate with developers, product managers and technical support specialists on product issues. Assist in Creating and maintaining Configuration and Change Management Plan for the project. Choosing suitable DevOps tools. Setting up Configuration Management Environment. Assist in routine back-up and archival of project repository.
Building Mobile & Saas products for Skill Training institutes focused at the bottom of the pyramid and connecting them to jobs. Looking for someone who is passionate and driven, and wants make a difference solving hard problems.
Will be part of product development team and will be supporting the team with design and implementation of several features for distributed docker deployments. - 2+ Years of experience in using, deploying Docker, LXC or Rkt containers. - Programming experience in C, C++, GoLang. Engineers with other programming language experience with willingness to learn GoLang are also welcome. - Experience in the following areas is added advantage REST interface implementation, OAUTH, ProtoBuff, RabbitMQ etc.
Build, deploy and release automation engineer. Will be part of product development team and will be supporting the team with formulating different workflows for build, deploy and release. Product uses new age technologies such as GoLang, AngularJS, Docker etc. - 5 years of experience managing SCM & build environments. - Experience with GIT tooling and scripting. - Programming/scripting experience in Python, Bash etc - Branching, Versioning management and process automation. - Experience with Docker image management as part of product build and release. - Deployment on Cloud, on virtual machines. - Experience with service deployment on Kubernetes, Mesos etc will be added advantage.