
Position – SRE Engineer
Location – Navi Mumbai
Who are we
Based out of IIT Bombay, HaystackAnalytics is a HealthTech company creating clinical genomics products, which enable diagnostic labs and hospitals to offer accurate and personalized diagnostics. Supported by India's most respected science agencies (DST, BIRAC, DBT), we created and launched a portfolio of products to offer genomics in infectious diseases. Our genomics-based diagnostic solution for Tuberculosis was recognized as one of the top innovations supported by BIRAC in the past 10 years, and was launched by the Prime Minister of India in the BIRAC Showcase event in Delhi, 2022.
About the Role:
We are seeking a highly skilled Site Reliability Engineer (SRE) to join our infrastructure team. The ideal candidate will bring deep expertise in Linux systems, cloud infrastructure automation, and network administration, with a strong focus on reliability, scalability, and performance, while ensuring firewall and network security compliance.
Key Responsibilities:
- Develop and maintain automation scripts (Bash, Python, etc.) for system and cloud infrastructure operations.
- Manage, monitor, and troubleshoot Linux servers in production environments.
- Use Linux network tools to triage network connectivity and performance issues.
- Configure, maintain, and secure network infrastructure including switches, routers, and firewalls.
- Design and execute Network Continuity Plans (NCP) and disaster recovery strategies.
- Collaborate with cross-functional teams to triage and resolve site-specific production issues related to server deployment, SOP adherence, and defect resolution.
- Collaborate with developers and DevOps teams to define SLAs, SLOs, and error budgets.
- Implement proactive monitoring and alerting using modern observability tools.
- Participate in on-call rotations and incident response processes.
Objectives of this Role:
- Build, automate, and manage scalable infrastructure and deployment pipelines to support development and production environments.
- Enable rapid, secure, and reliable software delivery across engineering teams.
- Ensure system availability, performance, and security in cloud and containerized environments.
- Implement and enforce best practices in CI/CD, observability, and incident response.
- Collaborate with cross-functional teams including software developers, QA, and product managers.
- Proactively identify and automate manual processes to improve engineering efficiency.
Requirements:
- 3–7 years of experience as an SRE, DevOps, or Systems/Network Engineer.
- Strong scripting skills (e.g., Bash, Python, or Go).
- Proficiency in Linux administration and performance tuning.
- Deep understanding of TCP/IP, routing, DNS, firewalls, and networking protocols.
- Hands-on experience managing network infrastructure and firewall rules (e.g., iptables, pfSense, Palo Alto, etc.).
- Docker (multi-stage builds, docker-compose)
Soft Skills
- Proactive problem-solving and ownership mindset
- Strong documentation and communication skills
- Ability to mentor junior engineers
- Curiosity to learn and experiment with emerging DevOps tools
Preferred Qualifications:
- Certifications such as CKA/CKAD, RHCE, AWS Certified SysOps Administrator, or Cisco CCNA/CCNP.
- Experience with service meshes (e.g., Istio), observability stacks (e.g., Prometheus, Grafana), or network policy management (e.g., Calico, Cilium).
- Exposure to secure DevOps practices and compliance standards (e.g., ISO, NIST).
To know more about us – https://haystackanalytics.in

Similar jobs
VLAN, STP and many other switching configurations.
Taking ownership of the technical support for assigned analytical and ability to work under pressure; future job scope involves implementation and troubleshooting in the routing and switching domain (L2 and L3).


We are looking for networking professionals with the following skill set:
Experience: 6+ years of experience in the networking domain
Key skills:
- Must have 6+ years of experience in C/C++ programming language.
- Knowledge of Go programming language and Python programming language is a big plus.
- Strong background in L4-L7 Internet protocols: TCP, HTTP, HTTP/2, gRPC and HTTPS/SSL/TLS.
- Background in Internet security related products such as Web Application Firewalls, API Security Gateways, Reverse Proxies and Forward Proxies
- Proven knowledge of Linux kernel internals (process scheduler, memory management, etc.)
- Experience with eBPF is a plus.
- Hands-on experience in cloud architectures (SaaS, PaaS, IaaS, distributed systems) with continuous delivery
- Familiar with containerization solutions like Docker/Kubernetes etc.
- Familiar with serverless technologies such as AWS Lambda.
- Exposure to machine learning technologies and distributed systems is a plus
Purpose of the Role
Responsible for 100% network uptime, with efficient and timely first-level support on a 24x7 basis to meet business goals; providing the highest level of technology support, and introducing and maintaining state-of-the-art technology in line with the business strategy.
Desired Skills and Competencies
- Ability to Influence
- Customer Sensitivity
- Execution Excellence
- Functional Expertise
- Situation Handling
- Team Management
- Working Together
Role Description:
- Work experience: Should have a Telco / ISP service provider background.
- Should have experience in Planning and Operations of Broadband network.
- Should Lead a Team and Manage Day to Day NOC Operation.
- Manage Outages in the Network and Provide resolution within SLA; Handle Customer Escalation with Minimal Turnaround time.
- Escalating Issues and Co-ordinating with vendor team on Bug fixes and Resolutions.
- Maintain the Uptime of the network at 100%.
- Should monitor the network and plan maintenance activities in a timely manner.
- Should work on building redundancy and eliminating points of failure.
- Should work on Bandwidth Management and Traffic Load Balancing Mechanism.
- Provide backend support to cross functional teams for new provisioning and Fault Repair.
- Should be available 24x7 to respond to emergency situations and support planned activities.
- Should be Responsible for Inventory Management.
- Manage and Train NOC team, perform evaluations and hiring, and handle disciplinary responsibilities.
- Prepare and Maintain SOPs of critical Process; perform Process Correction on requirement basis.
- Should analyse fault trends and plan preventive actions.
- Should be Responsible for publishing of daily, weekly, monthly reports and statistics reports.
Technical Requirements:
- Should have Good Knowledge of Switching and Routing.
- Hands-on experience with BGP, MPLS, MPLS L2VPN, MPLS L3VPN, IS-IS, OSPF, VLAN, DHCP, PPPoE, STP, LAN, WAN, NAT, etc.
- Should have worked in Layer-2 and Layer-3 Switches, Edge Routers; preferably on Cisco and Huawei Devices.
- Should have worked on GPON Technology and have understanding of Operation and Troubleshooting skills, hands on experience on installation and implementation of OLTs and ONTs.
- Should have worked on NMS Tools and have understanding of SNMP Protocol.
- Should have an understanding of BNG working principles and of the RADIUS and Diameter protocols.
- Should have Good understanding of Fiber Technology.
- Should have an understanding of wireless technologies such as 802.11b/g/n and 802.11ac, and should be able to configure Wi-Fi routers and debug issues.
- Must have done BE/B.Tech or equivalent in Computer Engineering/Electronics & Communication Engineering/Electronics & Electrical Engineering
- CCNA/CCNP Certification is Preferred.
- Expert in configuration, deployment and management of Switches, Routers (Cisco, Arista, Juniper)
- Design, implement and support multi-site colocation DC Network architecture, ensuring that it meets the business requirements and performance goals
- Prior experience in designing and maintaining LAN/WAN/MPLS/ISP networks
- Provide technical assistance with the design, installation, operation, and maintenance of DC Network & Remote/Branch Networks
- Working with infrastructure teams to maintain standards for critical infrastructure, across Server and Network Security/networking devices
- Implement, manage, monitor, and upgrade security systems for the protection of the organization's data, systems, and networks
- Ensure that the organization's data and infrastructure are protected by enabling the appropriate security controls
- Analyze security systems and seek improvements on a continuous basis
- Responding to all system and/or network security breaches.
- Participating in Incident and Change management process.
- Daily administrative tasks, reporting and communication with the relevant departments in the organization.
- Acting as a SPOC for all network related issues.
- Respond to incidents raised by Ops/Support Team
- Participate at 24/7 rotation for troubleshooting Critical incidents related to Network and Security
- DataCenter Network Up time tracking
- Cloud connectivity check as well as site to site tunnels creation to access resources directly from office network.
- Networking devices LifeCycle Management
Key Skillset required:
Roles & Responsibilities:
We are looking for a Senior Network Engineer to be responsible for the development and support of a complex networking infrastructure critical to the operation of the Company. This position will develop technical solutions to complex problems requiring the regular use of ingenuity and creativity.
- Maintain, test, and monitor complex local area and wide area network designs encompassing multiple technologies using established procedures and tools.
- Diagnose network issues and provide problem resolution in a timely manner.
- Develop and maintain all network component and system configurations.
- Design, install, and support networking systems and operations.
- Troubleshoot network performance issues.
- Analyse system errors or failures and work to resolve issues. Escalate as necessary to maximize system availability.
- Provide technical support to peers and guidance to users.
- Evaluate new network technologies and make recommendations regarding the integration of relevant technologies into the existing network. Recommend upgrades, patches, and new applications or equipment.
- Maintain knowledge of emerging technologies and enhanced applications.
- Create and maintain network systems documentation and asset management information.
Qualifications:
- 9+ years of networking systems experience is required.
- Experience with Cisco products and support services and procedures is essential; Cisco certifications are preferred, but not required.
- Expert level experience with Cisco Nexus Switching equipment required.
- Expert level experience with Cisco /FMC/ASA Anyconnect.
- Expert level experience with Cisco ISE, posturing, RADIUS, TACACS.
- Experience with F5 LTM.
- Expert level experience with routing protocols and VPN technologies (EIGRP, OSPF, BGP, DMVPN, IPsec VPN).
- Expert level experience with L2 technologies (vPC, VTP, port channels).
- Experience with Palo Alto firewalls and Meraki Wi-Fi/SD-WAN.
- Experience with network related tasks with AWS.
- Must be able to work flexible hours on-site and remotely.
- Must be able to work in a complex, dynamic team environment with minimal supervision and possess good organizational skills.
- Strong interpersonal skills are needed to work well with a talented team of infrastructure support personnel and application developers.
- CCIE certification is preferred.
- Ability to communicate with individuals at all levels in the Company and with various business contacts outside of the company in an articulate, professional manner.
- Ability to organize, prioritize and handle multiple assignments on a daily basis.
Installation, configuration and troubleshooting of network equipment, e.g. CCTV cameras, network switches, routers, Wi-Fi, servers and desktop computers. Must be willing to travel across India based on project requirements.

- Support for Incident and Request management tasks as assigned.
- Administration and understanding of the following network services to support Incident and Request tasks as assigned:
  - Palo Alto Firewalls
  - LAN Routing and Switching
  - Microsoft Azure Networking – VNets, ExpressRoute, VNet Peering, VPN Gateways
  - LAN to LAN VPN Tunnels
  - Arista Network devices
- Work with network monitoring tools such as one or more of the following: SolarWinds, Splunk, Prime, NetMRI, Nagios
- Work directly with telecom carriers and Network infrastructure maintenance providers as needed, such as Verizon, Tata, Bharti, and Cisco.
- Maximize network performance by monitoring performance, troubleshooting network problems and outages, scheduling upgrades, and collaborating with network architects on network optimization
- Support routing and switching, and work on BGP and OSPF protocols
- Support China retail store infrastructure and data center through on-call rotations
QUALIFICATIONS:
- Graduate or above
- 6-7 years experience in LAN/WAN Routing including 2 or more years of LAN/WAN Administration experience within a multi-platform technical environment.
- In-depth understanding of voice and/or data technologies and subject matter expert in at least one area.
- Cisco Certified or equivalent combination of knowledge and experience.
- Understanding of IPsec, VPNs, GRE, and NAT/PAT across varied platforms
- Expert knowledge of all LAN switching technologies, e.g. Spanning Tree, VLAN management, and VTP concepts on Cisco, Nexus, and Arista
- Good understanding of MPLS routing and OSPF routing
As an IT Infrastructure Architect, you will design and implement information systems that support an enterprise infrastructure. You will provide the necessary technical infrastructure for the development of new and existing infrastructure technologies and system requirements.
Improve efficiency and streamline operations. Enhance design specs, create technical documentation, implement control concepts and deliver expected outcomes. Collect performance data to monitor system resource usage and failure rates, provide solutions and recommend changes. Ensure scalability and anticipate capacity growth through careful planning and awareness of industry, business and client growth trends. Design activities rely on accurate data, sensible KPIs and performance metrics to improve processes and bridge gaps.
Primary Responsibilities:
Conduct research on emerging and existing technologies. Recommend alternative technologies and infrastructure development efforts that increase infrastructure flexibility, reliability, stability, scalability, resilience, availability, performance and cost effectiveness. All collective research efforts will contribute to the creation of architectural road maps that leverage software and cloud technologies. Research customer interaction, policy adherence and enterprise processes. May act as the subject matter expert on architectural virtualization.
Guide the execution of Incident, Change, Release, Problem, Performance, and Availability Management.
Security of all infrastructure is of paramount importance and is periodically audited, monitored and updated in keeping up with latest threats and risks.
Identifying best practices for future implementation. Architects provide feedback to the enterprise and incorporate all gathered information into future integration plans.
Provides DevOps thought leadership and mentoring in both advisory and delivery contexts, focusing on the requirements of Technology and Business and how these are best served by continuous improvements to our delivery approach
Required Technical skills and Experience
- 10+ years of experience as an IT Infrastructure Architect
- Bachelor's / Master's degree in Computer Science, Information Technology or a related field
- Must have experience in Infrastructure architecting on AWS/ Azure/ Google cloud.
- Should have a very good understanding of cloud-native service platforms (IaaS, PaaS, SaaS) for deploying and scaling applications cost-effectively while addressing scalability, availability, service continuity (DR), performance and security requirements, including auto-scaling and self-healing.
- Hands-on experience with cloud orchestration using Kubernetes, or Apache Mesos with Marathon, would be an advantage
- Evangelizing microservices-based architectures using containerized applications; help to drive strategy and implementation of cloud native infrastructure
- Sound knowledge of RDBMS, preferably MySQL, along with MongoDB, Elasticsearch and Redis. Working knowledge of CDN/WAF
- Proven expertise in Linux and DevOps tools such as Git, Jenkins, Maven, Bamboo, Docker, Puppet, Ansible, Kubernetes, Terraform, Elastic Beanstalk and OpenShift
- Infrastructure security (VPC, tunneling, API management, Governance) and networking security solutions like routing, switching, Firewalls etc.
- Good debugging skills on Linux, Apache, Nginx, PHP, MySQL and cloud-based applications, and administration of RHEL, CentOS/Ubuntu
- Experience in cloud-scale APM and monitoring tools such as ELK, Splunk, Nagios, Grafana, XMON, Datadog, Dynatrace, AppDynamics and Cloud Monitoring.
- Troubleshoot and debug environment and infrastructure problems found in the production and non-production environments.
- Implements security improvements by assessing current situations; evaluating trends; anticipating requirements.
- Determines security violations and inefficiencies by conducting periodic audits.
- Upgrades system by implementing and maintaining security controls.
- Must have knowledge of leading storage backup solutions.
- Experience with one or more Unix shell scripting languages (Bash, C-Shell)
- Team mentoring and support for ramping up new engineers
- Provide leadership in planning, defining requirements, scoping efforts, and setting appropriate milestones
- Using a data-driven process/mindset, author technical content to support the incident response process (e.g. postmortem/root cause analysis) and develop interim solutions to prevent or quickly resolve issues/problems the next time.
- Experience with networking technologies (routing, switching, IP addressing, DNS, load balancers, etc.). Knowledge of file systems (NFS, CIFS, iSCSI) and IPv4 networking, including TCP/IP, SMTP/POP/IMAP, HTTP/S, LDAP and DNS
- Ability to work independently while tackling complex problems
- Passionate about taking ownership and responsibility of the systems, 24x7
∙ Computer Science related Bachelor's Degree or equivalent experience
∙ Minimum 2 years of experience on GCP / AWS / Azure cloud systems (5+ years of experience overall)
∙ Familiarity with LAMP stack technologies, experience supporting
∙ Must be capable of working on Linux systems.
∙ Familiarity with SQL, Apache, file storage, load balancers and Agile methodologies
∙ Knowledge of shell scripts, Linux system administration.
∙ Strong DNS management and automation background
∙ Strong scripting (Bash, PHP, Perl) skills
∙ Good understanding of TCP/IP networking and troubleshooting
∙ Clear communication and documentation of projects and procedures
∙ Strong problem solving skills
∙ Demonstrated ability to manage timelines, dependencies, deliverables, milestones and resource allocation and management in projects.
∙ Strong cloud architecture experience.
∙ Cloud security
∙ Knowledge of microservices / API / queue systems is an added advantage
∙ Knowledge of cloud computing technologies and current computing trends.
∙ Understanding of and willingness to embrace CI/CD and automation tooling such as Jenkins and Git
Responsibilities:
∙ Set up technical server infrastructure, providing technical assistance to development teams, monitoring site performance, and troubleshooting issues when they arise.
∙ Set up and maintain a development server environment and a live server environment with a process for testing and deploying changes to live sites.
∙ Take an active role in designing, implementing and maintaining a scalable and robust enterprise server environment.
∙ Administer Apache web server, Load balancer and MySQL Server
∙ Work on system security and iptables configuration
∙ Optimize servers for high traffic, security, and other system issues.
∙ Evaluate and propose new or improved system architecture.
∙ Document system configuration, processes, and procedures.
∙ Share responsibility with team members for rotating on-call duties.

