Requirements
- 3+ years of work experience writing clean production code
- Well versed in maintaining infrastructure as code (Terraform, CloudFormation, etc.). High proficiency with Terraform / Terragrunt is absolutely critical
- Experience setting up CI/CD pipelines from scratch
- Experience with AWS (EC2, ECS, RDS, ElastiCache, etc.), AWS Lambda, Kubernetes, Docker, service mesh
- Experience with ETL pipelines and big data infrastructure
- Understanding of common security issues
Roles / Responsibilities:
- Write Terraform modules for deploying different components of infrastructure in AWS, such as Kubernetes, RDS, Prometheus, Grafana and static websites
- Configure networking, autoscaling, continuous deployment, security and multiple environments
- Make sure the infrastructure is SOC2, ISO 27001 and HIPAA compliant
- Automate all the steps to provide a seamless experience to developers.
Similar jobs
Job Title: Senior DevOps Engineer
Experience: 4+ years
Location: Pan India, Jaipur, Indore, Gurgaon
Positions: 2
Job Summary:
We are seeking a skilled and motivated DevOps Engineer with 4+ years of experience to join our dynamic team. The ideal candidate will have a strong background in AWS cloud platforms and a passion for continuous integration, continuous delivery, and automation. As a DevOps Engineer, you will play a critical role in managing our cloud infrastructure, optimizing deployment pipelines, and ensuring the reliability and scalability of our systems.
Key Responsibilities:
- Cloud Infrastructure Management: Design, implement, and manage scalable, secure, and highly available AWS cloud infrastructure.
- Monitor and optimize cloud resources to ensure efficient utilization and cost management.
- CI/CD Pipelines: Develop and maintain CI/CD pipelines using tools such as Jenkins, GitLab CI, or AWS CodePipeline. Automate build, test, and deployment processes to improve efficiency and reduce errors.
- Configuration Management: Utilize configuration management tools like Ansible, Chef, or Puppet to automate provisioning and configuration of infrastructure. Ensure consistency and repeatability of environments across development, staging, and production.
- Monitoring and Logging: Implement and maintain monitoring and logging solutions to ensure the health and performance of applications and infrastructure. Use tools such as CloudWatch, Prometheus, Grafana, or ELK stack to proactively identify and resolve issues.
- Security and Compliance: Implement best practices for cloud security, including IAM policies, security groups, and data encryption. Ensure compliance with industry standards and regulations, conducting regular security audits and vulnerability assessments.
- Collaboration and Support: Work closely with development, QA, and operations teams to ensure seamless integration and delivery of applications. Provide support for infrastructure-related issues and incidents, participating in on-call rotations as needed.
Qualifications:
- Education: Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- Experience: Minimum of 4 years of experience in a DevOps role with a focus on AWS cloud services.
- Technical Skills:
  - Proficiency in AWS services such as EC2, S3, RDS, Lambda, VPC, CloudFormation, and IAM.
  - Experience with containerization and orchestration tools like Docker and Kubernetes.
  - Strong scripting skills in languages such as Python, Bash, or Ruby.
  - Familiarity with infrastructure as code (IaC) tools like Terraform or CloudFormation.
  - Knowledge of version control systems such as Git.
- Soft Skills:
  - Excellent problem-solving skills and attention to detail.
  - Strong communication and collaboration abilities.
  - Ability to work in a fast-paced, dynamic environment and handle multiple tasks simultaneously.
Preferred Qualifications:
- AWS Certified DevOps Engineer or similar certifications.
- Experience with other cloud platforms (e.g., Azure, GCP) is a plus.
- Knowledge of agile methodologies and experience working in agile teams.
Responsibilities
● Work with application development teams to identify and understand their operational pain points.
● Document these challenges and define goals to be achieved by the infrastructure team.
● Prototype and evaluate multiple solutions, often by experimenting with various vendors and tools available, to achieve the goals undertaken.
● Roll out tools and processes with a heavy focus on automation.
● Evangelize and help onboard application development teams on the platforms provided by the infrastructure team.
● Co-own the responsibility with application development teams to ensure the reliability of services.
● Design and implement solutions around observability to ensure ease of maintenance and quick debugging of services
● Establish and implement administrative and operational best practices in the application development teams.
● Find avenues to reduce infrastructure costs and drive optimization in all services.
Qualifications
● 5+ years of experience as a DevOps / Infrastructure engineer with cloud platforms (preferably AWS)
● Experience with Git, CI/CD, Docker, etc.
● Experience in working with infrastructure as code (Terraform, etc).
● Strong Linux Shell scripting experience
● Experience with one of the programming languages like Python, Java, Kotlin, etc.
- 7-10 years’ total experience, including 6+ years in a production 24/7 high-availability multi-site Cloud environment, including application hosting, CDN Networks, security and information protection.
- Experience of leading overall infrastructure for a complex organization and network, including 24x7 monitoring of a media website & digital properties.
- Experience in hosting and managing video streaming applications and React- and Node.js-based applications.
- Experience with regulatory compliance issues, as well as best practices in application and network security.
- Experience in hosting services on Amazon Cloud & Google Cloud.
- Experience in performing Vulnerability Assessment at server & application level
- Experience in managing Live Streaming on digital platforms.
- Experience in managing SVN, Git code repository & Code release management.
- Partner with the Technology Head to lead the technology infrastructure strategy and execution for the enterprise.
- Planning, project management and implementation leadership, identifying opportunities for automation, cost savings, and service quality improvement.
- Provides infrastructure services vision, enables innovation and seeks to leverage market trends that can create business value consistent with the company’s requirements and expectations.
- Participate in the formulation of the company's enterprise architecture and business system plans; assessing cost and feasibility, and ensuring the plan is aligned with and supports the strategic goals of the business
- Hands-on technical depth enables direct oversight, problem-solving leadership and participation for complex infrastructure implementation, system upgrades and operational troubleshooting.
- Experience with comprehensive disaster recovery architecture and operations, including storage area network and redundant, highly-available server and network architectures.
- Leadership for delivery of 24/7 service operations and KPI compliance.
- Ensure best practices are followed for code release management & monitoring of traffic on websites & other applications.
Job Description
DevOps Engineer [4 to 5 yrs]
We are looking for a DevOps engineer with strong experience who knows Docker inside and out. They will be responsible for defining and implementing our containerisation technology strategies. They will also design our architecture and define our roadmap. In addition, the candidate will work on building, scaling, and monitoring various highly complex applications and making our container orchestration cluster highly available.
The candidate has to be an expert in Docker and containerisation technologies with knowledge of container orchestration tools like Kubernetes and logging/monitoring tools for these orchestration tools. They will also be responsible for building and improving our application CI/CD pipelines and keeping themselves and their team up-to-date with the latest industry trends and technologies.
Responsibilities:
· Implement and improve monitoring and alerting.
· Build and maintain highly available systems on Kubernetes.
· Implement and manage CI/CD pipelines.
· Implement an auto-scaling system for our Kubernetes nodes.
· Participate in on-call rotations.
Requirements:
· Operations or systems administration experience, particularly on Linux.
· Strong experience with Azure.
· Experience with container networking on Docker.
· Experience with container orchestration.
· Experience with monitoring tools like Prometheus, Grafana, etc.
· Experience with application deployment by using CI/CD.
· Node.js experience/knowledge would be a strong add-on.
• Expertise in any one hyperscaler (AWS/Azure/GCP), including basic services like networking, data and workload management.
  o AWS
    Networking: VPC, VPC Peering, Transit Gateway, Route Tables, Security Groups, etc.
    Data: RDS, DynamoDB, Elasticsearch
    Workload: EC2, EKS, Lambda, etc.
  o Azure
    Networking: VNET, VNET Peering
    Data: Azure MySQL, Azure MSSQL, etc.
    Workload: AKS, Virtual Machines, Azure Functions
  o GCP
    Networking: VPC, VPC Peering, Firewall, Flow Logs, Routes, Static and External IP Addresses
    Data: Cloud Storage, Dataflow, Cloud SQL, Firestore, Bigtable, BigQuery
    Workload: GKE, Instances, App Engine, Batch, etc.
• Experience in any one of the CI/CD tools (GitLab/GitHub/Jenkins), including runner setup, templating and configuration.
• Kubernetes (EKS/AKS/GKE) or Ansible experience, covering basics like pods, deployments, networking and service mesh; experience using a package manager like Helm.
• Scripting experience (Bash/Python), including automation in pipelines when required and system services.
• Infrastructure automation (Terraform/Pulumi/CloudFormation): write modules, set up pipelines and version the code.
Optional
• Experience in any programming language is not required but is appreciated.
• Good experience in Git, SVN or any other code management tool is required.
• DevSecOps tools (Qualys/SonarQube/Black Duck) for security scanning of artifacts, infrastructure and code.
• Observability tools (open source: Prometheus, Elasticsearch, OpenTelemetry; paid: Datadog, 24/7, etc.)
JOB RESPONSIBILITIES:
- Responsible for the design, implementation, and continuous improvement of automated CI/CD infrastructure
- Displays technical leadership and oversight of implementation and deployment planning, system integration, ongoing data validation processes, quality assurance, delivery, operations, and sustainability of technical solutions
- Responsible for designing topology to meet requirements for uptime, availability, scalability, robustness, fault tolerance & security
- Implement proactive measures for automated detection and resolution of recurring operational issues
- Lead the operational support team to manage incidents, document root causes and track preventive measures
- Identify and deploy cybersecurity measures by continuously validating and remediating vulnerability assessment findings and managing risk
- Responsible for the design and development of tools and installation procedures
- Develops and maintains accurate estimates, timelines, project plans, and status reports
- Organize and maintain packaging and deployment of various internal modules and third-party vendor libraries
- Responsible for the employment, timely performance evaluation, counselling, employee development, and discipline of assigned employees.
- Participates in calls and meetings with customers, vendors, and internal teams on a regular basis.
- Perform infrastructure cost analysis and optimization
SKILLS & ABILITIES
Experience: Minimum of 10 years of experience with good technical knowledge regarding build, release, and systems engineering
Technical Skills:
- Experience with DevOps toolchains such as Docker, Rancher, Kubernetes, Bitbucket
- Experience with Apache, Nginx, Tomcat, Prometheus, Grafana
- Ability to learn/use a wide variety of open-source technologies and tools
- Sound understanding of cloud technologies, preferably AWS
- Linux, Windows, Scripting, Configuration Management, Build and Release Engineering
- 6 years of experience in DevOps practices, with a good understanding of DevOps and Agile principles
- Good scripting skills (Python/Perl/Ruby/Bash)
- Experience with standard continuous integration tools Jenkins/Bitbucket Pipelines
- Work on software configuration management systems (Puppet/Chef/Salt/Ansible)
- Microsoft Office Suite (Word, Excel, PowerPoint, Visio, Outlook) and other business productivity tools
- Working knowledge of HSM and PKI (good to have)
Location:
- Bangalore
Experience:
- 10 + Years.
About the Role
- Own the end-to-end infrastructure of Sibros Cloud
- Define and introduce security best practices, identify gaps in infrastructure and come up with solutions
- Design and implement tools and software to manage Sibros’ infrastructure
- Stay hands-on: write and review code and documentation, and debug and root-cause issues in the production environment
Minimum Qualifications
- Experience in Infrastructure as Code (IaC) to manage multi-cloud environments using cloud-agnostic tools like Terraform or Ansible
- Passion for security and a good understanding of industry best practices
- Experience in programming languages like Python or Golang, and enjoyment of automating everything using code
- Good skills and intuition for root-causing issues in production environments
Preferred Qualifications
- Experience in database and network management
- Experience in defining security policies and best practices
- Experience in managing a large-scale multi-cloud environment
- Knowledge of SOC, GDPR or ISO 27001 security compliance standards is a plus
Equal Employment Opportunity
Sibros is committed to a policy of equal employment opportunity. We recruit, employ, train, compensate, and promote without regard to race, color, age, sex, ancestry, marital status, religion, national origin, disability, sexual orientation, veteran status, present or past history of mental disability, genetic information or any other classification protected by state or federal law.
We are looking for a full-time remote DevOps Engineer who has worked with CI/CD automation, big data pipelines and cloud infrastructure, to solve complex technical challenges at scale that will reshape the healthcare industry for generations. You will get the opportunity to be involved in the latest tech in big data engineering, novel machine learning pipelines and highly scalable backend development. The successful candidate will work in a team of highly skilled and experienced developers and data scientists, alongside the CTO.
Job Requirements
- Experience deploying, automating, maintaining, and improving complex services and pipelines
- Strong understanding of DevOps tools/process/methodologies
- Experience with AWS CloudFormation and AWS CLI is essential
- The ability to work to project deadlines efficiently and with minimum guidance
- A positive attitude and enjoyment of working within a globally distributed team
Skills
- Highly proficient working with CI/CD and automating infrastructure provisioning
- Deep understanding of the AWS Cloud platform and hands-on experience setting up and maintaining large-scale implementations
- Experience with JavaScript/TypeScript, Node, Python and Bash/Shell scripting
- Hands-on experience with Docker and container orchestration
- Experience setting up and maintaining big data pipelines, serverless stacks and container infrastructure
- An interest in healthcare and medical sectors
- Technical degree with 4 plus years’ infrastructure and automation experience
What we are looking for
Work closely with product & engineering groups to identify and document infrastructure requirements.
Design infrastructure solutions balancing requirements, operational constraints and architecture guidelines.
Implement infrastructure including network connectivity, virtual machines and monitoring.
Implement and follow security guidelines, both policy and technical to protect our customers.
Resolve incidents as escalated from monitoring solutions and lower tiers.
Identify root cause for issues and develop long term solutions to fix recurring issues.
Ability to automate recurring tasks to increase velocity and quality.
Partner with the engineering team to build software tolerance for infrastructure failure or issues.
Research emerging technologies, trends and methodologies and enhance existing systems and processes.
Qualifications
Master’s/Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field, and two years of experience in software/systems or related.
5+ years overall experience.
Work experience must have included:
Proven track record in deploying, configuring and maintaining Ubuntu server systems on premise and in the cloud.
Minimum of 4 years’ experience designing, implementing and troubleshooting TCP/IP networks, VPN, Load Balancers & Firewalls.
Minimum 3 years of experience working in public clouds like AWS & Azure.
Hands-on experience in any of the configuration management tools like Ansible, Chef & Puppet.
Strong in performing production operation activities.
Experience with container and container orchestration tools like Kubernetes and Docker Swarm is a plus.
Good at source code management tools like Bitbucket, Git.
Configuring and utilizing monitoring and alerting tools.
Scripting to automate infrastructure and operational processes.
Hands-on work to secure networks and systems.
Sound problem resolution, judgment, negotiating and decision making skills
Ability to manage and deliver multiple project phases at the same time
Strong analytical and organizational skills
Excellent written and verbal communication skills
Interview focus areas
Networks, systems, monitoring
AWS (EC2, S3, VPC)
Problem solving, scripting, network design, systems administration and troubleshooting scenarios
Culture fit, agility, bias for action, ownership, communication
● Develop and deliver automation software required for building & improving the functionality, reliability, availability, and manageability of applications and cloud platforms
● Champion and drive the adoption of Infrastructure as Code (IaC) practices and mindset
● Design, architect, and build self-service, self-healing, synthetic monitoring and alerting platform and tools
● Automate the development and test automation processes through a CI/CD pipeline (Git, Jenkins, SonarQube, Artifactory, Docker containers)
● Build a container hosting platform using Kubernetes
● Introduce new cloud technologies, tools & processes to keep innovating in the commerce area to drive greater business value.
Skills Required:
● Excellent written and verbal communication skills and good listening skills.
● Proficiency in deploying and maintaining cloud-based infrastructure services (AWS, GCP, Azure – good hands-on experience in at least one of them)
● Well versed with service-oriented architecture, cloud-based web services architecture, design patterns and frameworks.
● Good knowledge of cloud-related services like compute, storage, network, messaging (e.g., SNS, SQS) and automation (e.g., CFT/Terraform).
● Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
● Experience in systems management/automation tools (Puppet/Chef/Ansible, Terraform)
● Strong Linux System Admin Experience with excellent troubleshooting and problem solving skills
● Hands-on experience with languages (Bash/Python/Core Java/Scala)
● Experience with CI/CD pipeline (Jenkins, Git, Maven etc)
● Experience integrating solutions in a multi-region environment
● Self-motivated, quick to learn and able to deliver results with minimal supervision
● Experience with Agile/Scrum/DevOps software development methodologies.
Nice to Have:
● Experience in setting up the Elasticsearch, Logstash, Kibana (ELK) stack.
● Experience working with large-scale data.
● Experience with monitoring tools such as Splunk, Nagios, Grafana, Datadog, etc.
● Previous experience working with distributed architectures like Hadoop, MapReduce, etc.