About the Role
We are seeking an accomplished DevOps Lead with 12+ years of experience in cloud infrastructure, automation, blockchain, and CI/CD processes. The DevOps Lead will play a pivotal role in architecting scalable cloud environments, driving automation, ensuring secure deployments, and enabling efficient software delivery pipelines. The role involves working with AWS, Huawei Cloud, Kubernetes, Terraform, blockchain-based infrastructure, and modern DevOps toolchains while providing leadership, technical guidance, and client-facing communication.
Key Responsibilities
Leadership & Team Management
● Lead, mentor, and grow a team of DevOps engineers, setting technical direction and ensuring adherence to best practices.
● Facilitate collaboration across engineering, QA, security, and blockchain development teams.
● Act as the primary technical liaison with clients, managing expectations, requirements, and solution delivery.
Infrastructure Automation & Management
● Architect, implement, and manage infrastructure as code (IaC) using Terraform across multi-cloud environments.
● Standardize environments across AWS, DigitalOcean, and Huawei Cloud with a focus on scalability, reliability, and security.
● Manage provisioning, scaling, monitoring, and cost optimization of infrastructure resources.
CI/CD & Automation
● Build, maintain, and optimize CI/CD pipelines supporting multiple applications and microservices.
● Integrate automated testing, static code analysis, and security scans into the pipelines.
● Implement blue-green and canary deployment strategies to ensure zero-downtime releases.
● Promote DevSecOps by embedding security policies into every phase of the delivery pipeline.
Containerization & Orchestration
● Deploy, manage, and monitor applications on Kubernetes clusters (EKS, CCE, or equivalent).
● Utilize Helm charts, Kustomize, and operators for environment consistency.
● Optimize container performance and manage networking, storage, and secrets.
Monitoring, Logging & Incident Response
● Implement and manage monitoring, logging, and alerting solutions (Prometheus, Grafana, ELK, CloudWatch, Loki).
● Define SLOs, SLIs, and SLAs for production systems.
● Lead incident response and root cause analysis, and implement preventative measures.
Governance, Security & Compliance
● Implement best practices for secrets management, key rotation, and role-based access control.
● Integrate vulnerability scanning and security audits into pipelines.
Required Skills & Qualifications
● 12+ years of experience in DevOps, with at least 5 years in a lead capacity.
● Proven expertise with Terraform and IaC across multiple environments.
● Strong hands-on experience with AWS and Huawei Cloud infrastructure services.
● Deep expertise in Kubernetes cluster administration, scaling, monitoring, and networking.
● Advanced experience designing CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, or similar.
● Solid background in automated deployments, version control (Git), and configuration management (Ansible, Puppet, or Chef).
● Strong scripting and automation skills (Python, Bash, Go, or similar).
● Proficiency with monitoring/observability tools (Prometheus, Grafana, ELK, CloudWatch, Datadog).
● Strong understanding of blockchain infrastructure, node operations, staking setups, and deployment automation.
● Knowledge of container security, network policies, and zero-trust principles.
● Excellent communication, client-handling, and stakeholder-management skills, with a proven ability to present complex DevOps concepts to non-technical audiences.
● Ability to design and maintain highly available, scalable, and fault-tolerant systems in production environments.