
Role: Full-Time, Long-Term
Required: Docker, GCP, CI/CD
Preferred: Experience with ML pipelines
OVERVIEW
We are seeking a DevOps engineer to join as a core member of our technical team. This is a long-term position for someone who wants to own infrastructure and deployment for a production machine learning system. You will ensure our prediction pipeline runs reliably, deploys smoothly, and scales as needed.
The ideal candidate thinks about failure modes obsessively, automates everything possible, and builds systems that run without constant attention.
CORE TECHNICAL REQUIREMENTS
Docker (Required): Deep experience with containerization. Efficient Dockerfiles, layer caching, multi-stage builds, debugging container issues. Experience with Docker Compose for local development.
Google Cloud Platform (Required): Strong GCP experience: Cloud Run for serverless containers, Compute Engine for VMs, Artifact Registry for images, Cloud Storage, IAM. You can navigate the console but prefer scripting everything.
CI/CD (Required): Build and maintain deployment pipelines. GitHub Actions required. You automate testing, building, pushing, and deploying. You understand the difference between continuous integration and continuous deployment.
Linux Administration (Required): Comfortable on the command line. SSH, diagnose problems, manage services, read logs, fix things. Bash scripting is second nature.
PostgreSQL (Required): Database administration basics—backups, monitoring, connection management, basic performance tuning. Not a DBA, but comfortable keeping a production database healthy. (A minimal backup sketch follows at the end of this section.)
Infrastructure as Code (Preferred): Terraform, Pulumi, or similar. Infrastructure should be versioned, reviewed, and reproducible—not clicked together in a console.
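To give a concrete flavor of the PostgreSQL item above, here is a minimal sketch of a backup job that dumps a database with pg_dump and copies it to Cloud Storage. The bucket, host, and user names are placeholders, not a description of our actual setup.

```python
#!/usr/bin/env python3
"""Minimal sketch: dump a PostgreSQL database and upload it to Cloud Storage.

Assumes pg_dump is on PATH and credentials come from the environment
(PGPASSWORD / application default credentials). All names are illustrative.
"""
import datetime
import subprocess

from google.cloud import storage  # pip install google-cloud-storage

BUCKET = "example-db-backups"   # placeholder bucket
DB_NAME = "app"                 # placeholder database
DB_HOST = "10.0.0.5"            # placeholder host


def run_backup() -> str:
    stamp = datetime.datetime.now(datetime.timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    dump_path = f"/tmp/{DB_NAME}-{stamp}.dump"
    # Custom-format dump so it can be restored selectively with pg_restore.
    subprocess.run(
        ["pg_dump", "-h", DB_HOST, "-U", "backup_user",
         "--format=custom", "--file", dump_path, DB_NAME],
        check=True,
    )
    return dump_path


def upload(dump_path: str) -> None:
    client = storage.Client()
    blob = client.bucket(BUCKET).blob(f"postgres/{dump_path.rsplit('/', 1)[-1]}")
    blob.upload_from_filename(dump_path)


if __name__ == "__main__":
    upload(run_backup())
```

A real version would also restore the dump somewhere disposable to prove it works, and prune old objects on a retention schedule.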
WHAT YOU WILL OWN
Deployment Pipeline: Maintaining and improving deployment scripts and CI/CD workflows. Code moves from commit to production reliably with appropriate testing gates.
Cloud Run Services: Managing deployments for model fitting, data cleansing, and signal discovery services. Monitor health, optimize cold starts, handle scaling.
VM Infrastructure: PostgreSQL and Streamlit on GCP VMs. Instance management, updates, backups, security.
Container Registry: Managing images in GitHub Container Registry and Google Artifact Registry. Cleanup policies, versioning, access control.
Monitoring and Alerting: Building observability. Logging, metrics, health checks, alerting. Know when things break before users tell us. (A minimal health-check sketch follows below.)
Environment Management: Configuration across local and production. Secrets management. Environment parity where it matters.
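As a minimal, illustrative sketch of the monitoring and alerting work described above, the script below probes a service health endpoint and posts an alert on failure; the service URL, /healthz path, and webhook are placeholder assumptions, not our actual stack.

```python
#!/usr/bin/env python3
"""Minimal sketch: probe a service health endpoint and alert on failure.

The service URL, /healthz path, and alert webhook are placeholders.
"""
import sys

import requests  # pip install requests

SERVICE_URL = "https://model-fitting-example-uc.a.run.app/healthz"  # placeholder
ALERT_WEBHOOK = "https://example.com/alert-hook"                    # placeholder


def check() -> bool:
    try:
        resp = requests.get(SERVICE_URL, timeout=10)
        return resp.status_code == 200
    except requests.RequestException:
        return False


def alert(message: str) -> None:
    # Fire-and-forget notification; a real setup would deduplicate and page.
    requests.post(ALERT_WEBHOOK, json={"text": message}, timeout=10)


if __name__ == "__main__":
    if not check():
        alert(f"Health check failed for {SERVICE_URL}")
        sys.exit(1)
```

In practice this would run on a schedule (Cloud Scheduler, cron) and feed whatever alerting channel the team already uses.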
WHAT SUCCESS LOOKS LIKE
Deployments are boring—no drama, no surprises. Systems recover automatically from transient failures. Engineers deploy with confidence. Infrastructure changes are versioned and reproducible. Costs are reasonable and resources scale appropriately.
ENGINEERING STANDARDS
Automation First: If you do something twice, automate it. Manual processes are bugs waiting to happen.
Documentation: Runbooks, architecture diagrams, deployment guides. The next person can understand and operate the system.
Security Mindset: Secrets never in code. Least-privilege access. You think about attack surfaces.
Reliability Focus: Design for failure. Backups are tested. Recovery procedures exist and work.
CURRENT ENVIRONMENT
GCP (Cloud Run, Compute Engine, Artifact Registry, Cloud Storage), Docker, Docker Compose, GitHub Actions, PostgreSQL 16, Bash deployment scripts with Python wrapper.
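For illustration only, a stripped-down Python wrapper around that deploy flow might look like the sketch below; the project, region, and repository names are placeholders and the real scripts are more involved.

```python
#!/usr/bin/env python3
"""Minimal sketch of a deploy wrapper: build, push to Artifact Registry,
deploy to Cloud Run. Project, region, and service names are placeholders;
the real scripts handle testing gates, rollbacks, and configuration.
"""
import subprocess
import sys

PROJECT = "example-project"  # placeholder
REGION = "us-central1"       # placeholder
REPO = f"{REGION}-docker.pkg.dev/{PROJECT}/services"


def sh(*cmd: str) -> None:
    print("+", " ".join(cmd))
    subprocess.run(cmd, check=True)


def deploy(service: str, tag: str) -> None:
    image = f"{REPO}/{service}:{tag}"
    sh("docker", "build", "-t", image, ".")
    sh("docker", "push", image)
    sh("gcloud", "run", "deploy", service,
       "--image", image, "--region", REGION, "--project", PROJECT)


if __name__ == "__main__":
    deploy(sys.argv[1], sys.argv[2])
```

Usage from a CI job, after tests pass, would be something like `python deploy.py model-fitting v1.2.3` (the script name and arguments here are hypothetical).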
WHAT WE ARE LOOKING FOR
Ownership Mentality: You see a problem, you fix it. You do not wait for assignment.
Calm Under Pressure: When production breaks, you diagnose methodically.
Communication: You explain infrastructure decisions to non-infrastructure people. You document what you build.
Long-Term Thinking: You build systems that can be maintained for years, not quick fixes that create tech debt.
EDUCATION
University degree in Computer Science, Engineering, or related field preferred. Equivalent demonstrated expertise also considered.
TO APPLY
Include: (1) CV/resume, (2) Brief description of infrastructure you built or maintained, (3) Links to relevant work if available, (4) Availability and timezone.

Similar jobs
Title: Azure Cloud Developer/Engineer
Experience: 5+ years
Location: T-Hub, Hyderabad
Work from office (5 days/week)
Interview rounds: 2-3
Excellent communication skills
Immediate joiner
Job Description
Position Overview:
We are seeking a highly skilled Azure Cloud Developer/Engineer with experience in designing, developing, and managing cloud infrastructure solutions. The ideal candidate should have a strong background in Azure infrastructure deployment using Terraform, Kubernetes (AKS) with advanced networking, and Helm Charts for application management. Experience with AWS is a plus. This role requires hands-on expertise in deploying scalable, secure, and highly available cloud solutions with strong networking capabilities.
Key Responsibilities:
- Deploy and manage Azure infrastructure using Terraform through CI/CD pipelines.
- Design, deploy, and manage Azure Kubernetes Service (AKS) with advanced networking features, including on-premise connectivity.
- Create and manage Helm Charts, ensuring best practices for configuration, templating, and application lifecycle management.
- Collaborate with development, operations, and security teams to ensure optimal cloud infrastructure architecture.
- Implement high-level networking solutions including Azure Private Link, VNET Peering, ExpressRoute, Application Gateway, and Web Application Firewall (WAF).
- Monitor and optimize cloud environments for performance, cost, scalability, and security using tools like Azure Cost Management, Prometheus, Grafana, and Azure Monitor.
- Develop CI/CD pipelines for automated deployments using Azure DevOps, GitHub Actions, or Jenkins, integrating Terraform for infrastructure automation.
- Implement security best practices, including Azure Security Center, Azure Policy, and Zero Trust Architecture.
- Troubleshoot and resolve issues in the cloud environment using Azure Service Health, Log Analytics, and Azure Sentinel.
- Ensure compliance with industry standards (e.g., CIS, NIST, ISO 27001) and organizational security policies.
- Work with Azure Key Vault for secrets and certificate management (see the sketch after this list).
- Explore multi-cloud strategies, integrating AWS services where necessary.
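As a small, hedged illustration of the Key Vault responsibility above, the sketch below reads a secret with the Azure SDK for Python; the vault URL and secret name are placeholders, and pipelines would normally rely on a managed identity rather than local credentials.

```python
"""Minimal sketch: read a secret from Azure Key Vault with the Azure SDK for Python.

Vault URL and secret name are placeholders. DefaultAzureCredential will pick up
a managed identity, environment variables, or an az CLI login.
"""
from azure.identity import DefaultAzureCredential   # pip install azure-identity
from azure.keyvault.secrets import SecretClient     # pip install azure-keyvault-secrets

VAULT_URL = "https://example-vault.vault.azure.net"  # placeholder
SECRET_NAME = "db-connection-string"                 # placeholder


def get_secret(name: str) -> str:
    client = SecretClient(vault_url=VAULT_URL, credential=DefaultAzureCredential())
    return client.get_secret(name).value


if __name__ == "__main__":
    # Never print real secrets; this only confirms retrieval for the example.
    print(len(get_secret(SECRET_NAME)), "characters retrieved")
```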
Key Skills Required:
- Azure Cloud Infrastructure Deployment: Expertise in provisioning and managing Azure resources using Terraform within CI/CD pipelines (a minimal pipeline-step sketch follows this list).
- Kubernetes (AKS) with Advanced Networking: Experience in designing AKS clusters with private networking, hybrid connectivity (ExpressRoute, VPN), and security best practices.
- Infrastructure as Code (Terraform, Azure Bicep): Deep understanding of defining and maintaining cloud infrastructure through code.
- Helm Charts: Strong expertise in creating, deploying, and managing Helm-based Kubernetes application deployments.
- Networking & Security: In-depth knowledge of VNET Peering, Private Link, ExpressRoute, Application Gateway, WAF, and hybrid networking.
- CI/CD Pipelines: Experience with building and managing Azure DevOps, GitHub Actions, or Jenkins pipelines for infrastructure and application deployment.
- Monitoring & Logging: Experience with Prometheus, Grafana, Azure Monitor, Log Analytics, and Azure Sentinel.
- Scripting & Automation: Proficiency in Bash, PowerShell, or Python.
- Cost Optimization (FinOps): Strong knowledge of Azure Cost Management and cloud financial governance.
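To illustrate the "Terraform within CI/CD pipelines" skill above, here is a minimal, hedged sketch of a pipeline step that drives the Terraform CLI non-interactively; the module directory is an assumption, and most teams would use the native Azure DevOps or GitHub Actions Terraform tasks instead of a standalone script.

```python
#!/usr/bin/env python3
"""Minimal sketch of a CI step driving Terraform non-interactively.

Assumes the Terraform CLI is installed and backend credentials come from the
pipeline environment. The working directory is a placeholder.
"""
import subprocess
import sys

WORKDIR = "infra/azure"  # placeholder module directory


def tf(*args: str) -> None:
    subprocess.run(["terraform", *args], cwd=WORKDIR, check=True)


def plan_and_apply(auto_approve: bool) -> None:
    tf("init", "-input=false")
    tf("plan", "-input=false", "-out=tfplan")
    if auto_approve:
        # Applying the saved plan keeps the apply identical to what was reviewed.
        tf("apply", "-input=false", "tfplan")


if __name__ == "__main__":
    plan_and_apply(auto_approve="--apply" in sys.argv)
```

A real pipeline would gate the apply step behind a manual approval or a protected branch.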
Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
- 5+ years of experience in cloud engineering, preferably with Azure-focused infrastructure deployment and Kubernetes networking.
- Strong understanding of containerization, orchestration, and microservices architecture.
Certifications:
Preferred:
- Microsoft Certified: Azure Solutions Architect Expert
- Microsoft Certified: Azure DevOps Engineer Expert
Nice to Have (AWS Experience):
- AWS Certified Solutions Architect – Associate or Professional
- AWS Certified DevOps Engineer – Professional
Nice to Have Skills:
- Experience with multi-cloud environments (Azure & AWS).
- Familiarity with container security tools (Aqua Security, Prisma Cloud).
- Experience with GitOps methodologies using tools like ArgoCD or Flux.
- Understanding of serverless computing and event-driven architectures (Azure Functions, Event Grid, Logic Apps).
Benefits
Why Join Us?
- Competitive salary with performance-based incentives.
- Opportunities for professional certifications (e.g., AWS, Kubernetes, Terraform).
- Access to training programs, workshops, and learning resources.
- Comprehensive health insurance coverage for employees and their families.
- Wellness programs and mental health support.
- Hands-on experience with large-scale, innovative cloud solutions.
- Opportunities to work with modern tools and technologies.
- Inclusive, supportive, and team-oriented environment.
- Opportunities to collaborate with global clients and cross-functional teams.
- Regular performance reviews with rewards for outstanding contributions.
- Employee appreciation events and programs.
Kubernetes Engineer
Responsibilities:
- Install, configure, and maintain Kubernetes clusters.
- Develop Kubernetes-based solutions.
- Improve Kubernetes infrastructure.
- Work with other engineers to troubleshoot Kubernetes issues.
Kubernetes Engineer Requirements & Skills
- Kubernetes administration experience, including installation, configuration, and troubleshooting (see the sketch after this list)
- Kubernetes development experience
- Linux/Unix experience
- Strong analytical and problem-solving skills
- Excellent communication and interpersonal skills
- Ability to work independently and as part of a team
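As a hedged, minimal example of the administration and troubleshooting work listed above, this sketch uses the official Kubernetes Python client to flag deployments that are not fully available; the namespace and kubeconfig handling are simplified assumptions.

```python
"""Minimal sketch: flag deployments that are not fully available,
using the official Kubernetes Python client (pip install kubernetes).
Assumes a local kubeconfig; in-cluster config would differ.
"""
from kubernetes import client, config


def unhealthy_deployments(namespace: str = "default") -> list[str]:
    config.load_kube_config()  # or config.load_incluster_config() inside a pod
    apps = client.AppsV1Api()
    problems = []
    for dep in apps.list_namespaced_deployment(namespace).items:
        desired = dep.spec.replicas or 0
        available = dep.status.available_replicas or 0
        if available < desired:
            problems.append(f"{dep.metadata.name}: {available}/{desired} available")
    return problems


if __name__ == "__main__":
    for line in unhealthy_deployments():
        print(line)
```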
LogiNext is looking for a technically savvy and passionate Senior DevOps Engineer to support development and operations efforts on its product. You will choose and deploy tools and technologies to build and support a robust and scalable infrastructure.
You have hands-on experience building secure, high-performing, and scalable infrastructure, and experience automating and streamlining development operations and processes. You are a master at troubleshooting and resolving issues in non-production and production environments.
Responsibilities:
- Design and implement scalable infrastructure for delivering and running web, mobile and big data applications on cloud
- Scale and optimise a variety of SQL and NoSQL databases, web servers, application frameworks, caches, and distributed messaging systems
- Automate the deployment and configuration of the virtualized infrastructure and the entire software stack
- Support several Linux servers running our SaaS platform stack on AWS, Azure, GCP
- Define and build processes to identify performance bottlenecks and scaling pitfalls
- Manage robust monitoring and alerting infrastructure
- Explore new tools to improve development operations
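As a small, hedged illustration of the monitoring and alerting responsibility above, the sketch below publishes a custom metric to CloudWatch with boto3; the namespace, metric, and dimensions are invented for the example, and the same idea applies to Zabbix or Nagios through their own interfaces.

```python
"""Minimal sketch: publish a custom metric to CloudWatch (pip install boto3).

Namespace, metric name, and dimensions are illustrative placeholders; AWS
credentials are taken from the environment or an instance profile.
"""
import boto3


def publish_queue_depth(depth: float) -> None:
    cloudwatch = boto3.client("cloudwatch")
    cloudwatch.put_metric_data(
        Namespace="Example/SaaS",            # placeholder namespace
        MetricData=[{
            "MetricName": "QueueDepth",      # placeholder metric
            "Dimensions": [{"Name": "Service", "Value": "orders"}],
            "Value": depth,
            "Unit": "Count",
        }],
    )


if __name__ == "__main__":
    publish_queue_depth(42.0)
```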
Requirements:
- Bachelor’s degree in Computer Science, Information Technology or a related field
- 4 to 7 years of experience in designing and maintaining high volume and scalable micro-services architecture on cloud infrastructure
- Strong background in Linux/Unix Administration and Python/Shell Scripting
- Extensive experience working with cloud platforms like AWS (EC2, ELB, S3, Auto-scaling, VPC, Lambda), GCP, Azure
- Experience in deployment automation, Continuous Integration and Continuous Deployment (Jenkins, Maven, Puppet, Chef, GitLab) and monitoring tools like Zabbix, Cloud Watch Monitoring, Nagios
- Knowledge of Java Virtual Machines, Apache Tomcat, Nginx, Apache Kafka, Microservices architecture, Caching mechanisms
- Experience in enterprise application development, maintenance and operations
- Knowledge of best practices and IT operations in an always-up, always-available service
- Excellent written and oral communication skills, judgment and decision-making skills
2. Kubernetes Engineer
We are looking for a DevOps Systems Engineer with experience in Docker containers, Docker Swarm, Docker Compose, Ansible, Jenkins, and other tools. In this role you will apply container best practices in design, development, and implementation (a minimal Docker SDK sketch appears at the end of this listing).
At least 4 years of experience in DevOps, with knowledge of:
- Docker
- Docker Cloud / Containerization
- DevOps Best Practices
- Distributed Applications
- Deployment Architecture
- At least AWS experience
- Exposure to Kubernetes / Serverless Architecture
Skills:
- 3-7+ years of experience in DevOps Engineering
- Strong experience with Docker containers: implementing containers and container clustering
- Experience with Docker Swarm, Docker Compose, Docker Engine
- Experience provisioning and managing VMs (virtual machines)
- Experience with and strong knowledge of network topologies and network research
- Jenkins, BitBucket, Jira
- Ansible or other automation / configuration management tools
- Scripting and programming in languages such as Bash, Perl, Python, AWK, sed, PHP, and shell
- Linux systems administration: Red Hat
Additional Preference:
Security, SSL configuration, Best Practices
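A minimal, hedged sketch of the container tooling this listing describes, using the Docker SDK for Python; the image, container name, and port mapping are placeholders, and a Swarm or Kubernetes cluster would be driven through its own orchestration API rather than a single engine.

```python
"""Minimal sketch: run and inspect a container via the Docker SDK for Python
(pip install docker). Image, name, and port mapping are placeholders.
"""
import docker


def run_and_check() -> None:
    client = docker.from_env()  # talks to the local Docker Engine
    container = client.containers.run(
        "nginx:alpine",
        detach=True,
        ports={"80/tcp": 8080},
        name="example-nginx",
    )
    container.reload()  # refresh status after start
    print(container.name, container.status)
    print(container.logs(tail=5).decode(errors="replace"))
    container.stop()
    container.remove()


if __name__ == "__main__":
    run_and_check()
```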
About RaRa Delivery
Not just a delivery company…
RaRa Delivery is revolutionising instant delivery for e-commerce in Indonesia through data-driven logistics.
RaRa Delivery is making instant and same-day deliveries scalable and cost-effective by leveraging a differentiated operating model and real-time optimisation technology. RaRa makes it possible for anyone, anywhere to get same-day delivery in Indonesia. While others are focusing on ‘one-to-one’ deliveries, the company has developed proprietary, real-time batching tech to do ‘many-to-many’ deliveries within a few hours. RaRa is already in partnership with some of the top eCommerce players in Indonesia like Blibli, Sayurbox, Kopi Kenangan and many more.
We are a distributed team with the company headquartered in Singapore 🇸🇬 , core operations in Indonesia 🇮🇩 and technology team based out of India 🇮🇳
Future of eCommerce Logistics.
- Data-driven logistics company bringing the same-day delivery revolution to Indonesia 🇮🇩
- Revolutionising delivery as an experience
- Empowering D2C Sellers with logistics as the core technology
About the Role
- Build and maintain CI/CD tools and pipelines.
- Designing and managing highly scalable, reliable, and fault-tolerant infrastructure & networking that forms the backbone of distributed systems at RaRa Delivery.
- Continuously improve code quality, product execution, and customer delight.
- Communicate, collaborate and work effectively across distributed teams in a global environment.
- Help strengthen teams across the product by sharing your knowledge
- Contribute to improving team relatedness, and help build a culture of camaraderie.
- Continuously refactor applications to ensure high-quality design
- Pair with team members on functional and non-functional requirements and spread design philosophy and goals across the team
- Excellent Bash and scripting fundamentals, with hands-on scripting experience in programming languages such as Python, Ruby, Golang, etc.
- Good understanding of distributed system fundamentals and ability to troubleshoot issues in a larger distributed infrastructure
- Working knowledge of the TCP/IP stack, internet routing, and load balancing (see the sketch after this list)
- Basic understanding of cluster orchestrators and schedulers (Kubernetes)
- Deep knowledge of Linux as a production environment, container technologies (e.g. Docker), Infrastructure as Code such as Terraform, and K8s administration at large scale.
- Have worked on production distributed systems and have an understanding of microservices architecture, RESTful services, CI/CD.
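As a trivial, hedged illustration of the TCP/IP and load-balancing item above, this sketch checks whether backend addresses accept TCP connections; the hosts and ports are placeholders.

```python
"""Minimal sketch: check TCP reachability of load-balancer backends.
Hosts and ports are placeholders for illustration.
"""
import socket

BACKENDS = [("10.0.1.10", 8080), ("10.0.1.11", 8080)]  # placeholder backends


def is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    for host, port in BACKENDS:
        state = "up" if is_reachable(host, port) else "DOWN"
        print(f"{host}:{port} {state}")
```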
- You will manage all elements of the post-sale program relationship with your customers, starting with customer on-boarding and continuing throughout the customer relationship.
- As the primary customer interface, you engage with customer teams to educate, identify needs, develop designs, set goals, manage and execute on plans that unlock continuous, incremental value from their investments in the CloudPassage Halo platform.
- You are hands-on during execution and thoroughly enjoy seeing your security projects come to life and supporting them afterwards. You are a trusted adviser.
Responsibilities :
- Manage a portfolio of 5+ Enterprise customer accounts with complex needs (typical enterprise customers invest between $500k and $4m+ per year with CloudPassage, have hundreds to tens of thousands of individual public cloud infrastructure deployments, and protect hundreds of thousands of cloud infrastructure assets with Halo).
- Provide level-3 technical support on your customer's most complex issues
- Lead implementation of low-level security controls in Cloud environments, for services, server and containers
- Remotely diagnose and resolve DevSecOps issues in customer environments, including DevOps issues that may be interfering with CloudPassage processing.
- Interact with CloudPassage Engineering team by providing customer issue reproduction and data capture, technical diagnostics and validating fixes. QA experience preferred.
- Establish and program manage proactive, value-driven, high-touch relationships with your customers to understand, document and align customer strategies, business objectives, designs, processes and projects with Halo platform capabilities and broader CloudPassage services.
- Develop a trusted advisor relationship by building and maintaining appropriate relationships at all levels with your customer accounts, creating a premium and high-caliber experience.
- Ensure continued satisfaction, identify & confirm unaddressed customer needs that can be value-add opportunities for up-sell and cross-sell, and communicate those needs to the CloudPassage sales team. Identify any early CSAT issues and renewal risks and work with the internal team to remediate and ensure strong CSAT and a successful renewal.
- Be a strong customer advocate within CloudPassage and identify and support areas for improvement in the customer experience, both in our product and processes.
- Be team-oriented, but with a bias towards action to get things done for your customers.
Requirements: Strong cloud security knowledge and experience, including:
- End-to-end enterprise security processes
- Cloud security - cloud migrations & shift in security requirements, tooling & approach
- Hands-on DevOps, DevSecOps architecture & automation (critical)
- 4+ years experience in security consulting and project/program management serving cybersecurity customers.
- Complex, level 3 technical support
- Remotely diagnosing & resolving DevSecOps issues in customer environments
- Interacting with CloudPassage Engineering team with customer issue reproduction
- Experience working in a security SaaS company in a startup environment.
- Experience working with Executive and C-Level teams.
- Ability to build and maintain strong relationships with internal and external constituents.
- Excellent organization, project management, time management, and communication skills.
- Understand and document customer requirements, map to product, track & report metrics, identify up-sell and cross-sell opportunities.
- Analytical both quantitatively and qualitatively.
- Excellent verbal and written communication skills.
- Security certifications (Security +, CISSP, etc.).
Expert Technical Skills :
- Consulting and project management: documenting project charters, project plans, executing delivery management, status reporting. Executive-level presentation skills.
- Security best practices expertise : software vulnerabilities, configuration management, intrusion detection, file integrity.
- System administration (including Linux and Windows) of cloud environments: AWS, Azure, GCP; strong networking/proxy skills.
Proficient Technical Skills :
- Configuration/Orchestration (Chef, Puppet, Ansible, SaltStack, CloudFormation, Terraform).
- CI/CD processes and environments.
Familiar Technical Skills & Knowledge: Python scripting & REST APIs, Docker containers, Zendesk & JIRA.
Your Role:
- Serve as a primary owner responsible for the overall health, performance, and capacity of one or more of our Internet-facing services
- Gain deep knowledge of our complex applications
- Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth
- Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment
- Work closely with development teams to ensure that platforms are designed with "operability" in mind.
- Function well in a fast-paced, rapidly-changing environment
- Should be able to lead a team of smart engineers
- Should be able to strategically guide the team to greater automation adoption
Must Have:
- Experience Building/managing DevOps/SRE teams
- Strong in troubleshooting/debugging Systems, Network and Applications
- Strong in Unix/Linux operating systems and Networking
- Working knowledge of Open source technologies in Monitoring, Deployment and incident management
Good to Have:
- Minimum 3+ years of team management experience
- Experience in Containers and orchestration layers like Kubernetes, Mesos/Marathon
- Proven experience in programming & diagnostics in any languages like Go, Python, Java
- Experience in NoSQL/SQL technologies like Cassandra/MySQL/CouchBase etc.
- Experience in BigData technologies like Kafka/Hadoop/Airflow/Spark
- Is a die-hard sports fan









