
We are seeking a highly experienced Azure AI, AIOps & MLOps Architect to lead enterprise-scale AI platform engineering, cloud modernization, DevSecOps transformation, and intelligent automation initiatives.
The ideal candidate should possess deep expertise in Microsoft Azure, Azure AI Foundry, Azure OpenAI, Azure Machine Learning, Kubernetes, Terraform, Azure DevOps, and enterprise observability platforms. The role will focus on designing scalable AI platforms, implementing MLOps and AIOps capabilities, enabling Agentic AI architectures, and driving cloud-native engineering practices across the organization.
Key Responsibilities
Cloud Architecture & Engineering
• Design and implement scalable, secure, and highly available solutions on Microsoft Azure.
• Define cloud architecture standards, reference architectures, and best practices.
• Lead cloud migration and modernisation initiatives across enterprise workloads.
• Implement multi-region disaster recovery and business continuity strategies.
• Oversee Azure networking, identity, security, and governance frameworks.
DevOps & CI/CD
• Architect and implement end-to-end CI/CD pipelines using Azure DevOps or GitHub Actions.
• Drive DevSecOps culture — embedding security scanning, quality gates, and compliance into the delivery pipeline.
• Champion Infrastructure-as-Code (IaC) practices using Terraform, Bicep, or ARM templates.
• Establish branching strategies, release management, and environment promotion standards.
• Define and enforce platform engineering standards and internal developer tooling.
AI & Machine Learning Integration
• Architect AI/ML solutions leveraging Azure AI services — Azure OpenAI, Azure Machine Learning, Azure AI Foundry, and Cognitive Services.
• Design intelligent automation and agentic workflows integrated into enterprise DevOps processes.
• Implement AI-powered capabilities such as code review assistance, anomaly detection, predictive analytics, and natural language automation.
• Define AI governance frameworks: model evaluation, prompt management, responsible AI, and cost controls.
• Design and implement enterprise MLOps frameworks.
• Build automated model training, validation, deployment, and monitoring pipelines.
• Establish model governance and lifecycle management.
Generative AI & Agentic AI
- Design enterprise GenAI solutions using Azure OpenAI.
- Build AI Agents using Azure AI Foundry.
- Develop Agent-to-Agent communication patterns.
- Implement Retrieval Augmented Generation (RAG) architectures.
- Build enterprise Knowledge Management and AI Skill Registry platforms.
- Design multi-agent orchestration frameworks.
Leadership & Stakeholder Engagement
• Serve as the technical authority and subject matter expert for Azure AI and DevOps practices.
• Mentor and guide junior architects, developers, and DevOps engineers.
• Collaborate with business stakeholders, product owners, and vendors to translate requirements into technical solutions.
• Produce architecture documentation, decision records (ADRs), and roadmaps.
• Represent the technology function in enterprise architecture forums and governance boards.
Required Qualifications
• Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
• 10+ years of experience in cloud engineering and architecture.
• 5+ years of hands-on experience with Microsoft Azure across compute, networking, storage, identity, and data services.
• Proven experience designing and implementing enterprise-grade CI/CD pipelines.
• Strong hands-on expertise with Infrastructure-as-Code (Terraform, Bicep, or ARM).
• Demonstrated experience architecting and deploying AI/ML solutions on Azure (Azure OpenAI, Azure ML, AI Foundry).
• Deep knowledge of DevSecOps principles, tools, and practices.
• Experience with containerisation and orchestration: Docker, Kubernetes (AKS).
• Proficiency in scripting and development: Python, PowerShell, Bash.
• Excellent communication and stakeholder management skills.
Preferred Qualifications
• Microsoft Certified: Azure Solutions Architect Expert.
• Microsoft Certified: DevOps Engineer Expert.
• Microsoft Certified: Azure AI Engineer Associate.
• Experience with Azure API Management (APIM), Event Grid, and Azure Functions.
• Familiarity with Datadog, Prometheus, or equivalent observability platforms.
• Experience in the real estate, retail, or enterprise industry sector.
• Knowledge of agentic AI frameworks and LLM orchestration patterns (LangChain, Semantic Kernel, MCP).
• Background in building Internal Developer Platforms (IDP).
Technical Skills
Domain
Technologies / Tools
Cloud Platform
Microsoft Azure (UAE North / Global)
DevOps & CI/CD
Azure DevOps, GitHub Actions, Jenkins
IaC
Terraform, Bicep, ARM Templates
AI / ML
Azure OpenAI, Azure AI Foundry, Azure ML, Cognitive Services
Containers
Docker, Kubernetes (AKS), Azure Container Apps
Identity & Security
Microsoft Entra ID, Azure Policy, Defender for Cloud
Observability
Datadog, Azure Monitor, Log Analytics, Application Insights
Databases
Azure SQL, PostgreSQL, Cosmos DB
Languages
Python, PowerShell, Bash, YAML

About VDart Digital
Similar jobs
Job Title: Senior DevOps Engineer
Location: Gurgaon – Sector 39
Work Mode: 5 Days Onsite
Experience: 5+ Years
About the Role
We are looking for an experienced Senior DevOps Engineer to build, manage, and maintain highly reliable, scalable, and secure infrastructure. The role involves deploying product updates, handling production issues, implementing customer integrations, and leading DevOps best practices across teams.
Key Responsibilities
- Manage and maintain production-grade infrastructure ensuring high availability and performance.
- Deploy application updates, patches, and bug fixes across environments.
- Handle Level-2 support and resolve escalated production issues.
- Perform root cause analysis and implement preventive solutions.
- Build automation tools and scripts to improve system reliability and efficiency.
- Develop monitoring, logging, alerting, and reporting systems.
- Ensure secure deployments following data encryption and cybersecurity best practices.
- Collaborate with development, product, and QA teams for smooth releases.
- Lead and mentor a small DevOps team (3–4 engineers).
Core Focus Areas
Server Setup & Management (60%)
- Hands-on management of bare-metal servers.
- Server provisioning, configuration, and lifecycle management.
- Network configuration including redundancy, bonding, and performance tuning.
Queue Systems – Kafka / RabbitMQ (15%)
- Implementation and management of message queues for distributed systems.
Storage Systems – SAN / NAS (15%)
- Setup and management of enterprise storage systems.
- Ensure backup, recovery, and data availability.
Database Knowledge (5%)
- Working experience with Redis, MySQL/PostgreSQL, MongoDB, Elasticsearch.
- Basic database administration and performance tuning.
Telecom Exposure (Good to Have – 5%)
- Experience with SMS, voice systems, or real-time data processing environments.
Technical Skills Required
- Linux administration & Shell scripting
- CI/CD tools – Jenkins
- Git (GitHub / SVN) and branching strategies
- Docker & Kubernetes
- AWS cloud services
- Ansible for configuration management
- Databases: MySQL, MariaDB, MongoDB
- Web servers: Apache, Tomcat
- Load balancing & HA: HAProxy, Keepalived
- Monitoring tools: Nagios and related observability stacks
DevOps Engineer
We are looking for a hands-on DevOps Engineer to manage and scale our cloud infrastructure, Kubernetes-based microservice deployments, monitoring systems, and data engineering infrastructure.
The person will be responsible for building reliable, secure, scalable, and cost-efficient infrastructure using automation-first practices. This role is important for supporting a high-growth B2C platform where availability, deployment velocity, observability, security, and cost efficiency are critical.
Key Responsibilities
- Manage and automate cloud infrastructure using Terraform.
- Deploy, manage, and troubleshoot microservices on Kubernetes.
- Build and maintain CI/CD pipelines to ensure reliable, controlled deployments.
- Implement safe release practices, including rolling deployments, rollback, and zero-downtime deployments.
- Manage monitoring, logging, alerting, dashboards, and production runbooks.
- Support incident response, production debugging, RCA, and preventive action closure.
- Ensure infrastructure is scalable, secure, highly available, and cost-optimised.
- Support data engineering infrastructure, including ClickHouse, PeerDB, Airflow, Kafka, and related platform components.
- Maintain infra-level security controls, backups, disaster recovery, and access governance.
Required Skills
- Strong experience with Terraform, Infrastructure as Code, and AWS.
- Strong experience with Kubernetes, Docker, Helm, ingress, and autoscaling.
- Experience with CI/CD tools such as GitHub Actions, GitLab CI, Jenkins, ArgoCD, or similar.
- Experience with monitoring and observability tools such as Prometheus, Grafana, ELK/OpenSearch, New Relic, or similar.
- Good understanding of cloud networking, DNS, load balancers, VPC/VPN, SSL/TLS, firewalls, and WAF.
- Experience with Linux administration, shell scripting, and automation.
- Understanding of cloud security, IAM, secrets management, and access governance.
- Exposure to databases, queues, caches, and data infrastructure tools such as ClickHouse, PeerDB, Airflow, Kafka, or similar.
- Strong debugging and problem-solving skills during production incidents.
- Ability to work closely with engineering teams to improve deployment, monitoring, cost, and reliability.
What the role needs
● Review of current DevOps infrastructure & redefine code merging strategy as per product roll out objectives
● Define deploy frequency strategy based on product roadmap document and ongoing product market fit relate tweaks and changes
● Architect benchmark docker configurations based on planned stack
● Establish uniformity of environment across developer machine to multiple production environments
● Plan & execute test automation infrastructure
● Setup automated stress testing environment
● Plan and execute logging & stack trace tools
● Review DevOps orchestration tools & choices
● Coordination with external data centers and AWS in the event of provisioning, outages or maintenance.
Requirements
● Extensive experience with AWS cloud infrastructure deployment and monitoring
● Advanced knowledge of programming languages such as Python and golang, and writing code and scripts
● Experience with Infrastructure as code & devops management tools - Terraform, Packer for devops asset management for monitoring, infrastructure cost estimations, and Infrastructure version management
● Configure and manage data sources like MySQL, MongoDB, Elasticsearch, Redis, Cassandra, Hadoop, etc
● Experience with network, infrastructure and OWASP security standards
● Experience with web server configurations - Nginx, HAProxy, SSL configurations with AWS, understanding & management of sub-domain based product rollout for clients .
● Experience with deployment and monitoring of event streaming & distributing technologies and tools - Kafka, RabbitMQ, NATS.io, socket.io
● Understanding & experience of Disaster Recovery Plan execution
● Working with other senior team members to devise and execute strategies for data backup and storage
● Be aware of current CVEs, potential attack vectors, and vulnerabilities, and apply patches as soon as possible
● Handle incident responses, troubleshooting and fixes for various services
Responsibilities:
- Writing and maintaining the automation for deployments across various cloud (AWS/Azure/GCP)
- Bring a passion to stay on top of DevOps trends, experiment, and learn new CI/CD technologies.
- Creating the Architecture Diagrams and documentation for various pieces
- Build tools and automation to improve the system's observability, availability, reliability, performance/latency, monitoring, emergency response
Requirements:
- 3 - 5 years of professional experience as a DevOps / System Engineer.
- Strong knowledge in Systems Administration & troubleshooting skills with Linux.
- Experience with CI/CD best practices and tooling, preferably Jenkins, Circle CI.
- Hands-on experience with Cloud platforms such as AWS/Azure/GCP or private cloud environments.
- Experience and understanding of modern container orchestration, Well-versed with the containerised applications (Docker, Docker-compose, Docker-swarm, Kubernetes).
- Experience in Infrastructure as code development using Terraform.
- Basic Networking knowledge VLAN, Subnet, VPC, Webserver like Nginx, Apache.
- Experience in handling different SQL and NoSQL databases (PostgreSQL, MySQL, Mongo).
- Experience with GIT Version Control Software.
- Proficiency in any programming or scripting language such as Shell Script, Python, Golang.
- Strong interpersonal and communication skills; ability to work in a team environment.
- AWS / Kubernetes Certifications: AWS Certified Solutions Architect / CKA.
- Setup and management of a Kubernetes cluster, including writing Docker files.
- Experience working in and advocating for agile environments.
- Knowledge in Microservice architecture.
The DevOps Engineer's core responsibilities include automated configuration and management
of infrastructure, continuous integration and delivery of distributed systems at scale in a Hybrid
environment.
Must-Have:
● You have 4-10 years of experience in DevOps
● You have experience in managing IT infrastructure at scale
● You have experience in automation of deployment of distributed systems and in
infrastructure provisioning at scale.
● You have in-depth hands-on experience on Linux and Linux-based systems, Linux
scripting
● You have experience in Server hardware, Networking, firewalls
● You have experience in source code management, configuration management,
continuous integration, continuous testing, continuous monitoring
● You have experience with CI/CD and related tools
* You have experience with Monitoring tools like ELK, Grafana, Prometheus
● You have experience with containerization, container orchestration, management
● Have a penchant for solving complex and interesting problems.
● Worked in startup-like environments with high levels of ownership and commitment.
● BTech, MTech or Ph.D. in Computer Science or related Technical Discipline
We are looking for a tech enthusiast to work in challenging environment, we are looking for a person who is self driven, proactive and has good experience in Azure, DevOps, Asp.Net etc. share your resume today if this interests you.
Job Location: Pune
About us:
JetSynthesys is a leading gaming and entertainment company with a wide portfolio of world class products, platforms, and services. The company has a robust foothold in the cricket community globally with its exclusive JV with Sachin Tendulkar for the popular Sachin Saga game and a 100% ownership of Nautilus Mobile - the developer of India’s largest cricket simulation game Real Cricket, becoming the #1 cricket gaming franchise in the world. Standing atop in the charts of organizations fueling Indian esports gaming industry, Jetsysthesys was the earliest entrant in the e-sports industry with a founding 50% stake in India’s largest esports company, Nodwin Gaming that is recently funded by popular South Korean gaming firm Krafton.
Recently, the company has developed WWE Racing Showdown, a high-octane vehicular combat game borne out of a strategic partnership with WWE. Adding to the list is the newly launched board game - Ludo Zenith, a completely reimagined ludo experience for gamers, built in partnership with Square Enix - a Japanese gaming giant.
JetSynthesys Pvt. Ltd. is proud to be backed by Mr. Adar Poonawalla - Indian business tycoon and CEO of Serum Institute of India, Mr. Kris Gopalakrishnan – Co-founder of Infosys and the family offices of Jetline Group of Companies. JetSynthesys’ partnerships with large gaming companies in the US, Europe and Japan give it an opportunity to build great products not only for India but also for the world.
Responsibilities
- As a Security & Azure DevOps engineer technical specialist you will be responsible for advising and assisting in the architecture, design and implementation of secure infrastructure solutions
- Should be capable of technical deep dives into infrastructure, databases, and application, specifically in operating, and supporting high-performance, highly available services and infrastructure
· Deep understanding of cloud computing technologies across Windows, with demonstrated hands-on experience on the following domains:
· Experience in building, deploying and monitoring Azure services with strong IaaS and PaaS services (Redis Cache, Service Bus, Event Hub, Cloud Service etc.)
· Understanding of API endpoint management
· Able to monitor, maintain serverless architect using function /web apps
· Logs analysis capabilities using elastic search and Kibana dashboard
· Azure Core Platform: Compute, Storage, Networking
· Data Platform: SQL, Cosmo DB, MongoDB and JQL query
· Identity and Authentication: SSO Federation, ADAzure AD etc
· Experience with Azure Storage, Backup and Express Route
· Hands-on experience in ARM templates
· Ability to write PowerShell & Python scripts to automate IT Operations
· Working Knowledge of Azure OMS and Configuration of OMS Dashboards is desired
· VSTS Deployments
· You will help assist in stabilizing developed solutions by understanding the relevant application development, infrastructure and operations implications of the developed solution.
· Use of DevOps tools to deliver and operate end-user services a plus (e.g., Chef, New Relic, Puppet, etc.)
· Able to deploy and re-create of resources
· Building terraforms for cloud infrastructure
· Having ITIL/ITASM standards as best practise
· PEN testing and OWASP security testing will be addon bonus
· Load and performance testing using Jmeter
· Capacity review on daily basis
· Handing repeated issues and solving them using ansible automation
· Jenkin pipelines for CI/CD
· Code review platform using sonarqube
· Cost analysis on regular basis to keep the system and resources optimum
· Experience on ASP.Net, C#, .Net programming is must.
Qualifications:
Minimum graduate (Preferred Stream: IT/Technical)
- Develop and Maintain IAC using Terraform and Ansible
- Draft design documents that translate requirements into code.
- Deal with challenges associated with scale.
- Assume responsibilities from technical design through technical client support.
- Manage expectations with internal stakeholders and context-switch in a fast paced environment.
- Thrive in an environment that uses Elasticsearch extensively.
- Keep abreast of technology and contribute to the engineering strategy.
- Champion best development practices and provide mentorship
An AWS Certified Engineer with strong skills in
- Terraform o Ansible
- *nix and shell scripting
- Elasticsearch
- Circle CI
- CloudFormation
- Python
- Packer
- Docker
- Prometheus and Grafana
- Challenges of scale
- Production support
- Sharp analytical and problem-solving skills.
- Strong sense of ownership.
- Demonstrable desire to learn and grow.
- Excellent written and oral communication skills.
- Mature collaboration and mentoring abilities.
Roles and Responsibilities
● Managing Availability, Performance, Capacity of infrastructure and applications.
● Building and implementing observability for applications health/performance/capacity.
● Optimizing On-call rotations and processes.
● Documenting “tribal” knowledge.
● Managing Infra-platforms like
- Mesos/Kubernetes
- CICD
- Observability(Prometheus/New Relic/ELK)
- Cloud Platforms ( AWS/ Azure )
- Databases
- Data Platforms Infrastructure
● Providing help in onboarding new services with the production readiness review process.
● Providing reports on services SLO/Error Budgets/Alerts and Operational Overhead.
● Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
● Working with the Dev team to have an in-depth understanding of the application architecture and its bottlenecks.
● Identifying observability gaps in product services, infrastructure and working with stake owners to fix it.
● Managing Outages and doing detailed RCA with developers and identifying ways to avoid that situation.
● Managing/Automating upgrades of the infrastructure services.
● Automate toil work.
Experience & Skills
● 3+ Years of experience as an SRE/DevOps/Infrastructure Engineer on large scale microservices and infrastructure.
● A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
● A deep understanding of computer science, software development, and networking principles.
● Demonstrated experience with languages, such as Python, Java, Golang etc.
● Extensive experience with Linux administration and good understanding of the various linux kernel subsystems (memory, storage, network etc).
● Extensive experience in DNS, TCP/IP, UDP, GRPC, Routing and Load Balancing.
● Expertise in GitOps, Infrastructure as a Code tools such as Terraform etc.. and Configuration Management Tools such as Chef, Puppet, Saltstack, Ansible.
● Expertise of Amazon Web Services (AWS) and/or other relevant Cloud Infrastructure solutions like Microsoft Azure or Google Cloud.
● Experience in building CI/CD solutions with tools such as Jenkins, GitLab, Spinnaker, Argo etc.
● Experience in managing and deploying containerized environments using Docker,
Mesos/Kubernetes is a plus.
● Experience with multiple datastores is a plus (MySQL, PostgreSQL, Aerospike,
Couchbase, Scylla, Cassandra, Elasticsearch).
● Experience with data platforms tech stacks like Hadoop, Hive, Presto etc is a plus
Requirements:-
- Must have good understanding of Python and Shell scripting with industry standard coding conventions
- Must possess good coding debugging skills
- Experience in Design & Development of test framework
- Experience in Automation testing
- Good to have experience in Jenkins framework tool
- Good to have exposure to Continuous Integration process
- Experience in Linux and Windows OS
- Desirable to have Build & Release Process knowledge
- Experience in Automating Manual test cases
- Experienced in automating OS / FW related tasks
- Understanding of BIOS / FW QA is a strong plus
- OpenCV experience is a plus
- Good to have platform exposure
- Must have good Communication skills
- Good Leadership capabilities & collaboration capabilities, as individual will have to work with multiple teams and single handedly maintain the automation framework and enable the Manual validation team










