We are seeking a highly experienced Azure AI, AIOps & MLOps Architect to lead enterprise-scale AI platform engineering, cloud modernization, DevSecOps transformation, and intelligent automation initiatives.
The ideal candidate should possess deep expertise in Microsoft Azure, Azure AI Foundry, Azure OpenAI, Azure Machine Learning, Kubernetes, Terraform, Azure DevOps, and enterprise observability platforms. The role will focus on designing scalable AI platforms, implementing MLOps and AIOps capabilities, enabling Agentic AI architectures, and driving cloud-native engineering practices across the organization.
Key Responsibilities
Cloud Architecture & Engineering
• Design and implement scalable, secure, and highly available solutions on Microsoft Azure.
• Define cloud architecture standards, reference architectures, and best practices.
• Lead cloud migration and modernisation initiatives across enterprise workloads.
• Implement multi-region disaster recovery and business continuity strategies.
• Oversee Azure networking, identity, security, and governance frameworks.
DevOps & CI/CD
• Architect and implement end-to-end CI/CD pipelines using Azure DevOps or GitHub Actions.
• Drive DevSecOps culture — embedding security scanning, quality gates, and compliance into the delivery pipeline.
• Champion Infrastructure-as-Code (IaC) practices using Terraform, Bicep, or ARM templates.
• Establish branching strategies, release management, and environment promotion standards.
• Define and enforce platform engineering standards and internal developer tooling.
AI & Machine Learning Integration
• Architect AI/ML solutions leveraging Azure AI services — Azure OpenAI, Azure Machine Learning, Azure AI Foundry, and Cognitive Services.
• Design intelligent automation and agentic workflows integrated into enterprise DevOps processes.
• Implement AI-powered capabilities such as code review assistance, anomaly detection, predictive analytics, and natural language automation.
• Define AI governance frameworks: model evaluation, prompt management, responsible AI, and cost controls.
• Design and implement enterprise MLOps frameworks.
• Build automated model training, validation, deployment, and monitoring pipelines.
• Establish model governance and lifecycle management.
Generative AI & Agentic AI
• Design enterprise GenAI solutions using Azure OpenAI.
• Build AI Agents using Azure AI Foundry.
• Develop Agent-to-Agent communication patterns.
• Implement Retrieval Augmented Generation (RAG) architectures.
• Build enterprise Knowledge Management and AI Skill Registry platforms.
• Design multi-agent orchestration frameworks.
Leadership & Stakeholder Engagement
• Serve as the technical authority and subject matter expert for Azure AI and DevOps practices.
• Mentor and guide junior architects, developers, and DevOps engineers.
• Collaborate with business stakeholders, product owners, and vendors to translate requirements into technical solutions.
• Produce architecture documentation, decision records (ADRs), and roadmaps.
• Represent the technology function in enterprise architecture forums and governance boards.
Required Qualifications
• Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
• 10+ years of experience in cloud engineering and architecture.
• 5+ years of hands-on experience with Microsoft Azure across compute, networking, storage, identity, and data services.
• Proven experience designing and implementing enterprise-grade CI/CD pipelines.
• Strong hands-on expertise with Infrastructure-as-Code (Terraform, Bicep, or ARM).
• Demonstrated experience architecting and deploying AI/ML solutions on Azure (Azure OpenAI, Azure ML, AI Foundry).
• Deep knowledge of DevSecOps principles, tools, and practices.
• Experience with containerisation and orchestration: Docker, Kubernetes (AKS).
• Proficiency in scripting and development: Python, PowerShell, Bash.
• Excellent communication and stakeholder management skills.
Preferred Qualifications
• Microsoft Certified: Azure Solutions Architect Expert.
• Microsoft Certified: DevOps Engineer Expert.
• Microsoft Certified: Azure AI Engineer Associate.
• Experience with Azure API Management (APIM), Event Grid, and Azure Functions.
• Familiarity with Datadog, Prometheus, or equivalent observability platforms.
• Experience in the real estate, retail, or enterprise industry sector.
• Knowledge of agentic AI frameworks and LLM orchestration patterns (LangChain, Semantic Kernel, MCP).
• Background in building Internal Developer Platforms (IDP).

About VDart Digital
About
VDart Digital is a High-growth, Global Digital Solutions, Product Development and Professional Services firm, headquartered in Atlanta, GA, USA with global presence in Canada, Mexico, Belgium, UK, Malaysia, UAE & India with a proven track record in building next generation solutions and connected platforms to transform business and operations.
VDart Digital provides turnkey Digital Transformation, Mobility and Supply Chain Management Solutions using AI/ML/GenAI/Agentic AI, Intelligent Automation, Data Analytics, Blockchain, Cloud, IOT, Identity & Access management, UI/UX, Full-stack and mobile development, Integration etc.
VDart Digital’s Products include TestSamurAI, LendSmartAI; IDocLens, Fleet Management (VGo); Document Verification/Validation Platform (Vvalidate); Employee Engagement Mobile App (V Engage).
Similar jobs

Job Details
- Job Title: Lead DevOps Engineer
- Industry: Consumer Internet, Technology & Travel and Tourism Platform
- Function - IT
- Experience Required: 7-10 years
- Employment Type: Full Time
- Job Location: Bengaluru
- CTC Range: Best in Industry
Criteria:
- Strong Lead DevOps / Infrastructure Engineer Profiles.
- Must have 7+ years of hands-on experience working as a DevOps / Infrastructure Engineer.
- Candidate’s current title must be Lead DevOps Engineer (or equivalent Lead role) in the current organization
- Must have minimum 2+ years of team management / technical leadership experience, including mentoring engineers, driving infrastructure decisions, or leading DevOps initiatives.
- Must have strong hands-on experience with Kubernetes (container orchestration) including deployment, scaling, and cluster management.
- Must have experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, Chef, or Puppet.
- Must have strong scripting and automation experience using Python, Go, Bash, or similar scripting languages.
- Must have working experience with distributed databases or data systems such as MongoDB, Redis, Cassandra, Elasticsearch, or Puppet.
- Must have strong hands-on experience in Observability & Monitoring, CI/CD architecture, and Networking concepts in production environments.
- (Company) – Must be from B2C Product Companies only.
- (Education) – B.E/ B.Tech
Preferred
- Experience working in microservices architecture and event-driven systems.
- Exposure to cloud infrastructure, scalability, reliability, and cost optimization practices.
- (Skills) – Understanding of programming languages such as Go, Python, or Java.
- (Environment) – Experience working in high-growth startup or large-scale production environments.
Job Description
As a DevOps Engineer, you will be working on building and operating infrastructure at scale, designing and implementing a variety of tools to enable product teams to build and deploy their services independently, improving observability across the board, and designing for security, resiliency, availability, and stability. If the prospect of ensuring system reliability at scale and exploring cutting-edge technology to solve problems, excites you, then this is your fit.
Job Responsibilities:
- Own end-to-end infrastructure right from non-prod to prod environment including self-managed DBs
- Codify our infrastructure
- Do what it takes to keep the uptime above 99.99%
- Understand the bigger picture and sail through the ambiguities
- Scale technology considering cost and observability and manage end-to-end processes
- Understand DevOps philosophy and evangelize the principles across the organization
- Strong communication and collaboration skills to break down the silos
Key Qualifications :
- At least 2 years of hands-on experience with cloud infrastructure on AWS or GCP
- Exposure to configuration management and orchestration tools at scale (e.g. Terraform, Ansible, Packer)
- Knowledge in DevOps tools (e.g. Jenkins, Groovy, and Gradle)
- Familiarity with monitoring and alerting tools(e.g. CloudWatch, ELK stack, Prometheus)
- Proven ability to work independently or as an integral member of a team
Preferable Skills :
- Familiarity with standard IT security practices such as encryption, credentials and key management
- Proven ability to acquire various coding languages (Java, Python- ) to support DevOps operation and cloud transformation
- Familiarity in web standards (e.g. REST APIs, web security mechanisms)
- Multi-cloud management experience with GCP / Azure
- Experience in performance tuning, services outage management and troubleshooting
At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio. We are currently seeking 4 DevOps Support Engineer to join one of our clients' teams in India who can start until 20th of July. If you're looking for an exciting opportunity to grow in a innovative environment, this could be the perfect fit for you.
Job requirements
Key Responsibilities:
- Monitor and troubleshoot AWS and/or Azure environments to ensure optimal performance and availability.
- Respond promptly to incidents and alerts, investigating and resolving issues efficiently.
- Perform basic scripting and automation tasks to streamline cloud operations (e.g., Bash, Python).
- Communicate clearly and fluently in English with customers and internal teams.
- Collaborate closely with the Team Lead, following Standard Operating Procedures (SOPs) and escalation workflows.
- Work in a rotating shift schedule, including weekends and nights, ensuring continuous support coverage.
Shift Details:
- Engineers rotate shifts, typically working 4–5 shifts per week.
- Each engineer works about 4 to 5 shifts per week, rotating through morning, evening, and night shifts—including weekends—to cover 24/7 support evenly among the team
- Rotation ensures no single engineer is always working nights or weekends; the load is shared fairly among the team.
Qualifications:
- 2–5 years of experience in DevOps or cloud support roles.
- Strong familiarity with AWS and/or Azure cloud environments.
- Experience with CI/CD tools such as GitHub Actions or Jenkins.
- Proficiency with monitoring tools like Datadog, CloudWatch, or similar.
- Basic scripting skills in Bash, Python, or comparable languages.
- Excellent communication skills in English.
- Comfortable and willing to work in a shift-based support role, including night and weekend shifts.
- Prior experience in a shift-based support environment is preferred.
What We Offer:
- Remote work opportunity — work from anywhere in India with a stable internet connection.
- Comprehensive training program including:
- Shadowing existing processes to gain hands-on experience.
- Learning internal tools, Standard Operating Procedures (SOPs), ticketing systems, and escalation paths to ensure smooth onboarding and ongoing success.
Bito is a startup that is using AI (ChatGPT, OpenAI, etc) to create game-changing productivity experiences for software developers in their IDE and CLI. Already, over 100,000 developers are using Bito to increase their productivity by 31% and performing more than 1 million AI requests per week.
Our founders have previously started, built, and taken a company public (NASDAQ: PUBM), worth well over $1B. We are looking to take our learnings, learn a lot along with you, and do something more exciting this time. This journey will be incredibly rewarding, and is incredibly difficult!
We are building this company with a fully remote approach, with our main teams for time zone management in the US and in India. The founders happen to be in Silicon Valley and India.
We are hiring a DevOps Engineer to join our team.
Responsibilities:
- Collaborate with the development team to design, develop, and implement Java-based applications
- Perform analysis and provide recommendations for Cloud deployments and identify opportunities for efficiency and cost reduction
- Build and maintain clusters for various technologies such as Aerospike, Elasticsearch, RDS, Hadoop, etc
- Develop and maintain continuous integration (CI) and continuous delivery (CD) frameworks
- Provide architectural design and practical guidance to software development teams to improve resilience, efficiency, performance, and costs
- Evaluate and define/modify configuration management strategies and processes using Ansible
- Collaborate with DevOps engineers to coordinate work efforts and enhance team efficiency
- Take on leadership responsibilities to influence the direction, schedule, and prioritization of the automation effort
Requirements:
- Minimum 4+ years of relevant work experience in a DevOps role
- At least 3+ years of experience in designing and implementing infrastructure as code within the AWS/GCP/Azure ecosystem
- Expert knowledge of any cloud core services, big data managed services, Ansible, Docker, Terraform/CloudFormation, Amazon ECS/Kubernetes, Jenkins, and Nginx
- Expert proficiency in at least two scripting/programming languages such as Bash, Perl, Python, Go, Ruby, etc.
- Mastery in configuration automation tool sets such as Ansible, Chef, etc
- Proficiency with Jira, Confluence, and Git toolset
- Experience with automation tools for monitoring and alerts such as Nagios, Grafana, Graphite, Cloudwatch, New Relic, etc
- Proven ability to manage and prioritize multiple diverse projects simultaneously
What do we offer:
At Bito, we strive to create a supportive and rewarding work environment that enables our employees to thrive. Join a dynamic team at the forefront of generative AI technology.
· Work from anywhere
· Flexible work timings
· Competitive compensation, including stock options
· A chance to work in the exciting generative AI space
· Quarterly team offsite events
Exp:8 to 10 years notice periods 0 to 20 days
Job Description :
- Provision Gcp Resources Based On The Architecture Design And Features Aligned With Business Objectives
- Monitor Resource Availability, Usage Metrics And Provide Guidelines For Cost And Performance Optimization
- Assist It/Business Users Resolving Gcp Service Related Issues
- Provide Guidelines For Cluster Automation And Migration Approaches And Techniques Including Ingest, Store, Process, Analyse And Explore/Visualise Data.
- Provision Gcp Resources For Data Engineering And Data Science Projects.
- Assistance With Automated Data Ingestion, Data Migration And Transformation(Good To Have)
- Assistance With Deployment And Troubleshooting Applications In Kubernetes.
- Establish Connections And Credibility In How To Address The Business Needs Via Design And Operate Cloud-Based Data Solutions
Key Responsibilities / Tasks :
- Building complex CI/CD pipelines for cloud native PaaS services such as Databases, Messaging, Storage, Compute in Google Cloud Platform
- Building deployment pipeline with Github CI (Actions)
- Building terraform codes to deploy infrastructure as a code
- Working with deployment and troubleshooting of Docker, GKE, Openshift, and Cloud Run
- Working with Cloud Build, Cloud Composer, and Dataflow
- Configuring software to be monitored by Appdynamics
- Configuring stackdriver logging and monitoring in GCP
- Work with splunk, Kibana, Prometheus and grafana to setup dashboard
Your skills, experience, and qualification :
- Total experience of 5+ Years, in as Devops. Should have at least 4 year of experience in Google could and Github CI.
- Should have strong experience in Microservices/API.
- Should have strong experience in Devops tools like Gitbun CI, teamcity, Jenkins and Helm.
- Should know Application deployment and testing strategies in Google cloud platform.
- Defining and setting development, test, release, update, and support processes for DevOps operation
- Strive for continuous improvement and build continuous integration, continuous development, and constant deployment pipeline (CI/CD Pipeline)
- Excellent understanding of Java
- Knowledge on Kafka, ZooKeeper, Hazelcast, Pub/Sub is nice to have.
- Understanding of cloud networking, security such as software defined networking/firewalls, virtual networks and load balancers.
- Understanding of cloud identity and access
- Understanding of the compute runtime and the differences between native compute, virtual and containers
- Configuration and managing databases such as Oracle, Cloud SQL, and Cloud Spanner.
- Excellent troubleshooting
- Working knowledge of various tools, open-source technologies
- Awareness of critical concepts of Agile principles
- Certification in Google professional Cloud DevOps Engineer is desirable.
- Experience with Agile/SCRUM environment.
- Familiar with Agile Team management tools (JIRA, Confluence)
- Understand and promote Agile values: FROCC (Focus, Respect, Openness, Commitment, Courage)
- Good communication skills
- Pro-active team player
- Comfortable working in multi-disciplinary, self-organized teams
- Professional knowledge of English
- Differentiators : knowledge/experience about
- 7+ years of experience in System Administration, Networking, Automation, Monitoring
- Excellent problem solving, analytical skills and technical troubleshooting skills
- Experience managing systems deployed in public cloud platforms (Microsoft Azure, AWS or Google Cloud)
- Experience implementing and maintaining CI/CD pipelines (Jenkins, Concourse, etc.)
- Linux experience, flavours: Ubuntu, Redhat, CentOS (sysadmin, bash scripting)
- Experience setting up monitoring (Datadog, Splunk, etc.)
- Experience in Infrastructure Automation tools like Terraform
- Experience in Package Manager for Kubernetes like Helm Charts
- Experience with databases and data storage (Oracle, MongoDB, Postgres SQL, ELK stack)
- Experience with Docker
- Experience with orchestration technologies (Kubernetes or DC/OS)
- Familiar with Agile Software Development
Minimum 4 years exp
Skillsets:
- Build automation/CI: Jenkins
- Secure repositories: Artifactory, Nexus
- Build technologies: Maven, Gradle
- Development Languages: Python, Java, C#, Node, Angular, React/Redux
- SCM systems: Git, Github, Bitbucket
- Code Quality: Fisheye, Crucible, SonarQube
- Configuration Management: Packer, Ansible, Puppet, Chef
- Deployment: uDeploy, XLDeploy
- Containerization: Kubernetes, Docker, PCF, OpenShift
- Automation frameworks: Selenium, TestNG, Robot
- Work Management: JAMA, Jira
- Strong problem solving skills, Good verbal and written communication skills
- Good knowledge of Linux environment: RedHat etc.
- Good in shell scripting
- Good to have Cloud Technology : AWS, GCP and Azure

- Proficiency in Integration of various Azure resources (IaaS and PaaS - SQL DB , App Service , Application Insights , databricks , Storage accounts etc) to deliver an end to end automation.
- Thorough understanding of Continuous integration and continuous delivery using Azure DevOps/VSTS.
- Performing cost analysis of the Azure platform to identify where cost efficiencies could be had.
- Proficiency and thorough understanding of Azure RBAC model.
- Sound understanding of Azure Active directory and conditional access policies.
- Good grasp of Azure governance principles and hands-on experience in rolling out compliance and governance polices.
- Proficiency in developing infrastructure automation scripts in the form of ARM templates and Azure Power Shell scripts which can then be provided to application teams as consumables.
- Effective communication skills, both written and verbal for technical and non-technical audiences.
- Good working and hands on knowledge of Azure IaaS , Vnet , Subnets , Firewalls and NSG. Sound understanding of networking knowledge on DNA and Firewall security like Palo Alto.
- Experience working with Confluence, JIRA, Bitbucket, git, Jenkins, Sonar for collaboration and continuous integration.
- Experience with agile methods, along with having found their limitations and ways to overcome them.
DevOps Engineer Skills Building a scalable and highly available infrastructure for data science Knows data science project workflows Hands-on with deployment patterns for online/offline predictions (server/serverless)
Experience with either terraform or Kubernetes
Experience of ML deployment frameworks like Kubeflow, MLflow, SageMaker Working knowledge of Jenkins or similar tool Responsibilities Owns all the ML cloud infrastructure (AWS) Help builds out an entirely CI/CD ecosystem with auto-scaling Work with a testing engineer to design testing methodologies for ML APIs Ability to research & implement new technologies Help with cost optimizations of infrastructure.
Knowledge sharing Nice to Have Develop APIs for machine learning Can write Python servers for ML systems with API frameworks Understanding of task queue frameworks like Celery








