VMax eSolutions India Pvt Ltd

https://vmaxindia.com
Founded: 2008
Type: Services
Size: 100-1000
Stage: Profitable

About

VMAX is an ISO 9001:2015 and ISO 27001:2013 certified organisation based in Hyderabad, India. It builds custom, tailor-made, and scalable products that can be easily integrated with third-party systems. VMAX provides cutting-edge solutions to its clients and has helped organizations emerge as winners in their respective industries.

Jobs at VMax eSolutions India Pvt Ltd

VMax eSolutions India Pvt Ltd
at VMax eSolutions India Pvt Ltd
Posted by Bachu Sai Nikheel

The recruiter has not been active on this job recently. You may apply but please expect a delayed response.

Hyderabad
10 - 15 yrs
₹35L - ₹45L / yr
Generative AI
PEFT (Parameter-Efficient Fine-Tuning)
Voice processing
Artificial Intelligence (AI)
GPU computing

We are seeking an experienced AI Architect to design, build, and scale production-ready AI voice conversation agents deployed locally (on-prem / edge / private cloud) and optimized for GPU-accelerated, high-throughput environments.

You will own the end-to-end architecture of real-time voice systems, including speech recognition, LLM orchestration, dialog management, speech synthesis, and low-latency streaming pipelines—designed for reliability, scalability, and cost efficiency.

This role is highly hands-on and strategic, bridging research, engineering, and production infrastructure.


Key Responsibilities

Architecture & System Design

  • Design low-latency, real-time voice agent architectures for local/on-prem deployment
  • Define scalable architectures for ASR → LLM → TTS pipelines
  • Optimize systems for GPU utilization, concurrency, and throughput
  • Architect fault-tolerant, production-grade voice systems (HA, monitoring, recovery)

Voice & Conversational AI

  • Design and integrate:
      • Automatic Speech Recognition (ASR)
      • Natural Language Understanding / LLMs
      • Dialogue management & conversation state
      • Text-to-Speech (TTS)
  • Build streaming voice pipelines with sub-second response times
  • Enable multi-turn, interruptible, natural conversations

Model & Inference Engineering

  • Deploy and optimize local LLMs and speech models (quantization, batching, caching)
  • Select and fine-tune open-source models for voice use cases
  • Implement efficient inference using TensorRT, ONNX, CUDA, vLLM, Triton, or similar

Infrastructure & Production

  • Design GPU-based inference clusters (bare metal or Kubernetes)
  • Implement autoscaling, load balancing, and GPU scheduling
  • Establish monitoring, logging, and performance metrics for voice agents
  • Ensure security, privacy, and data isolation for local deployments

Leadership & Collaboration

  • Set architectural standards and best practices
  • Mentor ML and platform engineers
  • Collaborate with product, infra, and applied research teams
  • Drive decisions from prototype → production → scale

Required Qualifications

Technical Skills

  • 7+ years in software / ML systems engineering
  • 3+ years designing production AI systems
  • Strong experience with real-time voice or conversational AI systems
  • Deep understanding of LLMs, ASR, and TTS pipelines
  • Hands-on experience with GPU inference optimization
  • Strong Python and/or C++ background
  • Experience with Linux, Docker, Kubernetes

AI & ML Expertise

  • Experience deploying open-source LLMs locally
  • Knowledge of model optimization:
      • Quantization
      • Batching
      • Streaming inference
  • Familiarity with voice models (e.g., Whisper-like ASR, neural TTS)

Systems & Scaling

  • Experience with high-QPS, low-latency systems
  • Knowledge of distributed systems and microservices
  • Understanding of edge or on-prem AI deployments

Preferred Qualifications

  • Experience building AI voice agents or call automation systems
  • Background in speech processing or audio ML
  • Experience with telephony, WebRTC, SIP, or streaming audio
  • Familiarity with Triton Inference Server / vLLM
  • Prior experience as Tech Lead or Principal Engineer

What We Offer

  • Opportunity to architect state-of-the-art AI voice systems
  • Work on real-world, high-scale production deployments
  • Competitive compensation and equity (if applicable)
  • High ownership and technical influence
  • Collaboration with top-tier AI and infrastructure talent
VMax eSolutions India Pvt Ltd
at VMax eSolutions India Pvt Ltd
Posted by Bachu Sai Nikheel
Hyderabad
3 - 5 yrs
₹20L - ₹25L / yr
Artificial Intelligence (AI)
AI Agents
Voice recognition
Generative AI
Machine Learning (ML)

Company Description


VMax e-Solutions India Private Limited, based in Hyderabad, is a dynamic organization specializing in Open Source ERP Product Development and Mobility Solutions. As an ISO 9001:2015 and ISO 27001:2013 certified company, VMax is dedicated to delivering tailor-made and scalable products, with a strong focus on e-Governance projects across multiple states in India. The company's innovative technologies aim to solve real-life problems and enhance the daily services accessed by millions of citizens. With a culture of continuous learning and growth, VMax provides its team members opportunities to develop expertise, take ownership, and grow their careers through challenging and impactful work.


About the Role


We’re hiring a Senior Data Scientist with deep real-time voice AI experience and strong backend engineering skills.


You’ll own and scale our end-to-end voice agent pipeline that powers AI SDRs, customer support agents, and internal automation agents on calls. This is a hands-on, highly technical role where you’ll design and optimize low-latency, high-reliability voice systems.


You’ll work closely with our founders, product, and platform teams, with significant ownership over architecture and benchmarks.


What You’ll Do


1. Own the voice stack end-to-end – from telephony / WebRTC entrypoints to STT, turn-taking, LLM reasoning, and TTS back to the caller.


2. Design for real-time – architect and optimize streaming pipelines for sub-second latency, barge-in, interruptions, and graceful recovery on bad networks.


3. Integrate and tune models – evaluate, select, and integrate STT/TTS/LLM/VAD providers (and self-hosted models) for different use-cases, balancing quality, speed, and cost.


4. Build orchestration & tooling – implement agent orchestration logic, evaluation frameworks, call simulators, and dashboards for latency, quality, and reliability.


5. Harden for production – ensure high availability, observability, and robust fault-tolerance for thousands of concurrent calls in customer VPCs.


6. Shape the voice roadmap – influence how voice fits into our broader Agentic OS vision (simulation, analytics, multi-agent collaboration, etc.).


You’re a Great Fit If You Have


1. 6+ years of software engineering experience (backend or full-stack) in production systems.


2. Strong experience building real-time voice agents or similar systems using:


   • STT / ASR (e.g. Whisper, Deepgram, Assembly, AWS Transcribe, GCP Speech)
   • TTS (e.g. ElevenLabs, PlayHT, AWS Polly, Azure Neural TTS)
   • VAD / turn-taking and streaming audio pipelines
   • LLMs (e.g. OpenAI, Anthropic, Gemini, local models)


3. Proven track record designing and operating low-latency, high-throughput streaming systems (WebRTC, gRPC, websockets, Kafka, etc.).


4. Hands-on experience integrating ML models into live, user-facing applications with real-time inference & monitoring.


5. Solid backend skills with Python and TypeScript/Node.js; strong fundamentals in distributed systems, concurrency, and performance optimization.


6. Experience with cloud infrastructure – especially AWS (EKS, ECS, Lambda, SQS/Kafka, API Gateway, load balancers).


7. Comfortable working in Kubernetes / Docker environments, including logging, metrics, and alerting.


8. Startup DNA – at least 2 years in an early or mid-stage startup where you shipped fast, owned outcomes, and worked close to the customer.


Nice to Have


1. Experience self-hosting AI models (ASR / TTS / LLMs) and optimizing them for latency, cost, and reliability.


2. Telephony integration experience (e.g. Twilio, Vonage, Aircall, SignalWire, or similar).


3. Experience with evaluation frameworks for conversational agents (call quality scoring, hallucination checks, compliance rules, etc.).


4. Background in speech processing, signal processing, or dialog systems.


5. Experience deploying into enterprise VPC / on-prem environments and working with security/compliance constraints.


Similar companies


NonStop io Technologies Pvt Ltd

https://nonstopio.com
Founded
2015
Type
Products & Services
Size
20-100
Stage
Profitable

About the company

NonStop io Technologies Pvt. Ltd., established in 2015, is a software product development company. We invest in our client’s vision, build the technology, and make sure the end product aligns with their business goals over the short and long term.

Jobs

10


HighLevel Inc.

https://gohighlevel.com
Founded
2018
Type
Product
Size
100-500
Stage
Profitable

About the company

About Us

HighLevel is an AI powered, all-in-one white-label sales & marketing platform that empowers agencies, entrepreneurs, and businesses to elevate their digital presence and drive growth. We are proud to support a global and growing community of over 2 million businesses, comprised of agencies, consultants, and businesses of all sizes and industries. HighLevel empowers users with all the tools needed to capture, nurture, and close new leads into repeat customers. As of mid 2025, HighLevel processes over 15 billion API hits and handles more than 2.5 billion message events every day. Our platform manages over 470 terabytes of data distributed across five databases, operates with a network of over 250 microservices, and supports over 1 million domain names.


Our People

With over 1,500 team members across 15+ countries, we operate in a global, remote-first environment. We are building more than software; we are building a global community rooted in creativity, collaboration, and impact. We take pride in cultivating a culture where innovation thrives, ideas are celebrated, and people come first, no matter where they call home.


Our Impact

As of mid 2025, our platform powers over 1.5 billion messages, helps generate over 200 million leads, and facilitates over 20 million conversations for the more than 2 million businesses we serve each month. Behind those numbers are real people growing their companies, connecting with customers, and making their mark - and we get to help make that happen.


EEO Statement:

At HighLevel, we value diversity. In fact, we understand it makes our organisation stronger. We are committed to inclusive hiring/promotion practices that evaluate skill sets, abilities, and qualifications without regard to any characteristic unrelated to performing the job at the highest level. Our objective is to foster an environment where really talented employees from all walks of life can be their true and whole selves, cherished and welcomed for their differences while providing excellent service to our clients and learning from one another along the way! Reasonable accommodations may be made to enable individuals with disabilities to perform essential functions.

Jobs

9


Inferigence Quotient

https://inferq.com
Founded
2017
Type
Products & Services
Size
20-100
Stage
Bootstrapped

About the company

Deep Tech Startup Focusing on Autonomy and Intelligence for Unmanned Systems. Guidance and Navigation, AI-ML, Computer Vision, Information Fusion, LLMs, Generative AI, Remote Sensing

Jobs

4


LogIQ Labs Pvt.Ltd.

https://eshipz.com
Founded
2019
Type
Services
Size
20-100
Stage
Raised funding

About the company

eShipz: Simplifying Global Shipping for Businesses. At eShipz, we are revolutionizing how businesses manage their shipping processes. Our platform is designed to offer seamless multi-carrier integration, enabling businesses of all sizes to ship effortlessly across the globe. Whether you're an e-commerce brand, a manufacturer, or a logistics provider, eShipz helps streamline your supply chain with real-time tracking, automated shipping labels, cost-effective shipping rates, and comprehensive reporting.


Our goal is to empower businesses by simplifying logistics, reducing shipping costs, and improving operational efficiency. With an easy-to-use dashboard and a dedicated support team, eShipz ensures that you focus on scaling your business while we handle your shipping needs.


Jobs

9


Sun King

https://sunking.com
Founded
2009
Type
Product
Size
1000-5000
Stage
Profitable

About the company

Sun King is a leading global provider of off-grid solar energy solutions, designed to serve the 1.8 billion people who lack reliable or affordable access to traditional electrical grids. With a mission to power brighter lives, the company focuses on underserved markets across Africa and Asia. Sun King's product range includes solar lanterns, solar home systems, and solar inverters, tailored to meet a variety of energy needs—from portable lighting to powering entire homes.


The company's innovative solutions, such as the recently launched PowerHub 3300 and expandable solar home systems, reflect their commitment to evolving customer demands. With operations in over 40 countries and millions of products sold, Sun King makes solar energy accessible through pay-as-you-go financing options. The company’s network of field agents plays a key role in selling, installing, and servicing products, driving local economic development. Rooted in sustainability, Sun King also implements a Sustainable Financing Framework and ensures customer satisfaction through extensive service centers and after-sales support.

Jobs

0


Auxo AI

https://auxoai.com
Founded
2022
Type
Products & Services
Size
100-1000
Stage
Raised funding

About the company

Jobs

7


Hunarstreet Technologies Pvt Ltd

https://hunarstreet.com
Founded
2022
Type
Services
Size
10-50
Stage
Profitable

About the company

At Hunarstreet Technologies Pvt Ltd, we specialize in delivering India’s fastest hiring solutions, tailored to meet the unique needs of businesses across various industries. Our mission is to connect companies with exceptional talent, enabling them to achieve their growth and operational goals swiftly and efficiently.

We achieve an 87% success rate in matching candidates to the job position and a 62% success rate in closing the positions shared with us.

Jobs

598


13KBS

https://13kbs.com
Founded
2018
Type
Products & Services
Size
0-20
Stage
Bootstrapped

About the company

Jobs

2


Hello Edge

https://helloedge.in
Founded
2025
Type
Services
Size
10-50
Stage
Bootstrapped

About the company

Jobs

3


Skillinabox

https://skillinabox.in
Founded
2022
Type
Products & Services
Size
100-1000
Stage
Profitable

About the company

Skillinabox is India's first Online Fashion Designing & Cosmetology platform to help you become a Fashion Designer right from your home and start your income.

Jobs

1
