VMax eSolutions India Pvt Ltd

https://vmaxindia.com
Founded: 2008
Type: Services
Size: 100-1000
Stage: Profitable

About

VMAX is an ISO 9001:2015 and ISO 27001:2013 certified organisation based in Hyderabad, India. It builds custom, tailor-made, and scalable products that can be easily integrated with third-party systems. VMAX provides cutting-edge solutions to its clients and has helped organizations emerge as winners in their respective industries.

Jobs at VMax eSolutions India Pvt Ltd

VMax eSolutions India Pvt Ltd
at VMax eSolutions India Pvt Ltd
Posted by nikhil g
Hyderabad
10 - 15 yrs
₹35L - ₹45L / yr
Generative AI
PEFT (Parameter-Efficient Fine-Tuning)
Voice processing
Artificial Intelligence (AI)
GPU computing

We are seeking an experienced AI Architect to design, build, and scale production-ready AI voice conversation agents deployed locally (on-prem / edge / private cloud) and optimized for GPU-accelerated, high-throughput environments.

You will own the end-to-end architecture of real-time voice systems, including speech recognition, LLM orchestration, dialog management, speech synthesis, and low-latency streaming pipelines—designed for reliability, scalability, and cost efficiency.

This role is highly hands-on and strategic, bridging research, engineering, and production infrastructure.


Key Responsibilities

Architecture & System Design

  • Design low-latency, real-time voice agent architectures for local/on-prem deployment
  • Define scalable architectures for ASR → LLM → TTS pipelines
  • Optimize systems for GPU utilization, concurrency, and throughput
  • Architect fault-tolerant, production-grade voice systems (HA, monitoring, recovery)

Voice & Conversational AI

  • Design and integrate:
      • Automatic Speech Recognition (ASR)
      • Natural Language Understanding / LLMs
      • Dialogue management & conversation state
      • Text-to-Speech (TTS)
  • Build streaming voice pipelines with sub-second response times
  • Enable multi-turn, interruptible, natural conversations

Model & Inference Engineering

  • Deploy and optimize local LLMs and speech models (quantization, batching, caching)
  • Select and fine-tune open-source models for voice use cases
  • Implement efficient inference using TensorRT, ONNX, CUDA, vLLM, Triton, or similar

Infrastructure & Production

  • Design GPU-based inference clusters (bare metal or Kubernetes)
  • Implement autoscaling, load balancing, and GPU scheduling
  • Establish monitoring, logging, and performance metrics for voice agents
  • Ensure security, privacy, and data isolation for local deployments

Leadership & Collaboration

  • Set architectural standards and best practices
  • Mentor ML and platform engineers
  • Collaborate with product, infra, and applied research teams
  • Drive decisions from prototype → production → scale

Required Qualifications

Technical Skills

  • 7+ years in software / ML systems engineering
  • 3+ years designing production AI systems
  • Strong experience with real-time voice or conversational AI systems
  • Deep understanding of LLMs, ASR, and TTS pipelines
  • Hands-on experience with GPU inference optimization
  • Strong Python and/or C++ background
  • Experience with Linux, Docker, Kubernetes

AI & ML Expertise

  • Experience deploying open-source LLMs locally
  • Knowledge of model optimization:
      • Quantization
      • Batching
      • Streaming inference
  • Familiarity with voice models (e.g., Whisper-like ASR, neural TTS)

Systems & Scaling

  • Experience with high-QPS, low-latency systems
  • Knowledge of distributed systems and microservices
  • Understanding of edge or on-prem AI deployments

Preferred Qualifications

  • Experience building AI voice agents or call automation systems
  • Background in speech processing or audio ML
  • Experience with telephony, WebRTC, SIP, or streaming audio
  • Familiarity with Triton Inference Server / vLLM
  • Prior experience as Tech Lead or Principal Engineer

What We Offer

  • Opportunity to architect state-of-the-art AI voice systems
  • Work on real-world, high-scale production deployments
  • Competitive compensation and equity (if applicable)
  • High ownership and technical influence
  • Collaboration with top-tier AI and infrastructure talent
VMax eSolutions India Pvt Ltd
at VMax eSolutions India Pvt Ltd
Posted by nikhil g
Hyderabad
6 - 10 yrs
₹30L - ₹35L / yr
Artificial Intelligence (AI)
AI Agents
Voice recognition
Generative AI
Machine Learning (ML)

Company Description


VMax e-Solutions India Private Limited, based in Hyderabad, is a dynamic organization specializing in Open Source ERP Product Development and Mobility Solutions. As an ISO 9001:2015 and ISO 27001:2013 certified company, VMax is dedicated to delivering tailor-made and scalable products, with a strong focus on e-Governance projects across multiple states in India. The company's innovative technologies aim to solve real-life problems and enhance the daily services accessed by millions of citizens. With a culture of continuous learning and growth, VMax provides its team members opportunities to develop expertise, take ownership, and grow their careers through challenging and impactful work.


About the Role


We’re hiring a Senior Data Scientist with deep real-time voice AI experience and strong backend engineering skills.


You’ll own and scale our end-to-end voice agent pipeline that powers AI SDRs, customer support agents, and internal automation agents on calls. This is a hands-on, highly technical role where you’ll design and optimize low-latency, high-reliability voice systems.


You’ll work closely with our founders, product, and platform teams, with significant ownership over architecture and benchmarks.


What You’ll Do


1. Own the voice stack end-to-end – from telephony / WebRTC entrypoints to STT, turn-taking, LLM reasoning, and TTS back to the caller.


2. Design for real-time – architect and optimize streaming pipelines for sub-second latency, barge-in, interruptions, and graceful recovery on bad networks.


3. Integrate and tune models – evaluate, select, and integrate STT/TTS/LLM/VAD providers (and self-hosted models) for different use-cases, balancing quality, speed, and cost.


4. Build orchestration & tooling – implement agent orchestration logic, evaluation frameworks, call simulators, and dashboards for latency, quality, and reliability.


5. Harden for production – ensure high availability, observability, and robust fault-tolerance for thousands of concurrent calls in customer VPCs.


6. Shape the voice roadmap – influence how voice fits into our broader Agentic OS vision (simulation, analytics, multi-agent collaboration, etc.).


You’re a Great Fit If You Have


1. 6+ years of software engineering experience (backend or full-stack) in production systems.


2. Strong experience building real-time voice agents or similar systems using:


   • STT / ASR (e.g. Whisper, Deepgram, Assembly, AWS Transcribe, GCP Speech)
   • TTS (e.g. ElevenLabs, PlayHT, AWS Polly, Azure Neural TTS)
   • VAD / turn-taking and streaming audio pipelines
   • LLMs (e.g. OpenAI, Anthropic, Gemini, local models)


3. Proven track record designing and operating low-latency, high-throughput streaming systems (WebRTC, gRPC, websockets, Kafka, etc.).


4. Hands-on experience integrating ML models into live, user-facing applications with real-time inference & monitoring.


5. Solid backend skills with Python and TypeScript/Node.js; strong fundamentals in distributed systems, concurrency, and performance optimization.


6. Experience with cloud infrastructure – especially AWS (EKS, ECS, Lambda, SQS/Kafka, API Gateway, load balancers).


7. Comfortable working in Kubernetes / Docker environments, including logging, metrics, and alerting.


8. Startup DNA – at least 2 years in an early or mid-stage startup where you shipped fast, owned outcomes, and worked close to the customer.


Nice to Have


1. Experience self-hosting AI models (ASR / TTS / LLMs) and optimizing them for latency, cost, and reliability.


2. Telephony integration experience (e.g. Twilio, Vonage, Aircall, SignalWire, or similar).


3. Experience with evaluation frameworks for conversational agents (call quality scoring, hallucination checks, compliance rules, etc.).


4. Background in speech processing, signal processing, or dialog systems.


5. Experience deploying into enterprise VPC / on-prem environments and working with security/compliance constraints.


Similar companies

Codebrahma Technologies Pvt. Ltd.

https://codebrahma.com
Founded: 2012
Type: Products & Services
Size: 20-100
Stage: Bootstrapped

About the company

Do you want to deliver code for a Y Combinator-funded startup?

Do you want to build world-class applications used by millions of people? 

Do you want to grow along with a fast-growing company? 

If YES, Codebrahma is THE place for you! 

Codebrahma is a software boutique based out of Ascendas ITPL, Bangalore.

 

We have been technology partners for some of the most exciting startups in the world, including 5 Y Combinator-funded startups. Most of the companies that have worked with us have gone on to raise major rounds of funding and disrupt their spaces.

Now that you are all excited about what we do, we are looking for amazing developers!

Jobs: 2

Incubyte

https://incubyte.co
Founded: 2020
Type: Services
Size: 20-100
Stage: Bootstrapped

About the company

Who we are

We are Software Craftspeople. We are proud of the way we work and the code we write. We embrace and evangelize eXtreme Programming practices. We strongly believe in being a DevOps organization, where developers own the entire release cycle and thus own quality. And most importantly, we never stop learning!

We work with product organizations to help them scale or modernize their legacy technology solutions. We work with startups to help them operationalize their ideas efficiently. We work with large established institutions to help them create internal applications to automate manual operations and achieve scale.

We design the software, the team, and the organizational strategy required to successfully release robust and scalable products. Incubyte strives to find people who are passionate about coding, learning, and growing along with us. We work with a limited number of clients at a time on dedicated, long-term commitments, with the aim of bringing a product mindset into services. More on our website: https://www.incubyte.co/

Join our team! We’re always looking for like-minded people!

Jobs: 10

ICloudEMS

https://icloudems.com
Founded: 2010
Type: Product
Size: 20-100
Stage: Profitable

About the company


Jobs: 6

Fortune Consultancy

https://fortuneconsultancy.in
Founded: 2006
Type: Services
Size: 10-50
Stage: Profitable

About the company

Fortune Consultancy is "founded for solutions!" We offer solutions for all kinds of Human Resource issues an organization may face, and Recruitment Solutions in particular. We also offer Staffing, Training & Development, and PR & Liaison solutions.

Jobs: 3

Automate Accounts

https://automateaccounts.com
Founded: 2015
Type: Services
Size: 0-20
Stage: Bootstrapped

About the company

Automate Accounts is a technology-driven company dedicated to building intelligent automation solutions that streamline business operations and boost efficiency. We leverage modern platforms and tools to help businesses transform their workflows with cutting-edge solutions.

Jobs: 2

Founded: 2021
Type: Products & Services
Size: 10-50
Stage: Bootstrapped

About the company

BPO Hirings

Jobs: 10

Founded: 2025
Type: Services
Size: 0-10
Stage: Bootstrapped

About the company

Peak Hire Solutions is a leading Recruitment Firm that provides our clients with innovative IT / Non-IT Recruitment Solutions. We pride ourselves on our creativity, quality, and professionalism. Join our team and be a part of shaping the future of Recruitment.

Jobs: 120

ZestFindz Private Limited

https://zestfindz.com
Founded: 2025
Type: Products & Services
Size: 0-20
Stage: Bootstrapped

About the company

ZestFindz Private Limited is a Hyderabad-based startup founded in February 2025.

We simplify online retail by offering a curated marketplace for everyday essentials, fashion, home goods, skincare, and more, backed by powerful seller tools. Our goal: make selling and shopping seamless with solid tech, transparent operations, and customer-first design.

Jobs: 0

CodeDecoders

https://codedecoders.io
Founded: 2023
Type: Products & Services
Size: 0-20
Stage: Profitable

About the company

We are an award-winning software agency that specializes in crafting cutting-edge solutions for a variety of industries.


Our expertise spans a wide range of areas, including Full Stack development, Blockchain Technology, and Game Development.

Jobs: 1

Stairio

https://stairio.com
Founded: 2025
Type: Services
Size: 0-20
Stage: Bootstrapped

About the company

Jobs: 1
