
We are seeking an experienced AI Architect to design, build, and scale production-ready AI voice conversation agents deployed locally (on-prem / edge / private cloud) and optimized for GPU-accelerated, high-throughput environments.
You will own the end-to-end architecture of real-time voice systems, including speech recognition, LLM orchestration, dialog management, speech synthesis, and low-latency streaming pipelines—designed for reliability, scalability, and cost efficiency.
This role is highly hands-on and strategic, bridging research, engineering, and production infrastructure.
Key Responsibilities
Architecture & System Design
- Design low-latency, real-time voice agent architectures for local/on-prem deployment
- Define scalable architectures for ASR → LLM → TTS pipelines
- Optimize systems for GPU utilization, concurrency, and throughput
- Architect fault-tolerant, production-grade voice systems (HA, monitoring, recovery)
Voice & Conversational AI
- Design and integrate:
  - Automatic Speech Recognition (ASR)
  - Natural Language Understanding / LLMs
  - Dialogue management & conversation state
  - Text-to-Speech (TTS)
- Build streaming voice pipelines with sub-second response times
- Enable multi-turn, interruptible, natural conversations
Model & Inference Engineering
- Deploy and optimize local LLMs and speech models (quantization, batching, caching)
- Select and fine-tune open-source models for voice use cases
- Implement efficient inference using TensorRT, ONNX, CUDA, vLLM, Triton, or similar
Infrastructure & Production
- Design GPU-based inference clusters (bare metal or Kubernetes)
- Implement autoscaling, load balancing, and GPU scheduling
- Establish monitoring, logging, and performance metrics for voice agents
- Ensure security, privacy, and data isolation for local deployments
Leadership & Collaboration
- Set architectural standards and best practices
- Mentor ML and platform engineers
- Collaborate with product, infra, and applied research teams
- Drive decisions from prototype → production → scale
Required Qualifications
Technical Skills
- 7+ years in software / ML systems engineering
- 3+ years designing production AI systems
- Strong experience with real-time voice or conversational AI systems
- Deep understanding of LLMs, ASR, and TTS pipelines
- Hands-on experience with GPU inference optimization
- Strong Python and/or C++ background
- Experience with Linux, Docker, Kubernetes
AI & ML Expertise
- Experience deploying open-source LLMs locally
- Knowledge of model optimization:
  - Quantization
  - Batching
  - Streaming inference
- Familiarity with voice models (e.g., Whisper-like ASR, neural TTS)
Systems & Scaling
- Experience with high-QPS, low-latency systems
- Knowledge of distributed systems and microservices
- Understanding of edge or on-prem AI deployments
Preferred Qualifications
- Experience building AI voice agents or call automation systems
- Background in speech processing or audio ML
- Experience with telephony, WebRTC, SIP, or streaming audio
- Familiarity with Triton Inference Server / vLLM
- Prior experience as Tech Lead or Principal Engineer
What We Offer
- Opportunity to architect state-of-the-art AI voice systems
- Work on real-world, high-scale production deployments
- Competitive compensation and equity (if applicable)
- High ownership and technical influence
- Collaboration with top-tier AI and infrastructure talent

About VMax eSolutions India Pvt Ltd
Similar jobs
We have an opening for an Inside Sales Engineer at a manufacturing company in Vikhroli, Mumbai.
Job Requirement :
4-6 years of experience as a Sales Engineer; freshers with a degree or diploma in Mechanical Engineering can also apply.
Job Role:
- Provide pre-sales technical assistance and product education to customers.
- Respond to inquiries and resolve issues promptly to ensure customer satisfaction.
- Collaborate with the sales team to develop and implement effective sales strategies.
- Identify opportunities for upselling and cross-selling our products and services.
- Utilize engineering knowledge to understand customer needs and recommend suitable products or solutions. Conduct product demonstrations and technical presentations.
- Prepare and deliver accurate and competitive sales quotations, proposals, and contracts. Ensure all customer communications are clear, professional, and effective.
- Build and maintain strong relationships with existing and potential customers. Act as the primary point of contact for customer inquiries and follow up on leads.
- Work closely with the engineering, marketing, manufacturing, accounts, and logistics teams to ensure alignment between customer needs and product offerings.
- Track and report on sales activities, including lead generation, customer interactions, and sales performance. Provide regular updates to senior management.
We are looking for a talented and passionate Python Engineer to join our team. As part of our Insights backend team, you will build new services and improve existing ones that power the Insights platform. This is a fast-paced role with high growth, visibility, and impact, where many of the decisions for new projects will be driven by you and your team from inception through production. If you are seeking an environment where you get to do meaningful work with other great engineers, then we want to hear from you!
Skills & Requirements
- At least 1 year of experience with Python and Django.
- Well versed in building the backend logic of web applications.
- Strong database skills.
- Solid foundation in designing and developing scalable APIs.
- Understanding of general web architecture.
- A Bachelor's, Master's, or PhD in Computer Science, Information Technology, Computer Engineering, or a related discipline, or equivalent experience.
- Hands-on experience with Django, Flask, or other Python frameworks.
- Basic understanding of front-end technologies, such as JavaScript, HTML5, and CSS3.
- Debugging applications to ensure low-latency and high-availability.
- Integrating user-facing elements with server-side logic.
- Implementing security and data protection.
Good-to-have skills:
- Excellent interpersonal, organizational, written communication, oral communication and listening skills.
- Ability to produce work estimates and provide input to managers on resource and risk planning.
- Ability to coordinate with stakeholders, manage timelines and escalations, and provide on-time status updates.
- Familiarity with some ORM (Object Relational Mapper) libraries.
- Web frameworks and RESTful APIs experience.
Salary: ₹600,000.00 - ₹700,000.00 per year
Job Type: Full-time
Benefits: Leave encashment, paid sick time
Schedule: Day shift
Supplemental pay types: Performance bonus
Ability to commute/relocate: Lucknow, Uttar Pradesh: Reliably commute or planning to relocate before starting work (Required)
Education: Bachelor's (Preferred)
- 2-5 years of experience in building API services using NodeJS Express and related frameworks
- Expert-level understanding of the NodeJS asynchronous runtime
- Expert-level understanding of JavaScript concepts such as callbacks and closures
- Experience with Postgres, NoSQL, Redis, and Firebase real-time database
- Experience with AWS services like Elastic Beanstalk, CloudFront, S3, EC2, Lambda, API Gateway, SQS, etc.
- Understanding of patterns and techniques for building scalable back-end infrastructure including caching, rate limiting, authentication, and authorization schemes.
- Experience in building highly scalable and high throughput services with millisecond response times
- Experience working in a collaborative team environment
- Excellent communication & interpersonal skills
- Willingness to learn and pick up new technology along with patience to mentor
Bonus skills -
- Experience with ElasticSearch, Puppeteer
- Experience writing unit tests
Nature of Work:
As this is an advanced-level position in the S/W development team, the individual is expected to:
Participate as a team member in all phases of the S/W lifecycle, including the analysis and design of S/W systems.
Participate in detailed coding, code walk-throughs, and unit testing of S/W modules.
Participate in integrated testing of product/ package.
Participate in difficult and typical coding assignments, with responsibility for a small module team of 3-5 members.
Participate in exploration/ feasibility study of products.
Have a thorough understanding of the assigned product/ project.
Participate in generating technical documentation of products/ packages.
Provide technical training to juniors.
Manage allocated resources and maintain the discipline and decorum of the organization.
Comply with systems and procedures.
Reports to:
Project Manager
Skill Set:
- Thorough knowledge of current technological trends in web-based software.
- Strong working knowledge of JSF/JSP, Servlets, Spring, web application development, and Core Java.
- Working knowledge of databases using EJB.
- Knowledge of bug-tracking tools such as Jira and Bugzilla, and of source code version control systems (SVN, Git).
- Knowledge of working in Scrum methodology.
- Good to have: knowledge of SonarQube and web security aspects.
- Knowledge of responsive front-end development using HTML5, JavaScript, CSS3, jQuery, Ajax, and JSON.
- Should be able to write test cases for the feature.
- Ability to gather and analyze data and draw logical conclusions.
- Understanding of the company's vision and goals, business operations, and market.
- Clear and concise oral and written communication skills.
- Ability to establish and maintain effective work relationships at all levels.
- Great passion for S/W development.
- Ability to mentor and guide juniors.
Office Address:
Tech Mahindra Ltd
Empire Tower, A Wing 9th floor - NB 902, Gut no. 31, Cloud City campus, Village Elthan, Thane – Belapur road, Airoli (E), Navi Mumbai - 400708 (Maharashtra), India
Node.js Developer Responsibilities:
- Developing and maintaining all server-side network components.
- Ensuring optimal performance of the central database and responsiveness to front-end requests.
- Collaborating with front-end developers on the integration of elements.
- Designing back-end services for various business processes.
- Developing high-performance applications by writing testable, reusable, and efficient code.
- Implementing effective security protocols, data protection measures, and storage solutions.
- Running diagnostic tests, repairing defects, and providing technical support.
- Documenting Node.js processes, including database schemas, as well as preparing reports.
- Recommending and implementing improvements to processes and technologies.
- Keeping informed of advancements in the field of Node.js development.
Node.js Developer Requirements:
- Bachelor's degree in computer science, information science, or similar.
- At least two years of experience as a Node.js developer.
- Extensive knowledge of JavaScript, web stacks, libraries, and frameworks.
- Knowledge of front-end technologies such as HTML5 and CSS3.
- Superb interpersonal, communication, and collaboration skills.
- Exceptional analytical and problem-solving aptitude.
- Great organizational and time management skills.
- Availability to resolve urgent web application issues outside of business hours.
- Familiarity with CI/CD processes.
Graphic Designer Requirements:
Bachelor’s degree in graphic design or a related field.
Experience as a graphic designer or in a related field.
Demonstrable graphic design skills with a strong portfolio.
Proficiency with required desktop publishing tools, including Photoshop, InDesign and Illustrator.
A strong eye for visual composition.
Effective time management skills and the ability to meet deadlines.
Able to give and receive constructive criticism.
Understanding of marketing, production, website design, corporate identity, product packaging, advertisements, and multimedia design.
Experience with computer-aided design.
Strong internet and computer
Roles And Responsibilities:
Planning concepts by studying relevant information and materials.
Illustrating concepts by designing examples of art arrangement, size, type size and style and submitting them for approval.
Preparing finished art by operating necessary equipment and software.
Coordinating with outside agencies, art services, web designer, marketing, printers, and colleagues as necessary.
Contributing to team efforts by accomplishing tasks as needed.
Communicating with clients about layout and design.
Creating a wide range of graphics and layouts for product illustrations, company logos, and websites with software such as Photoshop.
Reviewing final layouts and suggesting improvements when necessary.







