
form :https://forms.gle/ncGqEJrJDvEDhXtL7
AI Researcher, Speech & Audio, Intern -
Internship Opportunity at JoshTalks AI Lab
(ai.joshtalks.com)
Location: Gurgaon, India
Type: Full-time Internship (6–12 months)
Who: Final-year engineering students or recent graduates passionate about AI/ML
in speech
About Us
At JoshTalks AI Lab, we believe that voice will be the primary medium of interaction
between man and machine. Our mission is simple yet ambitious:
● Help machines talk like humans.
● Build the benchmarks and datasets that become the backbone of global
progress in speech AI.
● Drive improvements not just through compute or algorithms — but through
high-quality, diverse, real-world data.
Our datasets today power some of the largest and most widely used speech
models in the world (you’ve definitely used them, even if we can’t name them
😉).
What You’ll Work On
This is not a “just another internship.” You’ll be directly contributing to the global
race to perfect speech AI:
1. Benchmarking the world’s speech models
○ Design and run evaluations for ASR and speech-to-speech systems.
○ Create benchmarks that will guide top AI labs on where their models
fail and where they shine.
2. Modeling & Fine-Tuning
○ Fine-tune speech recognition systems (like Whisper/wav2vec2) to push
Word Error Rates toward ~5%.
○ Experiment with multilingual, code-switched, and noisy speech
to mimic real-world conditions.
3. Impact at Scale
○ Your work won’t just sit in a paper. It will influence how the
world’s largest AI models get built, tested, and improved.
Who We’re Looking For
● Final-year undergraduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related fields.
● Strong interest in speech, audio, NLP, or multimodal AI.
● Hands-on experience in one or more of:
○ Fine-tuning speech or language models (Whisper, wav2vec2,
HuBERT, SER, etc.)
○ Building speech-driven projects (assistants, classifiers, chatbots,
SER systems)
○ Working with PyTorch, TensorFlow, or Hugging Face transformers.
● Bonus: past projects on GitHub, Kaggle, or research papers.
Why Join Us
● Ownership: Even as a final-year student, you’ll get the chance to own
problems of global importance — from reducing ASR word error rates toward
5% to building benchmarks that influence how the next generation of
speech-to-speech models are developed. These are not side projects:
the problems you’ll work on may define how billions of people interact
with machines in the future.
● Front-row seat in speech AI: Your work will shape benchmarks and datasets
used by the world’s top model labs.
● Learning: Work with experts solving speech challenges across 20+
Indian languages and noisy, real-world audio.
● Impactful projects: The benchmarks and models you help build will
set direction for global AI progress.
● Startup energy, global scale: Small team, big impact — perfect for ambitious
builders.
● Co-Authorship: If any of the work you contribute to is published as a paper,
benchmark report, or dataset release, you will be credited as a co-author.
This means your contributions won’t just stay inside the lab — they’ll be
visible to the wider research community and part of the academic and
industry record.
Details
● Location: Gurgaon (on-site preferred for collaboration)
● Duration: 6–12 months
● Type: Paid Internship (full-time)
● Start Date: Flexible for final-year students (aligns with academic calendar)
If you’re someone who dreams of making speech AI as natural as human
conversation, this is your chance to work on the real frontier. Super interested?

About Josh Talks
About
Company video


Connect with the team
Similar jobs
Technical Expertise
- Advanced proficiency in Python
- Expertise in Deep Learning Frameworks: PyTorch and TensorFlow
- Experience with Computer Vision Models:
- YOLO (Object Detection)
- UNet, Mask R-CNN (Segmentation)
- Deep SORT (Object Tracking)
Real-Time & Deployment Skills
- Real-time video analytics and inference optimization
- Model pipeline development using:
- Docker
- Git
- MLflow or similar tools
- Image processing proficiency: OpenCV, NumPy
- Deployment experience on Linux-based GPU systems and edge devices (Jetson Nano, Google Coral, etc.)
Professional Background
- Minimum 4+ years of experience in AI/ML, with a strong focus on Computer Vision and System-Level Design
- Educational qualification: B.E./B.Tech/M.Tech in Computer Science, Electrical Engineering, or a related field
- Strong project portfolio or experience in production-level deployments
The Role
We’re hiring a Senior AI/ML Engineer with deep expertise in computer vision, generative AI, and production-grade ML systems. This is a 100% hands-on individual contributor role where you will build the AI engines behind the platform — automated image processing, generative content creation, intelligent workflows, and large-scale ML pipelines.
You will work across computer vision, generative models, automation, and ML infrastructure to deliver production-ready AI systems.
What You’ll Build
1. Computer Vision & Image Understanding
- Product image analysis, object detection, and segmentation
- Automated background removal, image enhancement, and preprocessing
- Classification, attribute extraction, and visual search systems
- Quality assessment and edge-case detection models
- Depth estimation and scene understanding from 2D images
- Real-time object detection for AR try-on
- Multi-view image analysis and camera pose estimation
2. Generative AI & Content Creation
- Fine-tune generative models for visual and marketing asset creation
- Build text-to-image and image-to-image model pipelines
- Generate AI-based product descriptions, tags, and metadata
- Work with diffusion models, GANs, and transformer architectures
- Develop texture generation, style transfer, and image editing tools
- Build synthetic data generation pipelines
- Experiment with the latest foundation models and diffusion techniques
3. Intelligent Automation & ML Systems
- Build end-to-end automation pipelines for large-scale product catalog processing
- Develop recommendation and personalization models
- Implement automated workflows for quality control, moderation, and validation
- Build predictive models for engagement and conversion
- Implement anomaly detection and platform monitoring
- Design continuous learning and self-improving systems
4. Production ML Infrastructure
- Deploy and optimize ML models on AWS
- Build scalable inference pipelines with low latency and high throughput
- Implement model versioning, A/B testing, and CI/CD for ML
- Create data pipelines for annotation, augmentation, and quality control
- Optimize models for speed, efficiency, and cost
- Build monitoring systems for model drift, quality, and performance
- Develop APIs and microservices for ML model serving
5. Research & Innovation
- Explore the latest AI/ML trends and emerging models
- Prototype rapidly using state-of-the-art models such as GPT-4V, Diffusion, and SAM
- Integrate open-source tools into the production stack
- Run feasibility experiments and influence model architecture decisions
- Document learnings and share insights internally
Technical Stack
AI / ML Frameworks
- PyTorch, TensorFlow, Hugging Face
- OpenCV, YOLO, Detectron2
- Stable Diffusion, ControlNet, Diffusers
- scikit-learn, XGBoost
Deployment & Infrastructure
- FastAPI, ONNX, TorchScript, TensorRT
- AWS services including SageMaker, Lambda, EC2, and S3
- Docker and Kubernetes
- PostgreSQL, Redis, MongoDB, Pinecone
Languages & APIs
- Python (primary)
- JavaScript / Node.js (working knowledge)
- REST, GraphQL, WebSocket APIs
Nice to Have (3D / Graphics)
- Understanding of rendering pipelines
- Familiarity with glTF or USDZ formats
- Experience with Three.js or Unity / Unreal
What We’re Looking For
Must-Haves
- 5–8+ years of experience in AI/ML with a strong computer vision background
- Deep expertise in PyTorch or TensorFlow
- Experience deploying production ML systems
- Strong understanding of CNNs, transformers, detection, and segmentation models
- Hands-on experience with diffusion models or GANs
- Strong Python skills and ML system design experience
- Cloud experience with AWS, GCP, or Azure
- Proven track record of shipping ML-powered products
- Passion for experimenting with emerging AI models
Highly Desirable
- Experience in e-commerce, retail imaging, or content pipelines
- Background in automation and intelligent workflow systems
- Experience with recommendation or personalization systems
- Familiarity with multimodal models combining vision and language
- Experience with neural rendering or 3D generation
- Open-source contributions or research publications
- Experience optimizing real-time inference
- Strong understanding of MLOps practices
- Ability to build end-to-end ML-driven product features
Problems You’ll Solve
- Automating large-scale product image processing
- Generating high-quality product visuals at scale
- Extracting structured attributes from image datasets
- Reducing manual processes through intelligent automation
- Optimizing inference speed and infrastructure cost
- Personalizing user experiences using ML
- Monitoring and evaluating ML models reliably in production
Why Ctruh
- Work across computer vision, generative AI, automation, and NLP
- Use cutting-edge models and a modern AI stack
- High-impact role influencing millions of shoppers
- Culture focused on experimentation and rapid iteration
- Strong learning environment with research exposure
- Small engineering team with high ownership and visibility
- Supported by Microsoft, NVIDIA, Google, and AWS
Location & Culture
- Location: Bengaluru
- Schedule: 6 days a week (5 days in office, Saturdays work-from-home)
- Culture: Fast-paced, experimentation-driven, and execution-focused
- Team: Highly skilled engineers focused on impact
- Resources: GPU compute, modern tooling, and research access
The Ideal Candidate
You are an AI builder who enjoys experimenting and shipping real systems. You have worked with diffusion models, vision transformers, SAM, GPT-4 Vision, and similar technologies. You are comfortable switching between computer vision, generative models, automation systems, and production ML pipelines.
You care about impact — building models that improve real product experiences. You move quickly, choose the right tools for the problem, prototype rapidly, and optimize systems for production reliability.
You thrive in environments where you can innovate continuously and own AI systems end-to-end.
Urgent Hiring !!!
We are looking for Flutter Developer,
1+ years of experience in developing native and cross-platform mobile applications
Knowledge of Flutter SDK, Getex, Provider, Android Studio and IntelliJ, Visual Studio Code,
SQLite, MySQL, PostgreSQL databases
REST APIs
Experience with Git and Jira
Familiarity with Agile development approaches
Experience: 1 to 2 years
Location: Surat (full-time onsite only)
Thanks!
Company
Egregore Labs (http://www.egregorelabs.com/" target="_blank">www.egregorelabs.com) is a financial software company founded in 2017 by Prashant Vijay (ISB, Tulane) & Hari Balaji (IIM Ahmedabad, IIT Madras) both of whom have spent over a decade each in Financial Services, with a majority of their experience at Goldman Sachs across New York, Hong Kong & Singapore in roles across Trading, Quant & Technology.
Opportunity
We are looking for an experienced fullstack engineer with front-end development experience to join our team.
We will share our workload as a team and we expect you to work on a broad range of tasks.
Here’s are some of the things you might have to do on any given day:
- Implement responsive and performant UIs with user centered approach with frontend technologies including Angular 2, Javascript(ES 6), Typescript, SCSS, etc
- Build back-end REST APIs on Python 3 based server frameworks for deployment and scaling of our product(s)
- Write meaningful test cases for frontend & backend platforms
- Integrate our products with 3rd party products/tools/services
- Develop Infrastructure for delivering services using a performance driven approach, build databases, schedule automated jobs, etc
Ideal Background / Experience
- At least 24 months of diverse experience in web development for product or services oriented environment with exposure to working production deployments
- Expertise in programming using Python/Javascript or similar scripting languages
- In-depth exposure to technologies used in web-based SaaS products, including REST APIs
- Sound understanding of Postgres and NoSQL databases such as MongoDB
Nice to have exposure to any of
- AWS
- Azure
- ELK
- Object Relational Models (SQLAlchemy, etc)
- Google APIs
- Microservices Architecture Pattern
- NodeJS / ExpressJS
Desirables
We are looking for a person who has
Resourcefulness - we're looking for versatile developers who are good at figuring out what they need to use, learn, build, re-purpose to get the job done quickly and efficiently.
Ownership - We like to be directive and not prescriptive in our management. We’d love for you to take ownership of what you work on, and tell us what to do, rather than the other way round.
Work Ethic - We’ve grown up on Wall Street. We work hard, and have aggressive goals. We want our team-mates to be focused, goal-oriented and consistent high achievers.
Execution Focus - Our business is about getting things done, and getting things done right. We want outcome focused colleagues who can multi-task, and execute quickly and elegantly.
What else you need to know
We are an early stage company. Working here is not for the faint-hearted. An intense and unstructured work environment, lots of excitement and a group of motivated colleagues is what we bring to the table. We ask you to bring your undivided attention, strong worth ethic & resourcefulness. We are Delhi based and work 6 days a week. We operate in a Python environment.
Roles and Responsibility:
· Develop web applications in Node (JavaScript/Typescript),
· Design and develop backend APIs for complex custom business applications as per requirement
· Design and develop database schema, queries, stored procedures
· Develop frontend SPA using AngularJS/Angular 2+, API integration, Data binding
· Collaboration with developer team, Project managers to ideate software solutions
· Essential communication skills for customer conference calls and meetings
· Interact with clients and other stakeholders to understand their requirements/problems, provide daily updates, plan and module delivery
· Test software to ensure quality and efficiency
· Responsible to troubleshoot bugs and fix them as well as maintain/enhance existing projects
· Write technical documentation
· Working with Agile and Scrum methodologies
· Extensive knowledge of database performance optimization strategies, indexing, sharing
· Develop applications using TDD (Test Driven Development), Unit testing, Integration testing, Jasmine/Mocha framework
· Design HLD/LLD architecture diagrams, Infrastructure diagrams, ER Diagrams
- You have a minimum of 6 years of experience building high-performance consumer-facing mobile applications at Product companies of a decent scale.
- You have founded/worked in a start-up and have worked in an enterprise environment You have a keen eye for mobile architecture and have led/participated in architectural discussions.
- You have a passion for mentoring and helping people on your team grow and achieve their goals and work with cross teams.
- You practice test-driven development and you are able to drive agile practices.
- You have worked with multiple languages/frameworks and have expertise in any one programming language/framework/stack.
- You have published reusable packages.
- You have worked with building automation, devise farms, multi-target testing.
- You are able to optimize the application for performance and speed.
- You are an excellent collaborator & communicator.
- You know that start-ups are a team sport.
- You listen to others, aren’t afraid to speak your mind and always try to ask the right questions.
- You are excited by the prospect of working in a distributed team and company
Tech Stack
- HTML5 (DOM)
- CSS 3
- Javascript (E6)/Typescript
- React Native/React/Angular/NativeScript/Vue
- In-depth knowledge of how frameworks works
- GraphQL
- Javascript ecosystem
- Linters
- Code formatters
- Transpilers
- Bundlers
- Testing tools(Jest, Enzyme)
- You are comfortable with caching, performance optimization, etc
- You are comfortable with native mobile development
- Java/Kotlin/Android
- Swift/iOs
Are you passionate about using technology to make people's lives better? Are you interested in becoming a part of one of the hottest trends in the world of start-ups today? Are you excited about joining the online ultra-fast grocery delivery service business pioneer and driving the trend forward? Then this may be the right opportunity for you.
Role and Responsibilities- Create detailed, and well-structured test plans and test cases. Estimate, prioritize and plan testing activities to meet project deadlines. Ability to work efficiently under pressure.
- Facilitate end-to-end mobile, working directly with developers.
- Expert knowledge of QA methodologies and software life cycle, previous working experience in Agile environment is preferred.
- Review requirements specifications, attend sprint grooming and planning meetings to provide timely and meaningful feedback wherever applicable
- Highly effective collaborator and experience working across multiple departments such as Engineering, Product, Client Success, etc.
- Ability to troubleshoot issues and help developers, product or client success.
- Participate in release management; available for testing for code releases and smoking testing, etc.
- Continuous improvement - contribute in development, improvement, and maintenance of the testing processes and structures
Requirements
- BTech and/or MTech in Engineering or related fields
- 2+ years of experience using or creating shared test framework
- Ability to create and maintain functional tests within fast-moving Agile delivery
- Experience using test case management system like TestRail, Test Complete or Test Lodge
- Experience using JIRA and, Atlassian suite
- Excellent communication and documentation skills
- Solid knowledge of Continuous Integration
- Proficiency in analyzing requirements, functional specs, and technical specs
- Ability to effectively communicate technical solutions and innovative ideas
- Strong quality mindset, great team spirit and being detail oriented
- Able to work in a very dynamic workplace
Who You Are
- Passionate about technology and making an impact.
- A perpetual learner, who stretches their boundaries and enjoys new ideas.
- A doer who takes initiative and works well in a team.








