
SDE 2 / SDE 3 – AI Infrastructure & LLM Systems Engineer
Location: Pune / Bangalore (India)
Experience: 4–8 years
Compensation: no bar for the right candidate
Bonus: Up to 10% of base
About the Company
AbleCredit builds production-grade AI systems for BFSI enterprises, reducing OPEX by up to 70% across onboarding, credit, collections, and claims.
We run our own LLMs on GPUs, operate high-concurrency inference systems, and build AI workflows that must scale reliably under real enterprise traffic.
Role Summary (What We’re Really Hiring For)
We are looking for a strong backend / systems engineer who can:
- Deploy AI models on GPUs
- Expose them via APIs
- Scale inference under high parallel load using async systems and queues
This is not a prompt-engineering or UI-AI role.
Core Responsibilities
- Deploy and operate LLMs on GPU infrastructure (cloud or on-prem).
- Run inference servers such as vLLM / TGI / SGLang / Triton or equivalents.
- Build FastAPI / gRPC APIs on top of AI models.
- Design async, queue-based execution for AI workflows (fan-out, retries, backpressure).
- Plan and reason about capacity & scaling:
- GPU count vs RPS
- batching vs latency
- cost vs throughput
- Add observability around latency, GPU usage, queue depth, failures.
- Work closely with AI researchers to productionize models safely.
Must-Have Skills
- Strong backend engineering fundamentals (distributed systems, async workflows).
- Hands-on experience running GPU workloads in production.
- Proficiency in Python (Golang acceptable).
- Experience with Docker + Kubernetes (or equivalent).
- Practical knowledge of queues / workers (Redis, Kafka, SQS, Celery, Temporal, etc.).
- Ability to reason quantitatively about performance, reliability, and cost.
Strong Signals (Recruiter Screening Clues)
Look for candidates who have:
- Personally deployed models on GPUs
- Debugged GPU memory / latency / throughput issues
- Scaled compute-heavy backends under load
- Designed async systems instead of blocking APIs
Nice to Have
- Familiarity with LangChain / LlamaIndex (as infra layers, not just usage).
- Experience with vector DBs (Qdrant, Pinecone, Weaviate).
- Prior work on multi-tenant enterprise systems.
Not a Fit If
- Only experience is calling OpenAI / Anthropic APIs.
- Primarily a prompt engineer or frontend-focused AI dev.
- No hands-on ownership of infra, scaling, or production reliability.

About AbleCredit
About
AI that writes Credit Reports on its own !!
AbleCredit is a friendly, supportive credit assistant. It generates Credit Appraisal Memos based on your policies, without any human intervention.
Candid answers by the company
AI that writes Credit Reports on its own !!
AbleCredit is your friendly, supportive credit assistant. It generates Credit Appraisal Memos based on your policies, without any human intervention.
Similar jobs
Job Title : Principal Backend Engineer (AI-Driven)
Experience : 10+ Years
Location : Chandigarh
Tech Stack : PHP, Node.js, Laravel, MySQL, MongoDB
Additional Requirement : Hands-on experience with AI technologies, APIs, or ML integrations
Role Overview :
We're looking for a Principal Backend Engineer (AI-Driven) to design and lead scalable backend systems while driving AI adoption across products.
The role involves integrating AI-powered features, architecting intelligent systems, and mentoring engineering teams on modern backend and AI implementation.
Key Responsibilities :
- Design and lead backend architecture using PHP (Laravel/CodeIgniter) and Node.js
- Build scalable microservices / modular backend systems
- Develop APIs and backend workflows for AI-driven features
- Integrate AI APIs (OpenAI, LangChain or similar frameworks)
- Work with LLMs, embeddings, vector databases, and AI pipelines
- Ensure performance, scalability, and security of backend systems
- Mentor engineering teams and drive backend + AI best practices
Requirements :
- 10+ years of backend development experience
- Strong expertise in PHP / Node.js, MySQL, MongoDB
- Hands-on experience integrating AI/ML APIs or AI-powered features
- Strong system design and architecture skills
- Experience leading engineering teams
Good to Have :
- Prompt engineering or AI cost optimization
- Exposure to MLOps / ML pipelines
Intern (GenAI - Python) - 3 Months Unpaid Internship
Job Title: GenAI Intern (Python) - 3 Months Internship (Unpaid)
Location: Ahmedabad (On-Site)
Duration: 3 Months
Stipend: Unpaid Internship
Company: Softcolon Technologies
About the Internship:
Softcolon Technologies is seeking a dedicated GenAI Intern who is eager to delve into real-world AI applications. This internship provides hands-on experience in Generative AI development, focusing on RAG systems and AI Agents. It is an ideal opportunity for individuals looking to enhance their skills in Python-based AI development through practical project involvement.
Eligibility:
- Freshers or currently pursuing BE (IT/CE) or related field
- Strong interest in Generative AI and real-world AI product development
Required Skills (Must Have):
- Basic knowledge of Python
- Basic understanding of Python frameworks like FastAPI and basic Django
- Familiarity with APIs and JSON
- Submission of resume, GitHub Profile/Project Portfolio, and any AI/Python project links
What You Will Learn (Internship Goals):
You will gain hands-on experience in:
- Fundamentals of Generative AI (GenAI)
- Building RAG (Retrieval-Augmented Generation) applications
- Working with Vector Databases and embeddings
- Creating AI Agents using Python
- Integrating LLMs such as OpenAI (GPT models), Claude, Gemini
- Prompt Engineering + AI workflow automation
- Building production-ready APIs using FastAPI
Responsibilities:
- Assist in developing GenAI-based applications using Python
- Support RAG pipeline implementation (embedding + search + response)
- Work on API integrations with OpenAI/Claude/Gemini
- Assist in building backend services using FastAPI
- Maintain project documentation and GitHub updates
- Collaborate with team members for tasks and daily progress updates
Selection Process:
- Resume + GitHub portfolio screening
- Short technical discussion (Python + basics of APIs)
- Final selection by the team
Why Join Us?
- Practical experience in GenAI through real projects
- Mentorship from experienced developers
- Opportunity to work on portfolio-level projects
- Certificate + recommendation (based on performance)
- Potential for a paid role post-internship (based on performance)
How to Apply:
Share your resume and GitHub portfolio link via:
Role
We are looking for a Full Stack Engineer who can own the entire technical stack, design systems that scale, and ship products fast. You will work across frontend, backend, and AI systems, making key architectural decisions while building a product used by real users.
This role offers high ownership, where engineers move ideas to production quickly and take responsibility for both technical decisions and product impact.
What would you do?
- Build and own the end-to-end platform using React, Node.js microservices, Python AI agents, and AWS.
- Design and implement scalable system architecture, including caching, databases, and state management between AI and UI.
- Develop AI-powered backend services and orchestrate LLM workflows using modern frameworks.
- Build highly interactive front-end experiences using modern React and real-time communication tools.
- Define and maintain engineering best practices, including CI/CD pipelines, monorepo structures, and development workflows.
- Collaborate closely with users and product teams to identify problems and ship impactful solutions.
- Continuously simplify systems by removing unnecessary complexity and keeping architecture clean.
Who should apply?
- Engineers with 4+ years of experience building and shipping production-grade products.
- Strong understanding of system design, architecture, and scalable backend systems.
- Hands-on experience with Python (FastAPI, async systems) and LLM-based applications.
- Proficiency in JavaScript / TypeScript with Node.js and modern backend frameworks.
- Experience building modern frontend applications using React (React 18+).
- Familiarity with databases such as Redis, PostgreSQL, or MongoDB, and designing scalable APIs.
- Engineers comfortable working in fast-paced environments with high ownership and minimal process overhead.
Technical Skills
- Backend: Node.js, Express, Python, FastAPI
- Frontend: React (React 18+), interactive UI development
- AI/LLM Systems: LLM orchestration, multi-model integrations
- Databases: Redis, PostgreSQL, MongoDB
- Infrastructure: AWS, CI/CD pipelines, microservices architecture
- Real-time Systems: Socket.IO, Server-Sent Events (SSE)
Company: Grey Chain AI
Location: Remote
Experience: 7+ Years
Employment Type: Full Time
About the Role
We are looking for a Senior Python AI Engineer who will lead the design, development, and delivery of production-grade GenAI and agentic AI solutions for global clients. This role requires a strong Python engineering background, experience working with foreign enterprise clients, and the ability to own delivery, guide teams, and build scalable AI systems.
You will work closely with product, engineering, and client stakeholders to deliver high-impact AI-driven platforms, intelligent agents, and LLM-powered systems.
Key Responsibilities
- Lead the design and development of Python-based AI systems, APIs, and microservices.
- Architect and build GenAI and agentic AI workflows using modern LLM frameworks.
- Own end-to-end delivery of AI projects for international clients, from requirement gathering to production deployment.
- Design and implement LLM pipelines, prompt workflows, and agent orchestration systems.
- Ensure reliability, scalability, and security of AI solutions in production.
- Mentor junior engineers and provide technical leadership to the team.
- Work closely with clients to understand business needs and translate them into robust AI solutions.
- Drive adoption of latest GenAI trends, tools, and best practices across projects.
Must-Have Technical Skills
- 7+ years of hands-on experience in Python development, building scalable backend systems.
- Strong experience with Python frameworks and libraries (FastAPI, Flask, Pydantic, SQLAlchemy, etc.).
- Solid experience working with LLMs and GenAI systems (OpenAI, Claude, Gemini, open-source models).
- Good experience with agentic AI frameworks such as LangChain, LangGraph, LlamaIndex, or similar.
- Experience designing multi-agent workflows, tool calling, and prompt pipelines.
- Strong understanding of REST APIs, microservices, and cloud-native architectures.
- Experience deploying AI solutions on AWS, Azure, or GCP.
- Knowledge of MLOps / LLMOps, model monitoring, evaluation, and logging.
- Proficiency with Git, CI/CD, and production deployment pipelines.
Leadership & Client-Facing Experience
- Proven experience leading engineering teams or acting as a technical lead.
- Strong experience working directly with foreign or enterprise clients.
- Ability to gather requirements, propose solutions, and own delivery outcomes.
- Comfortable presenting technical concepts to non-technical stakeholders.
What We Look For
- Excellent communication, comprehension, and presentation skills.
- High level of ownership, accountability, and reliability.
- Self-driven professional who can operate independently in a remote setup.
- Strong problem-solving mindset and attention to detail.
- Passion for GenAI, agentic systems, and emerging AI trends.
Why Grey Chain AI
Grey Chain AI is a Generative AI-as-a-Service and Digital Transformation company trusted by global brands such as UNICEF, BOSE, KFINTECH, WHO, and Fortune 500 companies. We build real-world, production-ready AI systems that drive business impact across industries like BFSI, Non-profits, Retail, and Consulting.
Here, you won’t just experiment with AI — you will build, deploy, and scale it for the real world.
About Asha Health
Asha Health helps medical practices launch their own AI clinics. We're backed by Y Combinator, General Catalyst, 186 Ventures, Reach Capital and many more. We recently raised an oversubscribed seed round from some of the best investors in Silicon Valley. Our team includes AI product leaders from companies like Google, physician executives from major health systems, and more.
About the Role
We're looking for a Software Engineer to join our engineering team in our Bangalore office. We're looking for someone who is an all-rounder, but has particularly exceptional backend engineering skills.
Our ideal candidate has built AI agents at the orchestration layer level and leveraged clever engineering techniques to improve latency & reliability for complex workflows.
We pay well above market for the country's best talent and provide a number of excellent perks.
Responsibilities
In this role, you will have the opportunity to build state-of-the-art AI agents, and learn what it takes to build an industry-leading multimodal, multi-agent suite.
You'll wear many hats. Your responsibilities will fall into 2 categories:
AI Engineering
- Develop AI agents with a high bar for reliability and performance.
- Build SOTA LLM-powered tools for providers, practices, and patients.
- Architect our data annotation, fine tuning, and RLHF workflows.
- Live on the bleeding-edge ensuring that every week, we have the most cutting edge agents as the industry evolves.
Full-Stack Engineering (80% backend, 20% frontend)
- Lead the team in designing scalable architecture to support performant web applications.
- Develop features end-to-end for our web applications with industry leading product and user experience (Typescript, nodeJS, python etc).
Perks of Working at Asha Health
#1 Build cool stuff: work on the latest, cutting-edge tech (build frontier AI agents with technologies that evolve every 2 weeks).
#2 Surround yourself with top talent: our team includes senior AI product leaders from companies like Google, experienced physician executives, and top 1% engineering talent (the best of the best).
#3 Rocketship trajectory: we get more customer interest than we have time to onboard, it's a good problem to have :)
About Asha Health
Asha Health helps medical practices launch their own AI clinics. We're backed by Y Combinator, General Catalyst, 186 Ventures, Reach Capital and many more. We recently raised an oversubscribed seed round from some of the best investors in Silicon Valley. Our team includes AI product leaders from companies like Google, physician executives from major health systems, and more.
About the Role
We're looking for a Software Engineer to join our engineering team in our Bangalore office. We're looking for someone who is an all-rounder, but has particularly exceptional backend engineering skills.
Our ideal candidate has built AI agents at the orchestration layer level and leveraged clever engineering techniques to improve latency & reliability for complex workflows.
We pay well above market for the country's best talent and provide a number of excellent perks.
Responsibilities
In this role, you will have the opportunity to build state-of-the-art AI agents, and learn what it takes to build an industry-leading multimodal, multi-agent suite.
You'll wear many hats. Your responsibilities will fall into 2 categories:
AI Engineering
- Develop AI agents with a high bar for reliability and performance.
- Build SOTA LLM-powered tools for providers, practices, and patients.
- Architect our data annotation, fine tuning, and RLHF workflows.
- Live on the bleeding-edge ensuring that every week, we have the most cutting edge agents as the industry evolves.
Full-Stack Engineering (80% backend, 20% frontend)
- Lead the team in designing scalable architecture to support performant web applications.
- Develop features end-to-end for our web applications with industry leading product and user experience (Typescript, nodeJS, python etc).
Perks of Working at Asha Health
#1 Build cool stuff: work on the latest, cutting-edge tech (build frontier AI agents with technologies that evolve every 2 weeks).
#2 Surround yourself with top talent: our team includes senior AI product leaders from companies like Google, experienced physician executives, and top 1% engineering talent (the best of the best).
#3 Rocketship trajectory: we get more customer interest than we have time to onboard, it's a good problem to have :)
Company Overview
McKinley Rice is not just a company; it's a dynamic community, the next evolutionary step in professional development. Spiritually, we're a hub where individuals and companies converge to unleash their full potential. Organizationally, we are a conglomerate composed of various entities, each contributing to the larger narrative of global excellence.
Redrob by McKinley Rice: Redefining Prospecting in the Modern Sales Era
Backed by a $40 million Series A funding from leading Korean & US VCs, Redrob is building the next frontier in global outbound sales. We’re not just another database—we’re a platform designed to eliminate the chaos of traditional prospecting. In a world where sales leaders chase meetings and deals through outdated CRMs, fragmented tools, and costly lead-gen platforms, Redrob provides a unified solution that brings everything under one roof.
Inspired by the breakthroughs of Salesforce, LinkedIn, and HubSpot, we’re creating a future where anyone, not just enterprise giants, can access real-time, high-quality data on 700 M+ decision-makers, all in just a few clicks.
At Redrob, we believe the way businesses find and engage prospects is broken. Sales teams deserve better than recycled data, clunky workflows, and opaque credit-based systems. That’s why we’ve built a seamless engine for:
- Precision prospecting
- Intent-based targeting
- Data enrichment from 16+ premium sources
- AI-driven workflows to book more meetings, faster
We’re not just streamlining outbound—we’re making it smarter, scalable, and accessible. Whether you’re an ambitious startup or a scaled SaaS company, Redrob is your growth copilot for unlocking warm conversations with the right people, globally.
EXPERIENCE
Duties you'll be entrusted with:
- Develop and execute scalable APIs and applications using the Node.js or Nest.js framework
- Writing efficient, reusable, testable, and scalable code.
- Understanding, analyzing, and implementing – Business needs, feature modification requests, and conversion into software components
- Integration of user-oriented elements into different applications, data storage solutions
- Developing – Backend components to enhance performance and receptiveness, server-side logic, and platform, statistical learning models, highly responsive web applications
- Designing and implementing – High availability and low latency applications, data protection and security features
- Performance tuning and automation of applications and enhancing the functionalities of current software systems.
- Keeping abreast with the latest technology and trends.
Expectations from you:
Basic Requirements
- Minimum qualification: Bachelor’s degree or more in Computer Science, Software Engineering, Artificial Intelligence, or a related field.
- Experience with Cloud platforms (AWS, Azure, GCP).
- Strong understanding of monitoring, logging, and observability practices.
- Experience with event-driven architectures (e.g., Kafka, RabbitMQ).
- Expertise in designing, implementing, and optimizing Elasticsearch.
- Work with modern tools including Jira, Slack, GitHub, Google Docs, etc.
- Expertise in Event driven architecture.
- Experience in Integrating Generative AI APIs.
- Working experience in high user concurrency.
- Experience in scaled databases for handling millions of records - indexing, retrieval, etc.,
Technical Skills
- Demonstrable experience in web application development with expertise in Node.js or Nest.js.
- Knowledge of database technologies and agile development methodologies.
- Experience working with databases, such as MySQL or MongoDB.
- Familiarity with web development frameworks, such as Express.js.
- Understanding of microservices architecture and DevOps principles.
- Well-versed with AWS and serverless architecture.
Soft Skills
- A quick and critical thinker with the ability to come up with a number of ideas about a topic and bring fresh and innovative ideas to the table to enhance the visual impact of our content.
- Potential to apply innovative and exciting ideas, concepts, and technologies.
- Stay up-to-date with the latest design trends, animation techniques, and software advancements.
- Multi-tasking and time-management skills, with the ability to prioritize tasks.
THRIVE
Some of the extensive benefits of being part of our team:
- We offer skill enhancement and educational reimbursement opportunities to help you further develop your expertise.
- The Member Reward Program provides an opportunity for you to earn up to INR 85,000 as an annual Performance Bonus.
- The McKinley Cares Program has a wide range of benefits:
- The wellness program covers sessions for mental wellness, and fitness and offers health insurance.
- In-house benefits have a referral bonus window and sponsored social functions.
- An Expanded Leave Basket including paid Maternity and Paternity Leaves and rejuvenation Leaves apart from the regular 20 leaves per annum.
- Our Family Support benefits not only include maternity and paternity leaves but also extend to provide childcare benefits.
- In addition to the retention bonus, our McKinley Retention Benefits program also includes a Leave Travel Allowance program.
- We also offer an exclusive McKinley Loan Program designed to assist our employees during challenging times and alleviate financial burdens.
About Synorus
Synorus is building a next-generation ecosystem of AI-first products. Our flagship legal-AI platform LexVault is redefining legal research, drafting, knowledge retrieval, and case intelligence using domain-tuned LLMs, private RAG pipelines, and secure reasoning systems.
If you are passionate about AI, legaltech, and training high-performance models — this internship will put you on the front line of innovation.
Role Overview
We are seeking passionate AI/LLM Engineering Interns who can:
- Fine-tune LLMs for legal domain use-cases
- Train and experiment with open-source foundation models
- Work with large datasets efficiently
- Build RAG pipelines and text-processing frameworks
- Run model training workflows on Google Colab / Kaggle / Cloud GPUs
This is a hands-on engineering and research internship — you will work directly with senior founders & technical leadership.
Key Responsibilities
- Fine-tune transformer-based models (Llama, Mistral, Gemma, etc.)
- Build and preprocess legal datasets at scale
- Develop efficient inference & training pipelines
- Evaluate models for accuracy, hallucinations, and trustworthiness
- Implement RAG architectures (vector DBs + embeddings)
- Work with GPU environments (Colab/Kaggle/Cloud)
- Contribute to model improvements, prompt engineering & safety tuning
Must-Have Skills
- Strong knowledge of Python & PyTorch
- Understanding of LLMs, Transformers, Tokenization
- Hands-on experience with HuggingFace Transformers
- Familiarity with LoRA/QLoRA, PEFT training
- Data wrangling: Pandas, NumPy, tokenizers
- Ability to handle multi-GB datasets efficiently
Bonus Skills
(Not mandatory — but a strong plus)
- Experience with RAG / vector DBs (Chroma, Qdrant, LanceDB)
- Familiarity with vLLM, llama.cpp, GGUF
- Worked on summarization, Q&A or document-AI projects
- Knowledge of legal texts (Indian laws/case-law/statutes)
- Open-source contributions or research work
What You Will Gain
- Real-world training on LLM fine-tuning & legal AI
- Exposure to production-grade AI pipelines
- Direct mentorship from engineering leadership
- Research + industry project portfolio
- Letter of experience + potential full-time offer
Ideal Candidate
- You experiment with models on weekends
- You love pushing GPUs to their limits
- You prefer research + implementation over theory alone
- You want to build AI that matters — not just demos
Location - Remote
Stipend - 5K - 10K
bout the Role
We are seeking an experienced Python Data Engineer with a strong foundation in API and basic UI development. This role is essential for advancing our analytics capabilities for AI products, helping us gain deeper insights into product performance and driving data-backed improvements. If you have a background in AI/ML, familiarity with large language models (LLMs), and a solid grasp of Python libraries for AI, we’d like to connect!
Key Responsibilities
• Develop Analytics Framework: Build a comprehensive analytics framework to evaluate and monitor AI product performance and business value.
• Define KPIs with Stakeholders: Collaborate with key stakeholders to establish and measure KPIs that gauge AI product maturity and impact.
• Data Analysis for Actionable Insights: Dive into complex data sets to identify patterns and provide actionable insights to support product improvements.
• Data Collection & Processing: Lead data collection, cleaning, and processing to ensure high-quality, actionable data for analysis.
• Clear Reporting of Findings: Present findings to stakeholders in a clear, concise manner, emphasizing actionable insights.
Required Skills
• Technical Skills:
o Proficiency in Python, including experience with key AI/ML libraries.
o Basic knowledge of UI and API development.
o Understanding of large language models (LLMs) and experience using them effectively.
• Analytical & Communication Skills:
o Strong problem-solving skills to address complex, ambiguous challenges.
o Ability to translate data insights into understandable reports for non-technical stakeholders.
o Knowledge of machine learning algorithms and frameworks to assess AI product effectiveness.
o Experience in statistical methods to interpret data and build metrics frameworks.
o Skilled in quantitative analysis to drive actionable insights.
What you’ll be doing
Weare much more than our job descriptions, but here is where you will begin:
As a Senior Software Engineer Data & ML You’ll Be:
● Architect, design, test, implement, deploy, monitor and maintain end-to-end backend
services. You build it, you own it.
● Work with people from other teams and departments on a day to day basis to ensure
efficient project execution with a focus on delivering value to our members.
● Regularly aligning your team’s vision and roadmap with the target architecture within your
domain and to ensure the success of complex multi domain initiatives.
● Integrate already trained ML and GenAI models (preferably GCP in services.
ROLE:
Whatyou’ll need,
Like us, you’ll be deeply committed to delivering impactful outcomes for customers.
What Makes You a Great Fit
● 5 years of proven work experience as a Backend Python Engineer
● Understanding of software engineering fundamentals OOPS, SOLID, etc.)
● Hands-on experience with Python libraries like Pandas, NumPy, Scikit-learn,
Lang chain/LLamaIndex etc.
● Experience with machine learning frameworks such as PyTorch or TensorFlow, Keras, being
proficient in Python
● Hands-on Experience with frameworks such as Django or FastAPI or Flask
● Hands-on experience with MySQL, MongoDB, Redis and BigQuery (or equivalents)
● Extensive experience integrating with or creating REST APIs
● Experience with creating and maintaining CI/CD pipelines- we use GitHub Actions.
● Experience with event-driven architectures like Kafka, RabbitMq or equivalents.
● Knowledge about:
o LLMs
o Vector stores/databases
o PromptEngineering
o Embeddings and their implementations
● Somehands-onexperience in implementations of the above ML/AI will be preferred
● Experience with GCP/AWS services.
● You are curious about and motivated by the future trends in data, AI/ML, analytics













