Cutshort logo
IT Industry  logo
AI Runtime Lead (LLM DevOps, PyTorch)
IT Industry
AI Runtime Lead (LLM DevOps, PyTorch)
IT Industry 's logo

AI Runtime Lead (LLM DevOps, PyTorch)

at IT Industry

Agency job
5 - 8 yrs
₹12L - ₹16L / yr
Bengaluru (Bangalore)
Skills
skill iconPython

Review Criteria:

Mandatory:

  • Strong AI Runtime Engineering (Lead / Staff) Profiles
  • Must have 4+ years of software engineering experience
  • Must have proven 1+ years of experience designing, building, and owning AI runtime infrastructure supporting distributed training and/or inference at scale
  • Must have hands-on experience optimizing deep learning runtimes such as PyTorch, TensorFlow, etc
  • Must have strong low-level performance engineering experience, including profiling, debugging, and optimizing system throughput, latency, and reliability
  • Must have experience leading or mentoring a team, including technical guidance, code reviews, and delivery ownership
  • Must have strong programming skills in Python, Java, C++ , etc


Preferred:

Experience with Kubernetes, Ray, TorchElastic, or custom AI job orchestration frameworks

Exposure to LLM training pipelines, checkpointing, elastic or distributed training orchestration


Role & Responsibilities:

As Lead/Staff AI Runtime Engineer, you’ll play a pivotal role in the design, development, and optimization of the core runtime infrastructure that powers distributed training and deployment of large AI models (LLMs and beyond). This is a hands-on leadership role - perfect for a systems-minded software engineer who thrives at the intersection of AI workloads, runtimes, and performance-critical infrastructure. You’ll own critical components of our PyTorch-based stack, lead technical direction, and collaborate across engineering, research, and product to push the boundaries of elastic, fault-tolerant, high-performance model execution.


What you’ll do:

Lead Runtime Design & Development:

  • Own the core runtime architecture supporting AI training and inference at scale.
  • Design resilient and elastic runtime features (e.g. dynamic node scaling, job recovery) within our custom PyTorch stack.
  • Optimize distributed training reliability, orchestration, and job-level fault tolerance.


Drive Performance at Scale:

  • Profile and enhance low-level system performance across training and inference pipelines.
  • Improve packaging, deployment, and integration of customer models in production environments.
  • Ensure consistent throughput, latency, and reliability metrics across multi-node, multi- GPU setups.


Build Internal Tooling & Frameworks:

  • Design and maintain libraries and services that support model lifecycle: training, check pointing, fault recovery, packaging, and deployment.
  • Implement observability hooks, diagnostics, and resilience mechanisms for deep learning workloads.
  • Champion best practices in CI/CD, testing, and software quality across the AI Runtime stack.


Collaborate & Mentor:

  • Work cross-functionally with Research, Infrastructure, and Product teams to align runtime development with customer and platform needs.
  • Guide technical discussions, mentor junior engineers, and help scale the AI Runtime team’s capabilities.


Ideal Candidate:

  • 5+ years of experience in systems/software engineering, with deep exposure to AI runtime, distributed systems, or compiler/runtime interaction.
  • Experience in delivering PaaS services.
  • Proven experience optimizing and scaling deep learning runtimes (e.g. PyTorch, TensorFlow, JAX) for large-scale training and/or inference.
  • Strong programming skills in Python and C++ (Go or Rust is a plus).
  • Familiarity with distributed training frameworks, low-level performance tuning, and resource orchestration.
  • Experience working with multi-GPU, multi-node, or cloud-native AI workloads.
  • Solid understanding of containerized workloads, job scheduling, and failure recovery inproduction environments.
Read more
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Shubham Vishwakarma's profile image

Shubham Vishwakarma

Full Stack Developer - Averlon
I had an amazing experience. It was a delight getting interviewed via Cutshort. The entire end to end process was amazing. I would like to mention Reshika, she was just amazing wrt guiding me through the process. Thank you team.
Companies hiring on Cutshort
companies logos

Similar jobs

Sukrthi Recruit
Sindhu Sindhu
Posted by Sindhu Sindhu
Coimbatore
2 - 3 yrs
₹2L - ₹3L / yr
Communication Skills
Operating systems
Team Management
Time management

A VMC Operator is responsible for operating and monitoring Vertical Machining Center (VMC) machines to manufacture precision components as per engineering drawings and production requirements. The role involves setting up workpieces, loading CNC programs, selecting cutting tools, and setting machine parameters to achieve accurate machining results.

The operator reads and interprets technical drawings, performs in-process and final inspections using measuring instruments, and ensures components meet quality standards and tolerances. Responsibilities also include basic CNC programming edits, tool offset setting, machine troubleshooting, routine maintenance, and maintaining a clean and safe work environment.

VMC Operators work closely with production, quality, and maintenance teams to meet production targets while adhering to safety procedures and quality standards.

Key Skills: VMC operation, CNC programming basics, blueprint reading, precision measurement.

Qualifications: ITI / Diploma in Mechanical or related trade; experience in CNC machining preferred.

Read more
Fireblaze Technologies
Fireblaze Technologies
Posted by Fireblaze Technologies
Nagpur
1 - 3 yrs
₹2L - ₹3L / yr
Lead Generation
Data conversion
Sales
Target audience

We are seeking a dynamic and target-driven Academic Counsellor (Sales) to convert student inquiries into enrollments. The role involves counselling prospective students, promoting academic programs, achieving admission targets, and ensuring a high-quality student experience.

Key Responsibilities

  • Handle inbound and outbound calls, emails, and walk-ins from prospective students
  • Counsel students and parents about courses, curriculum, fees, and career outcomes
  • Convert leads into admissions and consistently achieve monthly sales targets
  • Follow up with prospects and maintain accurate records in CRM systems
  • Explain course benefits, USPs, and value propositions effectively
  • Coordinate with marketing and academic teams for lead management
  • Participate in admission drives, seminars, webinars, and events
  • Handle objections professionally and close sales
  • Provide post-enrollment support to ensure student satisfaction

Requirement:

  • Bachelor’s degree in any discipline
  • Experience:
  • 0–1 year (Freshers with sales aptitude can apply)
  • 1–4 years experience in education sales, academic counselling, or inside sales preferred

Preferred Attributes

  • Experience in ed-tech, coaching institutes, or training organizations
  • Willingness to work on weekends or during peak admission periods
  • Self-motivated with a positive attitude



Read more
Kodo
at Kodo
2 recruiters
Agency job
via Babblebots by Omkar Karnani
Mumbai
3 - 6 yrs
₹18L - ₹25L / yr
skill iconKubernetes
skill iconDocker
Microsoft Windows Azure
Bash
CI/CD
+1 more

Must Have -

a. Background working with Startups

b. Good knowledge of Kubernetes & Docker

c. Background working in Azure


What you’ll be doing


  • Ensure that our applications and environments are stable, scalable, secure and performing as expected.
  • Proactively engage and work in alignment with cross-functional colleagues to understand their requirements, contributing to and providing suitable supporting solutions.
  • Develop and introduce systems to aid and facilitate rapid growth including implementation of deployment policies, designing and implementing new procedures, configuration management and planning of patches and for capacity upgrades
  • Observability: ensure suitable levels of monitoring and alerting are in place to keep engineers aware of issues.
  • Establish runbooks and procedures to keep outages to a minimum. Jump in before users notice that things are off track, then automate it for the future.
  • Automate everything so that nothing is ever done manually in production.
  • Identify and mitigate reliability and security risks. Make sure we are prepared for peak times,
  • DDoS attacks and fat fingers.
  • Troubleshoot issues across the whole stack - software, applications and network.
  • Manage individual project priorities, deadlines, and deliverables as part of a self-organizing team.
  • Learn and unlearn every day by exchanging knowledge and new insights, conducting constructive code reviews, and participating in retrospectives.


Requirements

  • 2+ years extensive experience of Linux server administration include patching, packaging (rpm), performance tuning, networking, user management, and security.
  • 2+ years of implementing systems that are highly available, secure, scalable, and self-healingon Azure cloud platform
  • Strong understanding of networking, especially in cloud environments along with a good understanding of CICD.
  • Prior experience implementing industry standard security best practices, including those recommended by Azure
  • Proficiency with Bash, and any high-level scripting language.
  • Basic working knowledge of observability stacks like ELK, prometheus, grafana, Signoz etc
  • Proficiency with Infrastructure as Code and Infrastructure Testing, preferably using Pulumi/Terraform.
  • Hands-on experience in building and administering VMs and Containers using tools such as Docker/Kubernetes.
  • Excellent communication skills, spoken as well as written, with a demonstrated ability to articulate technical problems and projects to all stakeholders.


Read more
Startup connecting physical & digital worlds through tech
Startup connecting physical & digital worlds through tech
Agency job
via Qrata by Blessy Fernandes
Bengaluru (Bangalore)
3 - 7 yrs
₹15L - ₹18L / yr
skill iconPython
skill iconJavascript
TypeScript
skill iconHTML/CSS
  • Work closely with product managers and engineers to design, implement, test and continually improve scalable frontend and backend services.
  • Develop products using agile methods and tools.
  • Develop commercial grade software that is user friendly and suitable for a global audience.
  • Plan, create and execute (manual and automated) tests.
  • Be involved and participate in the overall application lifecycle.
  • Building reusable code and libraries for future use.
  • Staying up to date with current technologies and providing insights on cutting edge software approaches, architectures, and vendors.
Required Skills

  • Fluency in any one of JavaScript, TypeScript or Python.
  • Strong problem solving skills. 
  • Should have built large scalable enterprise applications from scratch.
  • Strong experience in architectural patterns, High level designs.
  • Experience in Nosql and SQL DBs.
  •  
Read more
"A Reputed MNC"
Chennai, Bengaluru (Bangalore), Hyderabad, Kolkata, Pune, Mumbai, NCR (Delhi | Gurgaon | Noida)
15 - 26 yrs
₹28L - ₹35L / yr
Asset management
Asset manager
IT asset management
Hardware

About the Role

Objectives

  • Budget: Management of budget for Hardware Asset Management tools, services and staffing (permanent and temporary)
  • Talent management: Permanent, contract, temporary and graduate trainees
  • Matrix management: Management of staff with Hardware Asset Management responsibility in other business units
  • Lead third-party Hardware Asset Management resources: Supervisory responsibility for third-party resources supporting Hardware Asset Management activity
  • Financial: Oversight of spending on technology asset support, which includes hardware, software, cloud services, support, maintenance and hardware disposal
  • Environment: Endpoints and edge devices, software and service asset vendors, on-premises data center assets, and cloud services
  • Other: Liability management and remediation tracking

Responsibilities

The IT asset manager will:

  • Ensure that appropriate resources (people, tools, services) and processes are in place to gather and analyze data relating to technology asset life cycles and related processes.
  • Promote and advocate the value of Hardware Asset Management to various leaders. This includes developing and promoting effective Hardware Asset Management strategies, policy, approaches and practices across the full range of digital technology assets used within the organization.
  • Identify and report on breaches of Hardware Asset Management policy, and track remedial action.
  • Report on (or develop reporting on) remediation of digital technology asset-related risks and progress of optimization activity.
  • Ensure accurate and timely reporting, and that data completeness and quality issues are quickly identified and escalated to the responsible data or process owners.
  • Gather data on and provide analysis of all activities that have an impact on the value, cost and risk of digital technology asset life cycles.
  • Provide finance with the data to support the analysis of digital technology asset project and maintenance budgets and business cases.
  • Assign responsibilities for sub disciplines, such as IT asset disposition (ITAD).
  • Ensure that Hardware Asset Management -related tools and services are properly evaluated, selected, implemented, configured and maintained, with appropriate integration with other sources of organizational data.
  • Provide access to technology asset data in support of essential activities required to support the effective running of the business.
  • Manage the everyday functions of the core Hardware Asset Management team. This includes staffing, budgeting and other relevant management functions required to hold all IT and business stakeholders accountable for optimizing the cost, risk and benefit of digital technology assets throughout their life cycle.
  • Managing the deployment and collection of devices and other hardware assets to the associates or offices.

Requirements of the Hardware Asset Manager

Behaviors and Competencies

The IT asset manager acts in a leadership role and must demonstrate the following leadership attributes:

  • Strategic thinking — An understanding of strategic business objectives and the ability to develop a vision and strategy for Hardware Asset Management and execute it effectively in order to drive results toward those objectives
  • Interpersonal skills Proven ability to collaborate, build relationships and influence individuals at all levels in a matrix management environment (as well as external vendors and service providers) to ensure that segregation and overlapping roles are identified and coordinated
  • Strategic relationship management — Includes working with vendors during compliance audits
  • Strategic vision Ability to sense emerging needs and drive change as Hardware Asset Management develops and matures to meet organizational and technology challenges

Skills

The Hardware asset manager must have the following skills:

  • Strong communication skills with a proven ability to understand key business and technical concepts, and then effectively communicate these concepts with technical staff, business stakeholders and senior management
  • Strong organizational skills, the ability to perform under pressure and to manage multiple priorities with competing demands for resources
  • Strong analytical, data processing and problem-solving skills
  • Proficiency in process formulation and improvement
  • Ability to read and interrupt contracts to properly manage digital technology, which includes license and maintenance agreements
  • Ability to manage the deployment and collection of devices and other hardware assets to the associates or offices in a large scale. Demonstrable experience of managing assets in hundreds of thousands will be required.

Knowledge

The Hardware asset manager must have in-depth knowledge and experience of the following:

  • Technology asset life cycle (whether digital technology, information, financial or physical assets), best practice governance, tools and services
  • Policy, process, and procedure development and implementation
  • Technology contracts and their likely cost implications through the technology asset life cycle
  • Engagement with sourcing, procurement and vendor management legal, tax and accountancy advisors for additional information
  • Data processing, analysis and quality management tools
  • Ability to manage effective deployment, collection and disposition of assets and managing the devices through its lifecycle

Additionally, an understanding of IT business continuity, cybersecurity, and integrated risk management and optimization methodologies is highly desirable, along with knowledge of IT management.

Experience

The following experience is considered essential:

  • A management role, including engagement with sourcing, procurement and vendor management, business stakeholders and frontline operational IT staff
  • Prior Hardware Asset Management roles (hardware or software) or demonstrable experience of working with IT asset management professionals to deliver Hardware Asset Management capability and improved business outcomes
  • Atleast 16 years of experience with IT, enterprise asset management or information management
  • Demonstrated leadership experience building cross-organizational consensus with exposure to technology providers and/or business clients
  • Demonstrated experience in liaising with middle and senior management
  • Experience with building business requirements for digital technology tools and services, developing business cases, and selecting solutions using RFP documentation and processes
  • Experience leading system or tool implementations, with responsibility for verifying capabilities, outputs, dependencies and implementation scope
  • Experience managing deployment, collection and disposition of assets in a large scale with globally distributed workforce.

The following experience is considered desirable, but not mandatory:

  • 13+ years of experience working in the working in a large IT/ITeS organization
  • 13+ years of experience in managing external IT service providers handling digital technology assets
  • Experience with ServiceNow platform

This is a sensitive role dealing with commercial risks and significant costs. The organization must have a high level of confidence in the integrity and track record of the individual appointed to this role.

Certifications

The following certifications are considered desirable, but not mandatory:

  • ITIL v3 Foundation certification
  • Certified IT Asset Manager
  • COBIT 5 Foundation

Education and Training

  • An undergraduate or postgraduate degree in finance, IT, engineering, business management or a related field
  • Tertiary qualifications in financial accounting, project management or data analysis, or a legal or paralegal qualification that includes contract, copyright or intellectual property law are preferred

 

Read more
Softobiz Technologies Private limited
at Softobiz Technologies Private limited
2 candid answers
1 recruiter
Adlin Asha
Posted by Adlin Asha
Hyderabad
8 - 18 yrs
₹15L - ₹30L / yr
ETL
Informatica
Data Warehouse (DWH)
Amazon Redshift
skill iconPostgreSQL
+2 more

Experience: 8+ Years

Work Location: Hyderabad

Mode of work: Work from Office


Senior Data Engineer / Architect

 

Summary of the Role

 

The Senior Data Engineer / Architect will be a key role within the data and technology team, responsible for engineering and building data solutions that enable seamless use of data within the organization. 

 

Core Activities

-         Work closely with the business teams and business analysts to understand and document data usage requirements

-         Develop designs relating to data engineering solutions including data pipelines, ETL, data warehouse, data mart and data lake solutions

-         Develop data designs for reporting and other data use requirements

-         Develop data governance solutions that provide data governance services including data security, data quality, data lineage etc.

-         Lead implementation of data use and data quality solutions

-         Provide operational support for users for the implemented data solutions

-         Support development of solutions that automate reporting and business intelligence requirements

-         Support development of machine learning and AI solution using large scale internal and external datasets

 

Other activities

-         Work on and manage technology projects as and when required

-         Provide user and technical training on data solutions

 

Skills and Experience

-         At least 5-8 years of experience in a senior data engineer / architect role

-         Strong experience with AWS based data solutions including AWS Redshift, analytics and data governance solutions

-         Strong experience with industry standard data governance / data quality solutions  

-         Strong experience with managing a Postgres SQL data environment

-         Background as a software developer working in AWS / Python will be beneficial

-         Experience with BI tools like Power BI and Tableau

Strong written and oral communication skills

 

Read more
SynRadar
at SynRadar
1 video
2 recruiters
Ashish Rao
Posted by Ashish Rao
Mumbai, Navi Mumbai
2 - 4 yrs
₹4L - ₹8L / yr
Web application security
Cyber Security
Vulnerability assessment
Penetration testing
Information security
+6 more

This profile will include following responsibilities:

 

- Perform Web Application Security Testing

- Perform Mobile Application Security Testing

- Scan Network for Security Vulnerabilities

- Co-ordinate with the clients for Project related queries

- Undertake meeting with the client teams for discussing security issues and recommendations

- Create detailed security reports

- Keep track of project progress & send regular updates

- Research on Open source security tools & new security topics

- Create Security Knowledge base for the team

The candidate should be we well versed with application security concepts, including the mitigation techniques:
  • Web Application Security – OWASP Top 10
  • Mobile Application Security – Mobile OWASP Top 10
  • Threat Modelling
  • Risk Rating Frameworks
  • Web Traffic Interception (For Web/Mobile apps)
  • SSL
  • Network Concepts
  • Web Development Basics - HTTP/HTML/JavaScript
  • Basic Mobile Application Concepts (either Android or IOS)
Read more
An Aviation service company.
An Aviation service company.
Agency job
via kredo by Amit Lal
NCR (Delhi | Gurgaon | Noida)
3 - 5 yrs
₹3L - ₹6L / yr
Tally
TD9
Tally developer
TDL
Tally definition language
+3 more

 

Job Description

 

POSITION                         -                  Tally Developer

QUALIFICATION              -                  Graduate

WORK EXPERIENCE        -                  3 - 5+ years

LOCATION                        -                  Gurgaon

 

Job Description:

  • Candidate with minimum 3 to 5 years of strong experience in Tally Customization Development
  • ERP9 TDL program development & implementation
  • ERP9 Technical support
  • Manage Tally Customization & Tally Integration requirements

 

 

Required Skills:

  • Tally Definition Language, Excel, XML and Tally
  • Tally Integration with other systems / database
  • Good knowledge of Finance and accounting with commerce background.
  • Good communication skills and customer handling experience
  • Experience with other development languages (Asp.Net, MVC, HTML, SQL) is added advantage.

 

 

 

 

Read more
Smart Data Enterprises
at Smart Data Enterprises
3 recruiters
Sarika Tarudkar
Posted by Sarika Tarudkar
Nagpur
2 - 6 yrs
₹3L - ₹7L / yr
skill iconNodeJS (Node.js)
skill iconAngularJS (1.x)
MEAN stack
skill iconAngular (2+)
skill iconMongoDB
+3 more

About You:

We are looking for a candidate with 3 years of experience in mobile app development. Must have done at least 4 projects in react native. In this position you’ll be working with technologies like React Native, Redux, RX Observables, Type Script etc.

 

Skills & Qualifications:

- Good mobile app development experience (3 years)

- Strong knowledge of React Native(2+ years)

- Strong Javascript knowledge

- Strong attention to detail in UX & interactions

- Familiar with different tools Sentry, Bitrise, Hockeyapp, ESLint

- Familiar with Javascript ES6/ES7

- Experience with HTML/CSS(SAAS/LESS)

- Experience integrating REST APIs.

- Good knowledge of Git, Gitlab/bitbucket/Github etc.

- Familiarity with Redux

Read more
SpryOX
at SpryOX
2 recruiters
Alefiya Balasinorwala
Posted by Alefiya Balasinorwala
Mumbai
0 - 1 yrs
₹1L - ₹3L / yr
Business Development
Client Servicing
Sales
Presales
Salesforce
Company Profile: SpryOX is a creative and results-driven software organisation established in the year 2011. We specialize in Website Design & Development, Mobile App Development, E-commerce Solutions and Content Management Systems. We love to create beautiful online experiences and innovative solutions. We create a unique process for each client to ensure that business objectives are met, success is achieved and users are happy. Designation: Sales & Marketing Executive Qualification: Graduate in BCA/BTECH/BSCIT/MCA Experience: Any Experienced or Fresher Salary: 12000 to 25000 Location: Charni Road (Mumbai) Job Profile : Generate Leads, Sell Web Development Services like (domain registration, website hosting, website designing and development, bulk sms, e-commerce sites/portal, Meeting clients directly, Tele-caller, Concept sellers for online marketing & internet based solutions. Skills Required: • Strong communication skills, effective verbal and written communication skills in English. • Following up new business opportunities and setting up meetings, generate Leads. • Candidate should be able to do marketing in website design & development services. • Attend meeting and gather all the requirements needed from the clients, regarding the project. • Slightly competitive with a strong desire and ability to hit targets and exceed expectations. • Good leadership potential and an ability to keep meetings on track, on time, and on target. • Good time manager, able to juggle numerous tasks simultaneously to meet project deadlines. • Familiarity with main types of software development life cycles. • Planning and preparing presentations • Communicating new product developments to prospective clients • Sense of urgency and ability to manage and prioritize multiple work requirements to meet deadlines. • Overseeing the development of marketing literature. • Strong in-person, over-the-phone, and online presentation skills • Ability to build rapport with prospects and customers over the phone and via email • Ability to identify the key needs of a client and the translate those needs into possible solution-sets • Flexible attitude and demonstrated ability to go above and beyond the job description to achieve success • Superior verbal communication and presentation skills • Manage multiple projects simultaneously and prioritize accordingly with the ability to execute a plan. • Highly technologically adept with a proven proficiency in Word, Excel, Outlook, Power Point, CRMs and other online resources for information. • Should be proactive & aggressive in nature. Only applicants who are available full time for a minimum duration of 1 year can apply.
Read more
Why apply to jobs via Cutshort
people_solving_puzzle
Personalized job matches
Stop wasting time. Get matched with jobs that meet your skills, aspirations and preferences.
people_verifying_people
Verified hiring teams
See actual hiring teams, find common social connections or connect with them directly.
ai_chip
Move faster with AI
We use AI to get you faster responses, recommendations and unmatched user experience.
Did not find a job you were looking for?
icon
Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.
companies logo
companies logo
companies logo
companies logo
companies logo
Get to hear about interesting companies hiring right now
Company logo
Company logo
Company logo
Company logo
Company logo
Linkedin iconFollow Cutshort
Users love Cutshort
Read about what our users have to say about finding their next opportunity on Cutshort.
Shubham Vishwakarma's profile image

Shubham Vishwakarma

Full Stack Developer - Averlon
I had an amazing experience. It was a delight getting interviewed via Cutshort. The entire end to end process was amazing. I would like to mention Reshika, she was just amazing wrt guiding me through the process. Thank you team.
Companies hiring on Cutshort
companies logos