
Full-Stack Machine Learning Engineer
Role: Full-Time, Long-Term Required: Python Preferred: C++
OVERVIEW
We are seeking a versatile ML engineer to join as a core member of our technical team. This is a long-term position for someone who wants to build sophisticated production systems and grow with a small, focused team. You will work across the entire stack—from data ingestion and feature engineering through model training, validation, and deployment.
The ideal candidate combines strong software engineering fundamentals with deep ML expertise, particularly in time series forecasting and quantitative applications. You should be comfortable operating independently, making architectural decisions, and owning systems end-to-end.
CORE TECHNICAL REQUIREMENTS
Python (Required): Professional-level proficiency writing clean, production-grade code—not just notebooks. Deep understanding of NumPy, Pandas, and their performance characteristics. You know when to use vectorized operations, understand memory management for large datasets, and can profile and optimize bottlenecks. Experience with async programming and multiprocessing is valuable.
Machine Learning (Required): Hands-on experience building and deploying ML systems in production. This goes beyond training models—you understand the full lifecycle: data validation, feature engineering, model selection, hyperparameter optimization, validation strategies, monitoring, and maintenance.
Specific experience we value: gradient boosting frameworks (LightGBM, XGBoost, CatBoost), time series forecasting, probabilistic prediction and uncertainty quantification, feature selection and dimensionality reduction, cross-validation strategies for non-IID data, model calibration.
You should understand overfitting deeply—not just as a concept but as something you actively defend against through proper validation, regularization, and architectural choices.
Data Pipelines (Required): Design and implement robust pipelines handling real-world messiness: missing data, late arrivals, schema changes, upstream failures. You understand idempotency, exactly-once semantics, and backfill strategies. Experience with workflow orchestration (Airflow, Prefect, Dagster) expected. Comfortable with ETL/ELT patterns, incremental vs full recomputation, data quality monitoring, database design and query optimization (PostgreSQL preferred), time series data at scale.
C++ (Preferred): Experience valuable for performance-critical components. Writing efficient C++ and interfacing with Python (pybind11, Cython) is a significant advantage.
HIGHLY DESIRABLE: MULTI-AGENT ORCHESTRATION
We are building systems leveraging LLM-based automation. Experience with multi-agent frameworks highly desirable: LangChain, LangGraph, or similar agent frameworks; designing reliable AI pipelines with error handling and fallbacks; prompt engineering and output parsing; managing context and state across agent interactions. You do not need to be an expert, but genuine interest and hands-on experience will set you apart.
DOMAIN EXPERIENCE: FINANCIAL DATA AND CRYPTO
Preference for candidates with experience in quantitative finance, algorithmic trading, or fintech; cryptocurrency markets and their unique characteristics; financial time series data and forecasting systems; market microstructure, volatility, and regime dynamics. This helps you understand why reproducibility is non-negotiable, why validation must account for temporal structure, and why production reliability cannot be an afterthought.
ENGINEERING STANDARDS
Code Quality: Readable, maintainable code others can modify. Proper version control (meaningful commits, branches, code review). Testing where appropriate. Documentation: docstrings, READMEs, decision records.
Production Mindset: Think about failure modes before they happen. Build in observability: logging, metrics, alerting. Design for reproducibility—same inputs produce same outputs.
Systems Thinking: Consider component interactions, not just isolated behavior. Understand tradeoffs: speed vs accuracy, flexibility vs simplicity. Zoom between architecture and implementation.
WHAT WE ARE LOOKING FOR
Self-Direction: Given a problem and context, you break it down, identify the path forward, and execute. You ask questions when genuinely blocked, not when you could find the answer yourself.
Long-Term Orientation: You think in years, not months. You make decisions considering future maintainability.
Intellectual Honesty: You acknowledge uncertainty and distinguish between what you know versus guess. When something fails, you dig into why.
Communication: You explain complex concepts clearly and document your reasoning.
EDUCATION
University degree in a quantitative/technical field preferred: Computer Science, Mathematics, Statistics, Physics, Engineering. Equivalent demonstrated expertise through work also considered.
TO APPLY
Include: (1) CV/resume, (2) Brief description of a production ML system you built, (3) Links to relevant work if available, (4) Availability and timezone.

About Capital Squared
About
Company social profiles
Similar jobs
We are seeking an experienced Python Lead to design, develop, and scale high-performance backend systems. The ideal candidate will have strong expertise in Python-based backend development, system design, and cloud-native architectures. You will lead the development of scalable APIs, work with modern cloud platforms, and collaborate with cross-functional teams to deliver reliable and efficient applications.
Key Responsibilities
- Design and develop scalable backend services using Python (Django/Flask).
- Build and maintain RESTful APIs and WebSocket-based applications.
- Implement efficient algorithms, data structures, and design patterns for high-performance systems.
- Develop and optimize database schemas and queries using PostgreSQL, MySQL, or MongoDB.
- Integrate caching and queuing systems to improve system performance and reliability.
- Deploy and manage applications on AWS or GCP cloud environments.
- Implement and maintain CI/CD pipelines using tools such as Jenkins, GitLab CI, or GitHub Actions.
- Work with Docker containers and Linux-based environments for development and deployment.
- Collaborate with engineering teams to design scalable system architectures.
- Explore and integrate AI-driven capabilities such as RAG, LLMs, and vector databases where applicable.
Required Skills
- Strong expertise in Python backend development using Django or Flask
- Experience with REST APIs, WebSockets, and microservices architecture
- Solid knowledge of design patterns, algorithms, and data structures
- Experience with relational and NoSQL databases (PostgreSQL, MySQL, MongoDB)
- Hands-on experience with AWS or GCP cloud services
- Experience with CI/CD pipelines and containerization (Docker)
- Proficiency in Git and Linux environments
Preferred Skills
- Familiarity with AI/ML concepts
- Experience with RAG architectures and LLM integrations
- Knowledge of vector databases such as Pinecone or ChromaDB
What We’re Looking For
- Strong problem-solving and system design skills
- Ability to lead backend development initiatives
- Experience building scalable and production-grade systems
- Excellent collaboration and communication skills
About the Role-
Thinking big and executing beyond what is expected. The challenges cut across algorithmic problem solving, systems engineering, machine learning and infrastructure at a massive scale.
Reason to Join-
An opportunity for innovators, problem solvers & learners. Working will be Innovative, empowering, rewarding & fun. Amazing Office, competitive pay along with excellent benefits package.
Requiremets and Responsibilities- (please read carefully before applying)
- The overall experience of 3-6 years in Java/Python Framework and Machine Learning.
- Develop Web Services, REST, XSD, XML technologies, Java, Python, AWS, API.
- Experience on Elastic Search or SOLR or Lucene -Search Engine, Text Mining, Indexing.
- Experience in highly scalable tools like Kafka, Spark, Aerospike, etc.
- Hands on experience in Design, Architecture, Implementation, Performance & Scalability, and Distributed Systems.
- Design, implement, and deploy highly scalable and reliable systems.
- Troubleshoot Solr indexing process and querying engine.
- Bachelors or Masters in Computer Science from Tier 1 Institutions
Must have:
- 8+ years of experience with a significant focus on developing, deploying & supporting AI solutions in production environments.
- Proven experience in building enterprise software products for B2B businesses, particularly in the supply chain domain.
- Good understanding of Generics, OOPs concepts & Design Patterns
- Solid engineering and coding skills. Ability to write high-performance production quality code in Python
- Proficiency with ML libraries and frameworks (e.g., Pandas, TensorFlow, PyTorch, scikit-learn).
- Strong expertise in time series forecasting using stat, ML, DL and foundation models
- Experience of working on processing time series data employing techniques such as decomposition, clustering, outlier detection & treatment
- Exposure to generative AI models and agent architectures on platforms such as AWS Bedrock, Crew AI, Mosaic/Databricks, Azure
- Experience of working with modern data architectures, including data lakes and data warehouses, having leveraged one or more of the frameworks such as Airbyte, Airflow, Dagster, AWS Glue, Snowflake,, DBT
- Hands-on experience with cloud platforms (e.g., AWS, Azure, GCP) and deploying ML models in cloud environments.
- Excellent problem-solving skills and the ability to work independently as well as in a collaborative team environment.
- Effective communication skills, with the ability to convey complex technical concepts to non-technical stakeholders
Good To Have:
- Experience with MLOps tools and practices for continuous integration and deployment of ML models.
- Has familiarity with deploying applications on Kubernetes
- Knowledge of supply chain management principles and challenges.
- A Master's or Ph.D. in Computer Science, Machine Learning, Data Science, or a related field is preferred
Your responsibilities as a backend engineer will include:
- Back-end software development
- Software engineering and designing data models and write effective APIs
- Working together with engineers and product teams
- Understanding business use cases and requirements for different internal teams
- Maintenance of existing projects and New feature development
- Consume and integrate classifier/ ML snippets from Data science team
What we are looking for:
- 2+ years of industry experience with the Python and Django framework.
- Degree in Computer Science or related field
- Good analytical skills with strong fundamentals of data structures and algorithms
- Experience building backend services with hands-on experience through all stages of Agile software development life cycle.
- Ability to write optimized codes,debug programs, and integrate applications with third party tools by developing various APIs
- Experience with Databases.
- Experience with writing REST-APIs.
- Prototyping initial collection and leveraging existing tools and/or creating new tools
- Experience working different types of datasets (e.g. unstructured, semi-structured, with missing information)
- Ability to think critically and creatively in a dynamic environment, while picking up new tools and domain knowledge along the way
- A positive attitude, and a growth mindset
Bonus:
- Experience with relevant Python libraries such as Requests, sklearn, Selenium
- Hands on experience in Machine learning implementations
- Experience with Cloud infrastructure (e.g. AWS) and relevant microservices
- Good Humor
Note- We are currently working from home due to the pandemic. If selected you may work from a remote location though once office reopens candidate must work from Office.
Your responsibilities as a backend engineer will include:
- Back-end software development
- Software engineering and designing data models and write effective APIs
- Working together with engineers and product teams
- Understanding business use cases and requirements for different internal teams
- Maintenance of existing projects and New feature development
- Consume and integrate classifier/ ML snippets from Data science team
What we are looking for:
- 2+ years of industry experience with the Python and Django framework.
- Degree in Computer Science or related field
- Good analytical skills with strong fundamentals of data structures and algorithms
- Experience building backend services with hands-on experience through all stages of Agile software development life cycle.
- Ability to write optimized codes,debug programs, and integrate applications with third party tools by developing various APIs
- Experience with Databases.
- Experience with writing REST-APIs.
- Prototyping initial collection and leveraging existing tools and/or creating new tools
- Experience working different types of datasets (e.g. unstructured, semi-structured, with missing information)
- Ability to think critically and creatively in a dynamic environment, while picking up new tools and domain knowledge along the way
- A positive attitude, and a growth mindset
Bonus:
- Experience with relevant Python libraries such as Requests, sklearn, Selenium
- Hands on experience in Machine learning implementations
- Experience with Cloud infrastructure (e.g. AWS) and relevant microservices
- Good Humor
Responsibilities:
- Writing reusable, testable, and efficient code
- Design and implementation of low-latency, high-availability, and performant applications
- Integration of user-facing elements developed by front-end developers with server side logic
- Implementation of security and data protection
- Integration of data storage solutions (may include databases, key-value stores, blob stores, etc.)
- Expert in Python, with knowledge of at least one Python web framework (such as Django, Flask, etc depending on your technology stack)
- Familiarity with some ORM (Object Relational Mapper) libraries
- Able to integrate multiple data sources and databases into one system
- Understanding of the threading limitations of Python, and multi-process architecture
- Good understanding of server-side templating languages (such as Jinja 2, Mako, etc depending on your technology stack)
- Basic understanding of front-end technologies, such as JavaScript, HTML5, and CSS3
- Understanding of accessibility and security compliance (depending on the specific project)
- Knowledge of user authentication and authorization between multiple systems, servers, and environments
- Understanding of fundamental design principles behind a scalable application
- Familiarity with event-driven programming in Python
- Understanding of the differences between multiple delivery platforms, such as mobile vs desktop, and optimizing output to match the specific platform
- Able to create database schemas that represent and support business processes
- Strong unit test and debugging skills
- Basic knowledge of machine learning algorithm and libraries like keras, tensorflow, sklearn.








