Discovered Labs

https://discoveredlabs.com

Founded :

2024

Type :

Products & Services

Size :

0-20

Stage :

Profitable

About

The first AEO/GEO agency that helps B2B companies dominate AI search results. Get discovered by your customers when they ask AI for recommendations.

Jobs at Discovered Labs

Senior Data Engineer

at Discovered Labs

Posted by Ben Moore

The recruiter has not been active on this job recently. You may apply but please expect a delayed response.

Remote only

4 - 8 yrs

₹32L - ₹54L / yr

Data engineering

Senior Data Engineer

Pls apply here:

tinyurl [dot] com/ysk8w2eu

About Discovered Labs

At Discovered Labs we work with $10M - $50M ARR companies to help them get more leads, users and customers from Google, Bing and AI assistants such as ChatGPT, Claude and Perplexity.

We approach marketing the way engineers approach systems: data in, insights out, feedback loops everywhere. Every decision traces back to measurable outcomes. Every workflow is designed to eliminate manual bottlenecks and compound over time.

High-level overview of our approach:

Data-driven automation: We treat marketing programs like products. We instrument everything, automate the repetitive, and focus human effort on high-leverage problems.
First principles thinking: We don't copy what others do. We understand the underlying mechanics of how search and AI systems work, then build solutions from that foundation.
Full-stack ownership: SEO and AEO rarely work as isolated tasks. We work across the entire funnel and multiple surface areas to ensure we own the outcome and clients win.

The Team

We're a deeply technical team building the SpaceX of the AEO & SEO space. You'll work alongside engineers who have built fraud engines powering Stripe, Plaid, and Coinbase; developed self-driving car systems at Aurora; and conducted AI research at Stanford. We don't have layers of management. You'll work directly with founders who can go deep on architecture, code, and product.

This Role

Own the data infrastructure behind automated reporting, AI visibility monitoring, competitive intelligence, and proactive alerting across a growing multi-tenant client base.

The hard problem is operational complexity, less so petabyte scale volume. Many clients, each with multiple data sources, different schemas, different API rate limits, different failure modes, different freshness requirements. When one breaks, it can't take down everyone else. Fault isolation, graceful degradation, and per-tenant reliability are built in from the start.

This is largely greenfield. You'll be building out monitoring, observability, data quality layers and pipeline orchestration.

You report to the CTO and work closely with product engineers who build the features that consume your data layer. You'll define interfaces and data contracts together. There's no platform team. You own your infrastructure, your CI, and your monitoring.

What You'll Do

Multi-tenant data infrastructure. Ingestion, validation, and transformation across multiple data sources. Fault isolation, schema variation, and graceful upstream failure handling.
Third-party API integration. Most of our data comes from external APIs with their own auth flows, rate limits, pagination quirks, and breaking changes. You'll build robust, resilient connectors that handle all of this gracefully across many client accounts.
Data quality systems. Automated checks on distributions, volumes, null rates, and freshness. Statistical validation, not just schema validation. Bad data doesn't make it downstream.
Data observability. Freshness monitoring, volume anomaly detection, schema drift detection, lineage tracking, blast radius analysis. You know the difference between "the code ran" and "the data is correct."
Alerting design. Not just dashboards. Threshold tuning, noise reduction, avoiding alert fatigue. Mean time to detection is a core metric for this role.
Freshness SLAs. Define them per source, build infrastructure to meet them, alert before they breach.
Event-driven trigger infrastructure. Surface performance changes, quality regressions, and freshness violations as events for downstream systems.
Entity data models. Design schemas for client, competitor, and content entities. Own schema evolution and backward compatibility.
Operational environment. CI/CD, containers, deployment pipelines, credential management. Every deploy passes CI before production.

The Ideal Person for This Role

A builder who ships. You care about getting working systems into production, not endless planning or polish. You've built data infrastructure people actually rely on.
An operator, not just an architect. You don't just design systems, you run them. You find satisfaction in making things reliable, not just making them work once.
An owner. You take responsibility for outcomes, not just tasks. When a pipeline you built breaks at 3am, you fix it and make sure it doesn't break again.
Humble and curious. You acknowledge what you don't know, ask good questions, and genuinely want to learn. You take feedback as a gift, not a threat.
A first-principles thinker. You understand why things work, not just how. You can go five levels deep on schema decisions, validation strategies, and architecture tradeoffs.
Always improving. You're not satisfied with "good enough." You actively seek ways to get better at your craft and make systems better over time.

Requirements

4+ years in data engineering, platform engineering, or infrastructure-heavy backend work.
Python, SQL, pipeline orchestration (Airflow, Dagster, Prefect, or similar).
Event-driven architectures or real-time data processing.
Third-party API integration. You've built resilient connectors against external APIs with rate limits, auth flows, pagination, and breaking changes. Not just calling endpoints, but handling the full operational reality.
Pipeline fundamentals. Idempotent pipelines, backfill strategies, and schema evolution handled gracefully in production.
Data quality systems in production. Automated checks on distributions, volumes, freshness, null rates. Not a one-off notebook.
Data observability. Freshness monitoring, anomaly detection, lineage tracking, blast radius analysis.
Alerting design. Threshold tuning, noise reduction, escalation paths. You've thought about false positives as much as missed detections.
Own your infrastructure. Containers, CI/CD, deployment pipelines, monitoring, credential management. No platform team to hand off to.
Multi-tenant or multi-client data systems. Tenant isolation, per-client configuration, and operational overhead at scale.
APIs or service layers for data exposure. You've built interfaces that other systems consume, not just internal scripts.
Collaborative. You'll work closely with product engineers to define data contracts and interfaces. You communicate tradeoffs clearly in writing. You document decisions, write clear specs, and communicate tradeoffs in writing.

Preferred Qualifications

Experience with marketing or analytics data (GA4, GSC, SEO tools)
Prior experience at a fast-moving startup

What's in It for You

Fully remote position
Work directly with the CTO on high-impact projects
High ownership and autonomy. No micromanagement.
First-hand exposure to cutting-edge AI and search technology
Your work will directly impact well-known (10M+ ARR) companies' performance
Join a fast-growing company at the intersection of AI and marketing

Our Hiring Process

Application
Take-Home Project
Technical Deep Dive
Leadership Interview
Reference Checks

Pls apply here:

tinyurl [dot] com/ysk8w2eu

Senior Data Engineer

Pls apply here:

tinyurl [dot] com/ysk8w2eu

About Discovered Labs

At Discovered Labs we work with $10M - $50M ARR companies to help them get more leads, users and customers from Google, Bing and AI assistants such as ChatGPT, Claude and Perplexity.

High-level overview of our approach:

Data-driven automation: We treat marketing programs like products. We instrument everything, automate the repetitive, and focus human effort on high-leverage problems.
First principles thinking: We don't copy what others do. We understand the underlying mechanics of how search and AI systems work, then build solutions from that foundation.
Full-stack ownership: SEO and AEO rarely work as isolated tasks. We work across the entire funnel and multiple surface areas to ensure we own the outcome and clients win.

The Team

This Role

Own the data infrastructure behind automated reporting, AI visibility monitoring, competitive intelligence, and proactive alerting across a growing multi-tenant client base.

This is largely greenfield. You'll be building out monitoring, observability, data quality layers and pipeline orchestration.

What You'll Do

Multi-tenant data infrastructure. Ingestion, validation, and transformation across multiple data sources. Fault isolation, schema variation, and graceful upstream failure handling.
Third-party API integration. Most of our data comes from external APIs with their own auth flows, rate limits, pagination quirks, and breaking changes. You'll build robust, resilient connectors that handle all of this gracefully across many client accounts.
Data quality systems. Automated checks on distributions, volumes, null rates, and freshness. Statistical validation, not just schema validation. Bad data doesn't make it downstream.
Data observability. Freshness monitoring, volume anomaly detection, schema drift detection, lineage tracking, blast radius analysis. You know the difference between "the code ran" and "the data is correct."
Alerting design. Not just dashboards. Threshold tuning, noise reduction, avoiding alert fatigue. Mean time to detection is a core metric for this role.
Freshness SLAs. Define them per source, build infrastructure to meet them, alert before they breach.
Event-driven trigger infrastructure. Surface performance changes, quality regressions, and freshness violations as events for downstream systems.
Entity data models. Design schemas for client, competitor, and content entities. Own schema evolution and backward compatibility.
Operational environment. CI/CD, containers, deployment pipelines, credential management. Every deploy passes CI before production.

The Ideal Person for This Role

A builder who ships. You care about getting working systems into production, not endless planning or polish. You've built data infrastructure people actually rely on.
An operator, not just an architect. You don't just design systems, you run them. You find satisfaction in making things reliable, not just making them work once.
An owner. You take responsibility for outcomes, not just tasks. When a pipeline you built breaks at 3am, you fix it and make sure it doesn't break again.
Humble and curious. You acknowledge what you don't know, ask good questions, and genuinely want to learn. You take feedback as a gift, not a threat.
A first-principles thinker. You understand why things work, not just how. You can go five levels deep on schema decisions, validation strategies, and architecture tradeoffs.
Always improving. You're not satisfied with "good enough." You actively seek ways to get better at your craft and make systems better over time.

Requirements

4+ years in data engineering, platform engineering, or infrastructure-heavy backend work.
Python, SQL, pipeline orchestration (Airflow, Dagster, Prefect, or similar).
Event-driven architectures or real-time data processing.
Third-party API integration. You've built resilient connectors against external APIs with rate limits, auth flows, pagination, and breaking changes. Not just calling endpoints, but handling the full operational reality.
Pipeline fundamentals. Idempotent pipelines, backfill strategies, and schema evolution handled gracefully in production.
Data quality systems in production. Automated checks on distributions, volumes, freshness, null rates. Not a one-off notebook.
Data observability. Freshness monitoring, anomaly detection, lineage tracking, blast radius analysis.
Alerting design. Threshold tuning, noise reduction, escalation paths. You've thought about false positives as much as missed detections.
Own your infrastructure. Containers, CI/CD, deployment pipelines, monitoring, credential management. No platform team to hand off to.
Multi-tenant or multi-client data systems. Tenant isolation, per-client configuration, and operational overhead at scale.
APIs or service layers for data exposure. You've built interfaces that other systems consume, not just internal scripts.
Collaborative. You'll work closely with product engineers to define data contracts and interfaces. You communicate tradeoffs clearly in writing. You document decisions, write clear specs, and communicate tradeoffs in writing.

Preferred Qualifications

Experience with marketing or analytics data (GA4, GSC, SEO tools)
Prior experience at a fast-moving startup

What's in It for You

Fully remote position
Work directly with the CTO on high-impact projects
High ownership and autonomy. No micromanagement.
First-hand exposure to cutting-edge AI and search technology
Your work will directly impact well-known (10M+ ARR) companies' performance
Join a fast-growing company at the intersection of AI and marketing

Our Hiring Process

Application
Take-Home Project
Technical Deep Dive
Leadership Interview
Reference Checks

Pls apply here:

tinyurl [dot] com/ysk8w2eu

Did not find a job you were looking for?

Search for relevant jobs from 10000+ companies such as Google, Amazon & Uber actively hiring on Cutshort.

Similar companies

Cutshort

https://cutshort.io

Founded

2015

Type

Product

Size

20-100

Stage

Profitable

About the company

To hire better and faster, companies need rich candidate data, smart software and sound human judgement.

Cutshort is using AI to combine all these 3 to offer a 10x talent sourcing solution that is faster, better and cheaper.

We have 3 AI-powered offerings

Hire using our AI platform: Affordable annual subscriptions
Get only sourcing: 3.5% of annual CTC when you hire
Get full recruiting: 6.99% of annual CTC when you hire

Customers such as Fractal, Sprinto, Shiprocket, Highlevel, ThoughtWorks, Deepintent have built strong engineering teams with Cutshort.

Jobs

Cutshort Lightning

https://cutshort.io

Founded

2023

Type

Services

Size

10-50

Stage

Bootstrapped

About the company

Jobs

OIP Insurtech

https://oipinsurtech.com

Founded

2012

Type

Products & Services

Size

1000-5000

Stage

Profitable

About the company

OIP Insurtech streamlines insurance operations and optimizes workflows by combining deep industry knowledge with advanced technology. Established in 2012, OIP InsurTech partners with carriers, MGAs, program managers, and TPAs in the US, Canada, and Europe, especially the UK.

With 1,200 professionals serving over 100 clients, we deliver insurance process automation, custom software development, high-quality underwriting services, and skilled tech staff to augment our clients.

While saving time and money is the immediate win, the real game-changer is giving our clients the freedom to grow their books, run their businesses, and focus on what they love. We’re proud to support them on this journey and make a positive impact on the industry!

Jobs

Ampera Technologies

https://amperatech.ai

Founded

2024

Type

Services

Size

20-100

Stage

Profitable

About the company

At Ampera Technologies, we empower businesses with cutting-edge data analytics, quality assurance, and data engineering solutions

Jobs

AMPM Network

https://ampmnetwork.com

Founded

2026

Type

Product

Size

0-20

Stage

Raised funding

About the company

Fast news. Real context. Clear perspective. India's first creator-led news app delivering 120-second briefings, twice a day. No doomscrolling. No outrage loops.

Jobs

Move my Stuff

https://movemstuff.com.au

Founded

2008

Type

Services

Size

20-100

Stage

Bootstrapped

About the company

Jobs

Ritually

https://ritually.ai

Founded

2025

Type

Product

Size

0-20

Stage

Raised funding

About the company

Ritually helps enterprises identify and automate their back-office rituals. We build the process intelligence layer that shows large organizations how their highest-value, most repetitive work actually gets done and turns that work into automation, for a world where humans and agents work together.

Our founders, Ari Winkleman and Rachel Bush, previously built and exited a startup (Involvio) to Cisco. The company is funded and working with design partners.

You'll join an AI-native, repeat founding team at the earliest stage, with equity and real scope. You'll work directly with the founders, own meaningful work from 0-1 and have a real path to grow with the company.

We care deeply about trust, craft, customer obsession, and speed and that's what we look for in the people who join us!

Jobs

Unimaa Software Private Limited

https://unimaasoftware.in

Founded

2024

Type

Products & Services

Size

0-20

Stage

Bootstrapped

About the company

Jobs

MindBridge

https://mindbridge.net.in

Founded

2025

Type

Product

Size

0-20

Stage

Bootstrapped

About the company

At MindBridge, we partner with businesses to solve complex challenges and unlock new opportunities for growth through consulting, shared services, and AI-powered solutions. We combine deep industry expertise with technology to help organizations transform critical business functions across finance, compliance, HR, IT, legal, and ESG. By delivering scalable, future-ready solutions, we enable our clients across the USA, UK, Europe, and the Middle East to improve operational efficiency, strengthen governance, and achieve sustainable business outcomes.

What sets us apart is our people and our collaborative culture. We believe in working together, embracing innovation, and creating meaningful impact for our clients, our communities, and one another. At MindBridge, you'll have the opportunity to work on challenging projects, grow alongside talented professionals, and contribute to building solutions that shape the future of global businesses.

Jobs

URHired

https://urhired.ie

Founded

2023

Type

Product

Size

0-20

Stage

Bootstrapped

About the company

URHired is an AI-powered interview preparation platform built for neurodivergent talent - trusted by football club foundations, universities and community organisations across Ireland, the UK and the US.

Jobs

Want to work at Discovered Labs?

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs

Discovered Labs

About

Company social profiles

Jobs at Discovered Labs

Senior Data Engineer

Similar companies

Cutshort

Cutshort Lightning

OIP Insurtech

Ampera Technologies

AMPM Network

Move my Stuff

Ritually

Unimaa Software Private Limited

MindBridge

URHired