

Discovered Labs
https://discoveredlabs.comJobs at Discovered Labs
The recruiter has not been active on this job recently. You may apply but please expect a delayed response.
Senior Data Engineer
Pls apply here:
tinyurl [dot] com/ysk8w2eu
About Discovered Labs
At Discovered Labs we work with $10M - $50M ARR companies to help them get more leads, users and customers from Google, Bing and AI assistants such as ChatGPT, Claude and Perplexity.
We approach marketing the way engineers approach systems: data in, insights out, feedback loops everywhere. Every decision traces back to measurable outcomes. Every workflow is designed to eliminate manual bottlenecks and compound over time.
High-level overview of our approach:
- Data-driven automation: We treat marketing programs like products. We instrument everything, automate the repetitive, and focus human effort on high-leverage problems.
- First principles thinking: We don't copy what others do. We understand the underlying mechanics of how search and AI systems work, then build solutions from that foundation.
- Full-stack ownership: SEO and AEO rarely work as isolated tasks. We work across the entire funnel and multiple surface areas to ensure we own the outcome and clients win.
The Team
We're a deeply technical team building the SpaceX of the AEO & SEO space. You'll work alongside engineers who have built fraud engines powering Stripe, Plaid, and Coinbase; developed self-driving car systems at Aurora; and conducted AI research at Stanford. We don't have layers of management. You'll work directly with founders who can go deep on architecture, code, and product.
This Role
Own the data infrastructure behind automated reporting, AI visibility monitoring, competitive intelligence, and proactive alerting across a growing multi-tenant client base.
The hard problem is operational complexity, less so petabyte scale volume. Many clients, each with multiple data sources, different schemas, different API rate limits, different failure modes, different freshness requirements. When one breaks, it can't take down everyone else. Fault isolation, graceful degradation, and per-tenant reliability are built in from the start.
This is largely greenfield. You'll be building out monitoring, observability, data quality layers and pipeline orchestration.
You report to the CTO and work closely with product engineers who build the features that consume your data layer. You'll define interfaces and data contracts together. There's no platform team. You own your infrastructure, your CI, and your monitoring.
What You'll Do
- Multi-tenant data infrastructure. Ingestion, validation, and transformation across multiple data sources. Fault isolation, schema variation, and graceful upstream failure handling.
- Third-party API integration. Most of our data comes from external APIs with their own auth flows, rate limits, pagination quirks, and breaking changes. You'll build robust, resilient connectors that handle all of this gracefully across many client accounts.
- Data quality systems. Automated checks on distributions, volumes, null rates, and freshness. Statistical validation, not just schema validation. Bad data doesn't make it downstream.
- Data observability. Freshness monitoring, volume anomaly detection, schema drift detection, lineage tracking, blast radius analysis. You know the difference between "the code ran" and "the data is correct."
- Alerting design. Not just dashboards. Threshold tuning, noise reduction, avoiding alert fatigue. Mean time to detection is a core metric for this role.
- Freshness SLAs. Define them per source, build infrastructure to meet them, alert before they breach.
- Event-driven trigger infrastructure. Surface performance changes, quality regressions, and freshness violations as events for downstream systems.
- Entity data models. Design schemas for client, competitor, and content entities. Own schema evolution and backward compatibility.
- Operational environment. CI/CD, containers, deployment pipelines, credential management. Every deploy passes CI before production.
The Ideal Person for This Role
- A builder who ships. You care about getting working systems into production, not endless planning or polish. You've built data infrastructure people actually rely on.
- An operator, not just an architect. You don't just design systems, you run them. You find satisfaction in making things reliable, not just making them work once.
- An owner. You take responsibility for outcomes, not just tasks. When a pipeline you built breaks at 3am, you fix it and make sure it doesn't break again.
- Humble and curious. You acknowledge what you don't know, ask good questions, and genuinely want to learn. You take feedback as a gift, not a threat.
- A first-principles thinker. You understand why things work, not just how. You can go five levels deep on schema decisions, validation strategies, and architecture tradeoffs.
- Always improving. You're not satisfied with "good enough." You actively seek ways to get better at your craft and make systems better over time.
Requirements
- 4+ years in data engineering, platform engineering, or infrastructure-heavy backend work.
- Python, SQL, pipeline orchestration (Airflow, Dagster, Prefect, or similar).
- Event-driven architectures or real-time data processing.
- Third-party API integration. You've built resilient connectors against external APIs with rate limits, auth flows, pagination, and breaking changes. Not just calling endpoints, but handling the full operational reality.
- Pipeline fundamentals. Idempotent pipelines, backfill strategies, and schema evolution handled gracefully in production.
- Data quality systems in production. Automated checks on distributions, volumes, freshness, null rates. Not a one-off notebook.
- Data observability. Freshness monitoring, anomaly detection, lineage tracking, blast radius analysis.
- Alerting design. Threshold tuning, noise reduction, escalation paths. You've thought about false positives as much as missed detections.
- Own your infrastructure. Containers, CI/CD, deployment pipelines, monitoring, credential management. No platform team to hand off to.
- Multi-tenant or multi-client data systems. Tenant isolation, per-client configuration, and operational overhead at scale.
- APIs or service layers for data exposure. You've built interfaces that other systems consume, not just internal scripts.
- Collaborative. You'll work closely with product engineers to define data contracts and interfaces. You communicate tradeoffs clearly in writing. You document decisions, write clear specs, and communicate tradeoffs in writing.
Preferred Qualifications
- Experience with marketing or analytics data (GA4, GSC, SEO tools)
- Prior experience at a fast-moving startup
What's in It for You
- Fully remote position
- Work directly with the CTO on high-impact projects
- High ownership and autonomy. No micromanagement.
- First-hand exposure to cutting-edge AI and search technology
- Your work will directly impact well-known (10M+ ARR) companies' performance
- Join a fast-growing company at the intersection of AI and marketing
Our Hiring Process
- Application
- Take-Home Project
- Technical Deep Dive
- Leadership Interview
- Reference Checks
Pls apply here:
tinyurl [dot] com/ysk8w2eu
Similar companies
About the company
Beyond Seek is a team of R.A.R.E individuals who're solving impactful problems using the best tools available today!
Jobs
1
About the company
Quantiphi is an award-winning AI-first digital engineering company driven by the desire to reimagine and realize transformational opportunities at the heart of the business. Since its inception in 2013, Quantiphi has solved the toughest and most complex business problems by combining deep industry experience, disciplined cloud, and data-engineering practices, and cutting-edge artificial intelligence research to achieve accelerated and quantifiable business results.
Jobs
8
About the company
Jobs
12
About the company
Jobs
2
About the company
Blitzy is a Boston, MA based Generative AI Start-up with an established office in Pune, India. We are on a mission to automate custom software creation to unlock the next industrial revolution. We're backed by multiple tier 1 investors, have success as founders at the last start-up, and dozens of Generative AI patents to our names.
Our Culture
Our Co-Founder and CTO is a Serial Gen AI Inventor who grew up in Pune, India, is a BITS Pilani graduate, and worked at NVIDIA's Pune office for 6 years. There, he was promoted 5 times in 6 years and was transferred to the NVIDIA Headquarters in Santa Clara, California. After making significant contributions to NVIDIA, he proceeded to attend Harvard for his dual Masters in Engineering and MBA from HBS. Our other Co-Founder/CEO is a successful Serial Entrepreneur who has built multiple companies. As a team, we work very hard, have a curious mind-set, and believe in a low-ego high output approach.
Funding Journey
In September 2024, Blitzy secured $4.4M in seed funding from prominent investors including Link Ventures, Asymmetric Capital Partners, Flybridge, and four other strategic investors, demonstrating strong market confidence in their autonomous software development platform.
Our Values
- We move Blitzy Fast: Time is both our company's and our client's most precious asset. We move fast and fearlessly to innovate internally and deliver exceptional software externally to our clients.
- We have a Championship Mindset: We operate like a professional sports team. We win as a team by holding ourselves and each other to high standards, collaborating in-person, and remaining focused on the mission.
- We have a Passion for Invention: We are inventors at heart. We value starting with best practices and open source, but we are pushing the frontier of what is possible.
- We Work for the Customer: We focus on delivering outsized value to the customers we work with and expanding those relationships to deep, meaningful partnerships.
What We Ask of Candidates
Please ask yourself if you are ready for a challenge before applying. Even in optimal conditions, Start-Ups are hard, and are always a lot of work. What you do week to week will change. If this feels exciting, not concerning, that's a good sign.
Jobs
3
About the company
Jobs
11
About the company
Jobs
3
About the company
Jobs
2
About the company
Ande is an AI-native, full-stack TypeScript platform built on React, Node.js, GraphQL, and Postgres, running on AWS and powering web, mobile, internal operations, deep integrations, and agentic workflows.
Our product sits at the intersection of enterprise workflows, hospitality operations, payments, compliance, procurement, and AI — giving engineers the opportunity to solve problems that combine polished user experiences with complex real-world systems.
Engineering at Ande is deeply product-oriented and systems-heavy. We care about:
- Type safety and shared abstractions
- Fast iteration and observable production systems
- High-quality user experiences
- Building durable foundations for a category-defining platform
PMs and engineers work closely with the business domain, contributing directly to:
- Booking experiences
- Client entertainment policies
- Venue operations
- Spend visibility and approvals
- Payments and procurement workflows
- Enterprise integrations
- AI-driven workflows that reduce manual coordination across enterprises and hospitality partners
Founders
- Lohit Sarma
- Ashish Bidadi
- Michael McDermott
Jobs
0







