mlop

Experiment tracking for training ML models

Spring 2025 · Active · 2025 · Website
AIOps · Developer Tools · Machine Learning · SaaS

Report from 16 days ago

What do they actually do

mlop is an open‑source experiment‑tracking tool for people training ML models. It provides a Python SDK for logging runs, a fast Rust ingestion service, and a web UI. You can sign up for the hosted service and use it today, and the code is publicly available under the mlop GitHub organization (website/docs, docs, GitHub).

Typical use: install the package (pip install "mlop[full]"), call mlop.login() and mlop.init(...) to start a run, log metrics/parameters/artifacts during training, then finish the run. Runs appear in the dashboard, where you can compare experiments with real‑time visualizations; the product emphasizes parameter/gradient‑level tracing, alerts when training looks off, and automated suggestions intended to catch problems early (quickstart, docs, website).
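
The snippet below is a minimal sketch of that loop. mlop.login() and mlop.init() are named in the quickstart; the project/config arguments and the mlop.log()/mlop.finish() call names are assumptions modeled on similar trackers, so check the docs for the exact signatures.

    import random
    import mlop  # pip install "mlop[full]"

    # Authenticate against the hosted service (per the quickstart).
    mlop.login()

    # Start a run. The project/config parameter names here are illustrative assumptions.
    mlop.init(project="demo-run", config={"lr": 1e-3, "epochs": 5})

    for epoch in range(5):
        # Stand-ins for real training/evaluation steps.
        train_loss = 1.0 / (epoch + 1) + random.random() * 0.05
        val_acc = 1.0 - 0.5 * train_loss

        # Log step metrics; the exact logging call may differ in the real SDK.
        mlop.log({"epoch": epoch, "train_loss": train_loss, "val_acc": val_acc})

    # Close out the run so it shows as finished in the dashboard.
    mlop.finish()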

The service targets individual ML developers and small teams (free personal tier) and also sells Pro and Enterprise plans. Enterprise options include self‑hosted deployment, security audits, and higher‑touch support (pricing).

Who are their target customer(s)

  • Individual ML developer training on a laptop or single cloud GPU: Wastes time keeping runs organized and often re‑runs experiments because it’s hard to see what changed or failed; wants a quick way to log and compare runs from Python code (quickstart, pricing).
  • Small startup ML team paying for cloud GPUs: Needs early alerts and clear comparisons so a bad job doesn’t burn budget; wants realtime visibility and simple workflows that don’t slow iteration (docs, website).
  • MLOps / platform engineer at a growing company: Must meet security/compliance needs and keep ingestion reliable at scale; needs self‑hosted options and an architecture that handles high‑volume metrics without throttling teams (pricing: enterprise/self‑hosted, GitHub).
  • ML engineer focused on training/debugging: Needs to find root causes like exploding gradients or bad parameter updates quickly; standard trackers often lack fine‑grained traces and actionable suggestions (docs, GitHub).
  • Engineer responsible for production models/inference: Wants to detect serving issues and tie them back to training; mlop does not yet offer inference monitoring or compute‑to‑experiment correlation, and both are listed as upcoming features (website).

How would they acquire their first 10, 50, and 100 customers

  • First 10: Invite early adopters from the open‑source community and the founders’ network (GitHub stargazers/contributors, YC/ML contacts) to free hosted accounts, with hands‑on setup so each reaches value in a single session (GitHub, quickstart).
  • First 50: Publish simple how‑to notebooks/Colabs and post to ML communities (Slack/Discord, HN, X) so individuals and small teams can try the tool with a pip install; add limited‑time Pro trials and office‑hours webinars to unblock stuck users (quickstart, pricing).
  • First 100: Run targeted outbound to startups and platform/MLOps engineers, offering short self‑hosted pilots and an enterprise checklist (security, integrations, SLA); convert measurable pilot outcomes into case studies and repeatable sales plays (pricing, GitHub, website).

What is the rough total addressable market

Top-down context:

The narrow experiment‑tracking market is estimated at roughly USD 0.48B in 2024 (source). With compute tracking and inference monitoring, mlop also participates in the broader MLOps/observability markets, which are measured in the low billions today and growing quickly (sources, source, source).

Bottom-up calculation:

A seat‑based model: if 50k–100k ML‑active teams adopt a tracker, with 5–10 seats per team at $25–$50 per seat per month for SMBs, plus 2k–5k enterprise deployments at $10k–$50k ARR, the seat‑based spend works out to roughly $95M–$850M per year (worked numbers below); layering in adjacent MLOps/observability spend pushes the addressable figure into the billions.

Assumptions:

  • Global ML‑active teams: 50k–100k; enterprises needing self‑hosted: 2k–5k.
  • Average seats per team: 5–10; SMB per‑seat pricing: $25–$50/month.
  • Enterprise contracts: $10k–$50k ARR depending on scale/compliance needs.
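
As a sanity check, the seat‑based estimate reduces to simple arithmetic; the sketch below just multiplies out the assumption ranges listed above (no mlop API involved).

    # Back-of-the-envelope bottom-up estimate from the assumption ranges above.
    def annual_spend(teams, seats_per_team, seat_price_per_month,
                     enterprise_deployments, enterprise_arr):
        smb = teams * seats_per_team * seat_price_per_month * 12
        return smb + enterprise_deployments * enterprise_arr

    low = annual_spend(50_000, 5, 25, 2_000, 10_000)      # conservative end
    high = annual_spend(100_000, 10, 50, 5_000, 50_000)   # aggressive end

    print(f"low:  ${low / 1e6:,.0f}M / year")   # low:  $95M / year
    print(f"high: ${high / 1e6:,.0f}M / year")  # high: $850M / year

The jump from there into the billions comes from the adjacent MLOps/observability spend noted in the top‑down context, not from tracking seats alone.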

Who are some of their notable competitors

  • Weights & Biases: Market incumbent for experiment tracking and collaboration, with a polished hosted product, automation/alerting, and self‑managed enterprise options; closed‑source and broad in scope (site, self‑managed).
  • MLflow: Open‑source tracking server and model‑lifecycle tool; widely used for simple, portable logging and a model registry, with less emphasis on per‑layer gradients or realtime alerts (tracking docs, server docs).
  • Comet (and Opik): Commercial experiment tracking with a registry and production monitoring; also ships Opik for LLM observability, offering broader evaluation/monitoring features than a pure tracker (Comet, Opik).
  • Neptune.ai: Experiment tracker oriented to high‑scale metric ingestion and a performant UI, with strong self‑hosting support; often chosen when teams need very large‑scale runs and on‑premise deployment (product, self‑hosted).
  • ClearML: Open‑source end‑to‑end ML platform combining tracking with orchestration/autoscaling and serving; closer to a full platform than a focused tracker (docs, server repo).