Build vs. buy vs. API

medium

Learn with your AI

Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.

Open in Claude Open in ChatGPT

Why this matters

There are three ways to get AI into your product — call a hosted API, run an open model yourself, or fine-tune/train something custom — and for 95% of product teams the right first answer is 'call the API'. Builders waste enormous effort self-hosting or fine-tuning before they've proven the feature works at all, when a hosted frontier model would have validated the idea in an afternoon. The decision hinges on volume, latency, privacy, and how differentiated the model needs to be. Knowing where each option pays off keeps you from over-engineering the plumbing before you've earned the right to optimize it.

Demo

A decision routine over the few variables that actually matter — volume, data sensitivity, latency floor, and whether a generic model is good enough — that lands on API, self-host, or fine-tune.

Try it yourself

Run the decision for your feature at today's volume, then at 100x volume.
Note the volume threshold where self-hosting starts to beat per-call API pricing.
List what would have to be true for fine-tuning to be worth it (hint: almost nothing, yet).
Check whether your data sensitivity needs a zero-retention API tier rather than self-hosting.

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

In one paragraph, explain the difference between using a hosted AI API, self-hosting a model, and fine-tuning your own.

2. Why it works (the mechanism)

Walk me through how to decide between calling an API and self-hosting an open model for a product feature.

3. Advanced — application & what's next

At what point does self-hosting or fine-tuning actually beat a hosted API for a product team, and how would I know I've reached it?