When NOT to fine-tune

medium

Learn with your AI

Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.

Open in Claude Open in ChatGPT

Why this matters

Just as important as the green lights are the red ones. Don't fine-tune when your facts change often (RAG), when you have fewer than a hundred examples (prompt instead), when a stronger base model with a good prompt already clears the bar, when you can't build an eval to prove it worked, or when you haven't yet exhausted prompt engineering. Recognizing these anti-patterns up front is what keeps you from a multi-week project that ends with a model no better than your starting prompt. This is the 'stop' list every fine-tuning decision should pass through.

Demo

The demo is a pre-flight checklist that returns the reasons you should NOT fine-tune yet — if any fire, fix that first.

Try it yourself

Run the checklist against your task and list every reason that fires.
Take a 'too few examples' result and design the few-shot prompt you'd use instead.
Take a 'no eval' result and sketch the smallest eval set that would unblock you.
Find a project that passes all four checks and explain why it's genuinely a fine-tuning candidate.

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

In one paragraph, when should I NOT fine-tune a model?

2. Why it works (the mechanism)

Walk me through the warning signs that mean I should reach for prompting or RAG instead of fine-tuning.

3. Advanced — application & what's next

I have 60 examples, facts that update weekly, and no eval set, but my boss wants a fine-tune. Make the case for what to do instead and how to sequence it.