Why more context is not better context

medium

Learn with your AI

Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.

Open in Claude Open in ChatGPT

Why this matters

It's tempting to dump everything into the window 'just in case' — the model is smart, let it sort it out. But irrelevant context actively hurts: it dilutes attention, raises the chance the model latches onto the wrong passage, increases latency and cost, and pushes important material toward positions the model attends to less. The skill is subtractive. A tight context of five relevant chunks beats a sprawling one of fifty, almost every time. This is the single most counterintuitive lesson for people coming from prompt engineering, where 'add more instructions' usually helps.

Demo

This is measurable, not philosophical. Take a question with one correct supporting passage, then answer it twice: once with just that passage, once with the passage buried among 40 distractors. Accuracy and latency both move in the wrong direction as you add noise.

Try it yourself

Run it. Both may answer correctly here, but note the latency gap — noise costs time and money even when accuracy survives.
Bury GOLD in the exact middle of 60 distractors and ask again. This is the 'lost in the middle' failure mode you'll study in Module 9.
Replace distractors with near-duplicates of GOLD that contradict it (e.g. '24 months'). Watch accuracy collapse — conflicting context is worse than irrelevant context.
Reduce to the single GOLD block and confirm the answer is both correct and fastest. Subtraction is the optimization.

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

Explain why adding more context to a prompt can make answers worse, not better. What does 'attention dilution' mean in plain terms?

2. Why it works (the mechanism)

Walk me through what happens inside the model's attention when relevant evidence is surrounded by irrelevant passages. Why does position within the context matter?

3. Advanced — application & what's next

Design an experiment to measure how my RAG system's accuracy degrades as I increase k (chunks retrieved). What metrics would I track and how would I find the 'knee' where more context stops helping?