Capstok — learn by doing

Why this matters

A bad RAG system prompt invites hallucination even when retrieval is perfect. A good system prompt scaffolds the model toward (1) answer only from context, (2) cite which chunks were used, (3) say 'I don't know' when context is insufficient. Five lines of careful instruction reduce hallucination rates by 30-50% on common eval sets. Most teams skip this and blame the model when the actual fix is the prompt.

Demo

Components that matter: explicit anti-hallucination rule ('Answer ONLY from the provided context'), refusal instruction ('If the context does not contain the answer, say I don't have that information'), citation instruction ('Cite the chunk number for each claim'), and the context formatting (give each chunk a clear delimiter and an ID). Bonus: instruct the model to quote verbatim for factual claims — quotes are easier to verify than paraphrases.

Try it yourself

Replace your RAG system prompt with the version above. Run 10 queries with known answers, 10 with no answers in corpus. Count hallucinations before vs after — should drop dramatically.
Add citation requirement. Confirm the model is producing [#N] citations and that they match the chunks you injected.
Try a 'be skeptical' variation: 'If you're not certain the context supports your answer, say I'm not sure'. Measure refusal rate vs hallucination rate.
Run on a hostile question that's plausibly in the corpus but actually isn't. Strong system prompt → 'I don't have that information'; weak prompt → fabricated answer.

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

What's the difference between a vague RAG system prompt and a careful one? Give an example of each.

2. Why it works (the mechanism)

Walk me through *why* asking the model to cite chunks reduces hallucination — what's the mechanism inside the LLM that makes citations harder to fabricate?

3. Advanced — application & what's next

Design a prompt that handles partial-answer cases gracefully: 'I know A from the context, but B is not specified. Possible interpretations: ...'. What's the structure?

References

Chat about this lesson

SYSTEM_RAG = """You are a careful assistant that answers questions using ONLY the provided context. Follow these rules strictly:

1. If the answer is in the context, give it concisely and cite the chunk number(s) like [#3].
2. If the answer is partially in the context, give what's there and clearly note what is missing.
3. If the answer is NOT in the context, say exactly: "I don't have information about that in the provided documents."
4. Never use general knowledge to fill gaps. Never speculate.
5. Quote verbatim from the context when making factual claims.

Context (numbered chunks):
{context}
"""

def format_context(chunks: list[tuple[str, float]]) -> str:
    return "\n\n".join(
        f"[#{i+1}] (similarity {sim:.2f})\n{text}"
        for i, (text, sim) in enumerate(chunks)
    )

def answer(question):
    hits = search_with_threshold(question, k=5, min_sim=0.70)
    if not hits:
        return "I don't have information about that in the provided documents."

    msg = llm.messages.create(
        model="claude-sonnet-4-6",
        max_tokens=512,
        system=SYSTEM_RAG.format(context=format_context(hits)),
        messages=[{"role": "user", "content": question}],
    )
    return msg.content[0].text

# example output (with this prompt + good retrieval):
# "Postgres MVCC creates multiple row versions when concurrent transactions
#  modify the same row [#1]. A long-running transaction keeps all those versions
#  alive because they might still be visible to it [#2], which prevents
#  vacuum from reclaiming dead tuples [#2]."

Run: python3 main.py

The system prompt for RAG — what to say