recordingtest/.claude/commands/evaluate.md

---
name: evaluate
description: Grade a completed module against its Sprint Contract via the evaluator agent. Usage /evaluate <contract-slug>
allowed-tools: Read, Glob, Grep, Bash, Agent
---

Evaluate module: `$ARGUMENTS`

Delegate to the **evaluator** subagent. It must:

1. Read `docs/contracts/$ARGUMENTS.md`. Refuse if missing.
2. For each Definition-of-Done item, run the verification named in the contract's Evaluation plan.
3. Collect evidence (command output, diffs, file paths).
4. Write `docs/contracts/$ARGUMENTS.evaluation.md` with the verdict table.
5. Return the verdict to the caller.

If verdict is **fail**, do NOT mark PROGRESS.md as done — report back so the generator can iterate.
If verdict is **pass**, the caller (not the evaluator) may update PROGRESS.md.

Never let the generator and evaluator be the same agent in a single session.