init step installs project-scoped subagent templates for both Codex and Claude Code. Each template is intentionally narrow and role-specific. The roles stay separate to keep implementation, judgment, and correction in distinct hands — this reduces self-justification and makes failures easier to localize.
Installed files
- Codex
- Claude Code
In Claude Code, if
init just created or refreshed .claude/agents/* during a running session, start a new Claude Code session before relying on those updated agents. Use /agents to confirm the available agent list.Role definitions
task-spec-freezer
Purpose: Freeze the task into .agent/tasks/<TASK_ID>/spec.md.
Hard boundaries:
- May read repo guidance and relevant code
- Must not change production code
- Must not write verdict or problems files
task-builder
Purpose: Implement the task and later pack evidence.
Modes: BUILD and EVIDENCE
Hard boundaries:
- In
BUILDmode: implement against the spec, make the smallest safe change set, keep unrelated files untouched, do not writeverdict.jsonorproblems.md - In
EVIDENCEmode: do not change production code; writeevidence.md,evidence.json, and raw artifacts
task-verifier
Purpose: Fresh-session verification against the current codebase.
Hard boundaries:
- Must not edit production code
- Must not patch the evidence bundle to make it look complete
- Must write
verdict.json - Must write
problems.mdonly when the verdict is notPASS
task-fixer
Purpose: Repair only what the verifier identified.
Hard boundaries:
- Must reread the spec and verifier output
- Must reconfirm the problem before editing
- Must regenerate evidence after the fix
- Must not write final sign-off
Invocation patterns
- Codex
- Claude Code
Use explicit delegation language. The parent asks Codex to spawn one named child, waits for it, and then continues. Do not spawn any child until Builder:Verifier:Fixer:Repeat the same delegation pattern for each role.
init <TASK_ID> has finished and .agent/tasks/<TASK_ID>/spec.md exists. Do not batch init with other commands or tool calls.Keep delegation depth flat — one child per role at a time.Spec freezer:Same-session evidence packing
The preferred pattern is:- Spawn
task-builder - Let it implement (
BUILDmode) - Continue with the same child in
EVIDENCEmode
task-builder child with an explicit EVIDENCE-ONLY prompt if the original builder session is unavailable or you intentionally want a fresh evidence-only run.
Follow-up prompt to the same builder session:
Why the roles stay separate
The workflow keeps three concerns apart:- Implementation — building the code
- Judgment — evaluating whether the code meets the spec
- Correction — fixing only what the judgment found wanting