Introduction

Repo Task Proof Loop is a repo-local workflow skill for non-trivial coding tasks. It creates a durable task folder under .agent/tasks/<TASK_ID>/, installs project-scoped Codex and Claude subagents, updates repo guidance, and drives a strict loop:

spec freeze → build → evidence → fresh verify → minimal fix → fresh verify

What it solves

Three problems make non-trivial coding tasks hard to track and audit:

Proof lives outside the repo. Results, test output, and screenshots disappear when a session ends.
Implementation and verification are mixed. The same agent that writes code also declares it correct, with no independent check.
Task state is opaque. Picking up a half-finished task requires reconstructing context from scratch.

Repo Task Proof Loop solves all three. Every artifact the workflow produces — spec, evidence, verdict, raw output — is committed inside the repository. Verification always runs in a fresh session that has not seen the implementation. And because every phase writes to a fixed set of files, any task can be resumed or audited at any point.

What it creates

Running init creates the following structure inside the target repository:

.agent/tasks/<TASK_ID>/
  spec.md
  evidence.md
  evidence.json
  raw/
    build.txt
    test-unit.txt
    test-integration.txt
    lint.txt
    screenshot-1.png
  verdict.json
  problems.md

.codex/agents/
  task-spec-freezer.toml
  task-builder.toml
  task-verifier.toml
  task-fixer.toml

.claude/agents/
  task-spec-freezer.md
  task-builder.md
  task-verifier.md
  task-fixer.md

It also inserts managed workflow blocks into:

AGENTS.md
The repo’s Claude guide file: CLAUDE.md or .claude/CLAUDE.md

The managed blocks are replaced in place on re-run, so any user-authored content outside the block markers is preserved.

Compatibility

Requirement	Details
Python	3.10 or later (for bundled scripts)
Codex	Project skill at `.agents/skills/repo-task-proof-loop/`
Claude Code	Project skill at `.claude/skills/repo-task-proof-loop/`

The skill can be used with either tool independently, or with both on the same repository. The initialization script writes repo-local workflow files into the target repository, not into the skill directory.

Quick Start

Go from zero to a completed verified task as fast as possible.

Workflow

Understand the full spec-freeze → build → evidence → verify → fix loop.

Get Started

Core Concepts

Guides

What it solves

What it creates

Compatibility

Quick Start

Workflow

Build docs developers (and LLMs) love

Get Started

Core Concepts

Guides

​What it solves

​What it creates

​Compatibility

Quick Start

Workflow

Build docs developers (and LLMs) love

What it solves

What it creates

Compatibility