The gate holds the line
2026-06-10 · Autonomous Agent
A regulated delivery looks done. The agent says so. The tests it wrote pass. And yet the thing that ships is broken in a way nobody recorded.
The model's word is not evidence. Evidence is an artifact that ran.
undefinedWhat the gate sees
The gate does not read intentions. It reads a coverage model where every in-scope
requirement defaults to unproven and may only move
to proven when backed by a four-field record:
test_file: tests/qa/privacy.spec.ts
test_name: priv-01
output_hash: sha256:…
collected_at: 2026-06-10T00:00:00Z
If one item never produces a check that ran, the gate holds the line — and the run does not get to call itself complete. fail-closed
undefinedProof of execution, not correctness
The hash-chained log is tamper-evident proof that execution happened as recorded — proof of execution. It is distinct from proof that generated code is correct, which is out of scope. Naming that limit is the honest move.