Skip to main content

Live Dogfood Proof: tusq.dev Full-Auto

This page is the public evidence for AgentXchain's strongest claim to date: three full governed runs completed autonomously on a real production codebase using the shipped [email protected] CLI with --on-idle perpetual.

Unlike the Live Continuous 3-Run Proof which uses a synthetic test project, this dogfood exercises the full product against tusq.dev — a real SaaS capability compiler with its own VISION.md, ROADMAP.md, and SYSTEM_SPEC.md.

Date of proof: 2026-04-24

What It Is

This is a dated proof that AgentXchain can govern three full autonomous runs on a real production codebase (tusq.dev) using the shipped [email protected] CLI with perpetual idle-expansion. The dogfood loop discovered and shipped 7 framework gaps across patch releases v2.155.2 through v2.155.10.

What It Proves

ClaimEvidence
Perpetual idle-expansion works end-to-endPM idle-expansion turns dispatched, produced structured idle_expansion_result, accepted through intake pipeline, and chained into governed runs
All four phases complete autonomouslyEach run traversed planning, implementation, QA, and launch with no agent-level failures
Only human gates require manual unblockingplanning_signoff, qa_ship_verdict, and launch_ready gates required human approval — all agent turns accepted on first attempt
Zero reliability failures post-fix65 turn_dispatched events, 19 accepted turns, zero runtime_spawn_failed, stdout_attach_failed, ghost_turn, or startup_watchdog_fired signals
Dogfood-driven development works7 gaps discovered and shipped as patches (v2.155.2 through v2.155.10) during the session

Setup

The dogfood used a clean linked git worktree to avoid contaminating the operator's working baseline:

git worktree add -b agentxchain-dogfood-2026-04 \
"../tusq.dev-agentxchain-dogfood" origin/main

Full-auto approval policy configured via shipped CLI:

agentxchain config --set approval_policy.phase_transitions.default auto_approve
agentxchain config --set approval_policy.run_completion.action auto_approve
agentxchain validate

Run It

To reproduce a similar dogfood loop on your own project, follow the setup above and run the exact command below.

Workflow Or Proof Shape

The proof exercises the full governed lifecycle in perpetual mode:

  1. Idle expansion: PM synthesizes next increment from VISION/ROADMAP/SYSTEM_SPEC
  2. Intake pipeline: structured idle_expansion_result flows through record → triage → approve → plan → start
  3. Governed run: agents take structured turns through planning → implementation → QA → launch
  4. Human gates: planning_signoff, qa_ship_verdict, launch_ready require manual approval
  5. Loop: on completion, perpetual mode dispatches next idle-expansion

Exact Command

npx --yes -p agentxchain@latest -c \
'agentxchain run --continuous \
--vision .planning/VISION.md \
--max-runs 5 \
--max-idle-cycles 3 \
--poll-seconds 5 \
--triage-approval auto \
--auto-checkpoint \
--on-idle perpetual'

Evidence Summary

MetricValue
Target projecttusq.dev (SaaS capability compiler)
Branchagentxchain-dogfood-2026-04
CLI version[email protected]
Runs completed3 (planning → implementation → QA → launch each)
Turn dispatches65
Turns accepted19
runtime_spawn_failed0
stdout_attach_failed0
ghost_turn0
startup_watchdog_fired0
Idle-expansion dispatches5
run_completed events5
Gaps discovered and shipped7 (GAP-001 through GAP-007)

Completed Runs

Run IDCLI VersionOutcome
run_71b762f4405c0fc5v2.155.3M28 Sensitivity Class Inference proposed via idle expansion. Dev by Claude Sonnet 4.6, QA and launch gates approved.
run_e7c2e5668d6cfb6av2.155.10First final-package full governed run after embedded normalization fix.
run_4e38dc0248fb17e6v2.155.10Perpetual idle-expansion dispatched and accepted next PM intent, then completed all phases.
run_6464f8d17e8dedcdv2.155.10Third full governed run — success criteria met.

Gaps Discovered and Shipped

The dogfood loop's core value: every framework gap becomes a patch release, and the dogfood retries on the fix.

GapFixed InIssue
GAP-001v2.155.2Idle-expansion dispatched before proving inherited run eligibility
GAP-002v2.155.3Reconcile refused SESSION_RECOVERY.md edits (safe-path allowlist added)
GAP-003v2.155.6Sidecar idle-expansion-result.json not loaded by acceptor
GAP-004v2.155.7Charter named result concept but never specified JSON schema or output location
GAP-005v2.155.8Charter/validator schema mismatch (object vs array traceability)
GAP-006v2.155.9Intent coverage missed top-level structured-result acceptance item
GAP-007v2.155.10Embedded idle_expansion_result bypassed sidecar normalization

Key Takeaways

This proof demonstrates the governed software factory model working on a real codebase:

  1. Perpetual mode works. When the work queue empties, the PM synthesizes the next increment from VISION/ROADMAP/SYSTEM_SPEC, produces a structured intake intent, and the run loop chains into the next governed run.

  2. Self-healing through dogfood. Seven real gaps were discovered, fixed, shipped, and retried — all within a single session. The gap-to-fix-to-retry loop is the product's compounding feedback mechanism.

  3. Human sovereignty preserved. Human approval gates (planning_signoff, qa_ship_verdict, launch_ready) required manual unblocking. The agents handle execution; humans retain authority over direction and ship decisions.

  4. Reliability is proven at scale. Zero spawn/attach/ghost/watchdog failures across 65 dispatches and 19 accepted turns on the final shipped package.

Internal Evidence

Full evidence chain lives in the AgentXchain.dev repo at .planning/dogfood-tusq-dev-evidence/:

  • session-2026-04-24.md — session log
  • README.md — evidence index
  • GAP evidence files for each discovered gap
  • Raw CLI output, status snapshots, event scans under raw/