№ 013agents · building filed may '26

Build a software factory with Claude Code.

Seven specialized agents. Three human checkpoints. Features that ship while you sleep.

You thought you were using AI to code. You were typing faster.

That's the gap most builders haven't named yet — and it's the entire reason Claude Code sessions plateau around month two. Here's the difference, and the seven-agent system that closes it.

Save this one. It's going to save you months.

01

The loop that feels productive but isn't.

Watch the shape of a normal Claude Code session. You ask Claude to build a feature. It generates code. Something breaks. You paste the error back. It patches it. Something else breaks. You ask again. By the end of the day you've shipped something — but you've also touched the same files seven times, the conventions you set up two weeks ago are quietly broken, and a feature in another part of the codebase is now subtly worse than it was this morning.

On day one this feels like magic. By day thirty you're spending more time supervising AI than you used to spend writing code. Same logic appears in three different places. Claude forgot the pattern you established. New features break old ones. Tests are missing or shallow.

You wake up and realize: the AI isn't failing. Your workflow is.

02

The shift: from vibe coding to a software factory.

Real engineering teams don't work in one big conversation. Different people own different jobs. Someone clarifies the user problem. Someone thinks about architecture. Someone builds the API. Someone builds the UI. Someone considers edge cases. Someone reviews.

When you collapse all of that into one AI session, mistakes compound silently. The fix is to split the work across specialized agents — each with one focused job, its own clean context window, only the tools it actually needs, and strict rules about what it cannot touch.

The result is a software factory. One developer plus seven focused agents equals a coordinated team.

03

The seven agents, at a glance.

The deep-dive on each agent — their inputs, outputs, and the rules they cannot break — lives in the Advanced tab. Here's the working reference you can keep open while you build.

The factory seven jobs, one developer, three checkpoints
Agent Job Tools Cannot touch
№ 01Researcher Maps the code before anything is built. Finds patterns, similar features, risks. Read, Grep, Glob Any file. Read-only.
№ 02Story Writer Idea → user story with acceptance criteria, edge cases, out-of-scope. Read Code. Business rules it doesn't know.
№ 03Spec Writer Story → technical brief. Data model, API, files that will change, risks. Read, Grep, Glob Any file. Cannot invent infra.
№ 04Backend Builder API routes, services, jobs, migrations, unit tests. Backend half only. Read, Edit, Write, Bash — backend folders Frontend. New dependencies without instruction.
№ 05Frontend Builder Components, pages, hooks, UI tests. Consumes the backend's API summary. Read, Edit, Write, Bash — frontend folders Services, routes, workers, migrations.
№ 06Test Verifier Acceptance tests against the user story. Reports pass / fail / not-coverable. Read, Edit, Write (test files), Bash Production code. Inventing workarounds.
№ 07Validator Reports gaps against story + brief. Never fixes. Just tells the truth. Read, Grep, Glob Anything. Read-only by design.
04

How the chain actually runs.

You open Claude Code and type: "Build invoice reminders for invoices unpaid for more than 7 days." That's it. Here's what happens without you typing anything else.

  1. 01

    Researcher maps the code.

    Pulls the invoice, payment, and email files. Returns relevant patterns and risks. No code written.

  2. 02

    Story Writer drafts the user story.

    Acceptance criteria. Edge cases. Out-of-scope. Open questions.

  3. 03pause

    You approve the story.

    First human checkpoint. The single approval that prevents 80% of downstream rework.

  4. 04

    Spec Writer drafts the technical brief.

    Data model, API, files that will change, risks. Every file that will be touched, named.

  5. 05pause

    You approve the brief.

    Second human checkpoint. Highest-leverage approval in the whole chain. If you see "store IDs in memory" — catch it here.

  6. 06

    Backend Builder ships the backend.

    Service. API route. Background job. Unit tests. Returns: files changed, patterns reused, all tests green.

  7. 07

    Frontend Builder ships the UI.

    Reads the backend summary first. Builds the admin UI and the reminder button. Component tests. All green.

  8. 08

    Test Verifier writes the acceptance tests.

    One test per acceptance criterion. Reports: 7 passing, 1 failing — manual trigger doesn't check tenant ownership.

  9. 09

    Validator catches it.

    Reports as Critical with file path and line number. Chain loops back to Backend Builder. Fix applied. All 8 tests pass. Validator re-runs. Clean.

  10. 10pause

    You review and open the PR.

    Third checkpoint. Three human approvals total. Everything else ran on its own.

05

The take.

The factory doesn't remove you from the process. It removes you from the parts that don't need you. You stay in the loop where your judgment actually matters: Is this the right problem? Is this the right design? Is this safe to ship?

The agents handle everything between. That's the difference between using AI as a faster keyboard — and using AI as a coordinated team.

Decision rules three signals worth acting on
If you see… Do this Don't do this
Same logic appearing in 3+ files Stop. Grep for an existing helper. Reuse or extract. Add a CLAUDE.md rule so the next session doesn't repeat the duplication. Write a fourth copy.
Claude forgot a pattern you set last week Add it to CLAUDE.md. Throw the conversation away. Start fresh with the rule baked in. Patch the conversation with "remember to…"
Acceptance test fails after a green build Trust the test. Send it back to the right builder with the failing criterion quoted. Validator confirms after. Edit the test until it passes.

Want this wired into your actual codebase — not a blank .claude/ folder? That's the kind of build the audit + retainer relationship was made for. austinaiguy.com.