What's the difference between one agent and an agent fleet?

One agent doing a task is a script. A fleet is a coordination problem: several agents working in parallel on different parts of the codebase without colliding or duplicating work. The jump from one to many is mostly orchestration and observability, not a bigger model.

Why do agent fleets fail in production?

Usually because one of the four layers (context, orchestration, observability, operating model) is missing. Most often it's observability, so the team runs on faith instead of evidence, or the operating model: developers never learned to operate agents, so the tools get routed around. A missing context layer makes agents expensive and confidently wrong.

Do you need a bigger model or better prompts to run a fleet?

No. Those help at the margin. The fleet stands or falls on context, orchestration, observability, and an operating model your devs run. It's ordinary production engineering, which means it's buildable and yours to keep.

Can buying more Claude or Copilot seats get us to a production fleet?

Seats give your team model access. None of the four layers comes in the box. Crossing from copilot to a running fleet is an operating problem, not a license problem.

Running agent fleets in production: what it takes

tsukumo

Buying seats vs running a fleet

What you get	Buy more model seats	A running fleet
Context layer	every agent rereads the repo	canonical answer served cheaply
Orchestration	agents collide on shared files	locks, rebase rules, ticket ownership
Observability	trust a black box with commit access	you know what each agent did and cost
Operating model	devs route around tools they don't trust	devs operate the fleet, gains are theirs

Running agent fleets in production: what it actually takes

1. Context the agents can trust
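
As a rough illustration of "canonical answer served cheaply" versus "every agent rereads the repo", here is a minimal sketch of a shared answer cache keyed by commit and question. Everything named here (`ContextCache`, `slow_repo_scan`, the commit ids) is hypothetical and stands in for whatever the real context layer does; it is not a description of any particular product.

```python
from typing import Callable


class ContextCache:
    """Hypothetical shared cache: one expensive lookup per (commit, question)."""

    def __init__(self, compute: Callable[[str], str]) -> None:
        self._compute = compute
        self._answers: dict[tuple[str, str], str] = {}
        self.misses = 0  # how many times the expensive path actually ran

    def ask(self, commit: str, question: str) -> str:
        key = (commit, question)
        if key not in self._answers:
            # Only the first agent to ask pays for the full lookup.
            self.misses += 1
            self._answers[key] = self._compute(question)
        return self._answers[key]


def slow_repo_scan(question: str) -> str:
    # Stand-in for an agent rereading the repository (assumed to be expensive).
    return f"answer to: {question}"


cache = ContextCache(slow_repo_scan)

# Three agents ask the same question at the same commit: one scan, three answers.
answers = [cache.ask("abc123", "where is auth configured?") for _ in range(3)]

# A new commit invalidates the old answer by changing the key.
after_new_commit = cache.ask("def456", "where is auth configured?")
```

Keying on the commit is one possible design choice: it keeps answers canonical for a given state of the code without needing explicit invalidation.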

2. Orchestration so the fleet doesn't collide
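
The comparison table lists locks and ticket ownership as what keeps agents from colliding on shared files. A minimal in-memory sketch of that idea follows, assuming an all-or-nothing claim on file paths; `LockRegistry`, the agent names, and the paths are illustrative assumptions, and a real fleet would need durable, shared storage rather than a Python dict.

```python
from dataclasses import dataclass, field


@dataclass
class LockRegistry:
    """Hypothetical map of file path -> agent that currently owns it."""

    owners: dict[str, str] = field(default_factory=dict)

    def claim(self, agent: str, paths: list[str]) -> bool:
        """Claim every path for the agent, or none if another agent holds one."""
        if any(self.owners.get(p, agent) != agent for p in paths):
            return False
        for p in paths:
            self.owners[p] = agent
        return True

    def release(self, agent: str) -> None:
        """Drop every lock the agent holds, e.g. once its change has landed."""
        self.owners = {p: a for p, a in self.owners.items() if a != agent}


registry = LockRegistry()

first = registry.claim("agent-a", ["src/api.py", "src/models.py"])
# agent-b overlaps on src/models.py, so its whole claim is refused.
second = registry.claim("agent-b", ["src/models.py", "src/cli.py"])

registry.release("agent-a")
third = registry.claim("agent-b", ["src/models.py", "src/cli.py"])
```

The all-or-nothing claim is deliberate in this sketch: a partial claim would leave an agent holding some files while blocked on others, which is one way two agents can end up waiting on each other.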

3. Observability so you run on evidence
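
Knowing "what each agent did and cost" can be pictured as an append-only ledger of agent actions. The sketch below is one possible shape under stated assumptions: the `AgentEvent` fields, the `Ledger` class, and the dollar figures are all made up for illustration and do not come from any real tool.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentEvent:
    """One recorded action (hypothetical schema)."""

    agent: str
    action: str
    target: str
    cost_usd: float


class Ledger:
    """Append-only record of what each agent did and what it cost."""

    def __init__(self) -> None:
        self.events: list[AgentEvent] = []

    def record(self, event: AgentEvent) -> None:
        self.events.append(event)

    def cost_by_agent(self) -> dict[str, float]:
        totals: defaultdict[str, float] = defaultdict(float)
        for e in self.events:
            totals[e.agent] += e.cost_usd
        return dict(totals)

    def actions_for(self, agent: str) -> list[str]:
        return [f"{e.action} {e.target}" for e in self.events if e.agent == agent]


ledger = Ledger()
ledger.record(AgentEvent("agent-a", "edit", "src/api.py", 0.25))
ledger.record(AgentEvent("agent-a", "commit", "fix-login", 0.5))
ledger.record(AgentEvent("agent-b", "edit", "src/cli.py", 0.125))

costs = ledger.cost_by_agent()
history = ledger.actions_for("agent-a")
```

With a record like this, questions such as "which agent touched this file" or "what did this ticket cost" become lookups rather than guesses.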

4. An operating model your devs actually run

It's mostly engineering, not prompting

How we do it
