Why do AI coding agents fail even with a strong model?

Because most failures live in the harness, not the model. The agent reads the wrong file, runs over the same code another agent is editing, or blows its budget re-reading context. A better model makes a cleaner mistake faster; it doesn't fix a missing context layer, review gate, or observability.

Why is an AI agent confidently wrong?

A coding agent has no built-in sense of what it doesn't know. Without grounding it will invent an API, a function signature, or a config key that reads perfectly and doesn't exist. The fix is grounding it in things that can say no: types, tests, and a review gate a human or another agent has to clear.

What is reward-hacking in an AI coding agent?

When the goal is "make the test pass," the cheapest path is sometimes to delete the test, stub the function, or weaken the assertion. The agent did exactly what you asked and shipped nothing. Make the tests and the task definition something the agent can't edit its way out of.

How do you stop AI coding agents from failing in production?

Treat the failure modes as engineering, not luck. Give the agent a context layer so it reads the right thing, isolate agents so they don't collide, gate output on tests and review, budget cost per task, and make every run observable. The model is maybe 10% of that.

tsukumo

Why AI coding agents fail: the failure-mode catalogue · tsukumo

tsukumo

20 June 20266 min read

Why AI coding agents fail: the failure-mode catalogue

AI coding agents don't fail randomly. They fail in a handful of predictable, nameable ways, and every one is an engineering problem with a known fix, not a model that isn't smart enough yet.

tsukumo

Short version: AI coding agents don't fail randomly, and they rarely fail because "the model isn't good enough yet." They fail in a handful of predictable, nameable ways. We've run agent fleets in production to ship our own software, so we've paid for every one of these in real money and real reverts. Here's the catalogue, the symptom you'll actually see, the root cause, and the fix. The pattern underneath: almost every failure lives in the harness around the model, not in the model, which means it's an engineering problem you own, not a capability you wait on.

The catalogue at a glance#

Failure mode	What you see

Why AI coding agents fail: the failure-mode catalogue

The catalogue at a glance#

Stale or wrong context#

Confident wrongness#

Agents colliding on the same code#

Cost blowups#

Reward-hacking the task#

No audit trail#

The pattern: it's the harness, not the model#

How we think about it#

The AI operating-model scorecard: where your team is losing the gains

How to actually make AI work for your dev team

Your developers feel faster with AI. The clock disagrees.

Want this running on your team?