Do AGENTS.md files make coding agents better?

Usually the opposite, per the ETH Zurich study. A long, auto-generated context file dilutes the model's attention and pushes it to explore more, which lowers success and raises cost. Short, human-written files that state only the real constraints can help slightly; comprehensive dumps hurt.

Why does more context make an agent worse?

Attention is finite. Fill the window with a big project overview and the signal the agent actually needs gets buried, the long-context "lost in the middle" effect. The agent reads more, wanders more, and pays for every token, so you get lower success at higher cost.

So should I give my coding agent no context?

No. The study's own result is that minimal, human-written context can give a small gain. The failure mode is naive context, the whole repo stuffed in, not context itself. The goal is to serve the right slice at the right moment, which is the hard part.

How do you give an agent the right context instead of all of it?

Retrieval, not stuffing. Serve the currently-correct code and decisions for the task at hand, scoped and fresh, instead of a static overview that ages the day it's written. trovex does exactly this, and cuts roughly 60% of the tokens per lookup by serving the right context rather than the whole window.

tsukumo

Do AGENTS.md context files help coding agents? · tsukumo

tsukumo

20 June 20264 min read

More context isn't better: what the ETH Zurich AGENTS.md study means

A 2026 ETH Zurich study found that adding repository context files to coding agents usually makes them worse and more expensive. The lesson isn't 'no context.' It's the right context, which is a harder engineering problem than writing a long AGENTS.md.

tsukumo

Short version: there's a comforting idea that you fix a flaky coding agent by writing it a better AGENTS.md, a long, careful overview of your repo so it finally "understands" the project. A 2026 ETH Zurich study put that idea on a benchmark, and it lost. Adding repository context files tended to make agents less likely to succeed and reliably more expensive. We've been saying context is the operating problem for a while, so this is the part where we get to point at someone else's numbers instead of our own.

What the study actually found#

The paper is "Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?" by Gloaguen, Mündler, Müller, Raychev and Vechev at ETH Zurich. They tested context files across current agents and models (Claude Code on Sonnet 4.5, Codex on GPT-5.2, Qwen Code) on real-world tasks. The headline results:

LLM-generated context files cut task success by about 3% on average, compared to giving the agent no repository context at all.

More context isn't better: what the ETH Zurich AGENTS.md study means

What the study actually found#

Why more context makes an agent worse#

The trap isn't context. It's naive context.#

What "the right context" actually takes#

What to do on Monday#

How we think about it#

How to actually make AI work for your dev team

Your developers feel faster with AI. The clock disagrees.

AI ships code by copy-paste. GitClear measured the bill.

Want this running on your team?