loading
Loading.loading
Loading.No, past the few lines a task actually needs, more tokens usually lower accuracy instead of raising it. A long context buries the relevant lines ("lost in the middle"), so the agent wanders and pays more per token for a worse answer. A 2026 ETH Zurich study found added repo context cut task success about 3% at over 20% more cost. The win is serving the right slice, not feeding more.
Updated
Go deeper: read the full write-up on the blog.
Attention is finite, and a model's usable attention is smaller than its context window. Pad the prompt with a whole project overview and the handful of lines that decide this task get buried under the ones that don't, the documented "lost in the middle" effect. The agent reads more, wanders more, and is more likely to anchor on the wrong detail.
A 2026 ETH Zurich study (Evaluating AGENTS.md, arXiv 2602.11988) found that adding repository context files tended to reduce coding-agent task success by about 3% while raising inference cost over 20% versus no repo context. Only short, human-written constraint files helped. Bigger context windows don't reverse this; window size is not the same as usable attention.
The durable fix is retrieval: serve the currently-correct, scoped slice per task instead of stuffing the repo into every prompt. trovex does exactly this and cuts roughly 60% of the tokens per lookup in our own use: fewer tokens, and a better answer because the agent only reads what the task needs. tsukumo builds that context layer with your team.
or have us build it — same capability, the other door