What happens when AI agent sources conflict?

In most setups, nothing visible. A deep-research or coding agent retrieves several documents, they disagree, and the agent silently picks one and answers in the same confident tone it uses when every source lines up. The conflict, the most useful signal in the trace, gets discarded. Berkeley's MAST taxonomy names task verification one of the three categories of multi-agent failure.

What is a verification gate for agents?

A verification gate is a checkpoint between retrieval and answer. When the retrieved sources agree, the agent proceeds. When they disagree, the gate stops the silent pick: it surfaces the conflict, resolves it against a canonical source of truth instead of a coin flip, and logs which source won and why. It turns an invisible guess into a recorded, reviewable decision.

How do I see which source my agent used?

You need observability at the source-selection step, where the pick happens, rather than at the final output alone. The agent should record, for each answer, which documents it retrieved, which ones disagreed, and which one it trusted. We built yoru for this: it shows what the agent did and which source it chose when they conflicted. Without that log, a wrong-but-confident answer looks identical to a correct one.

Does a verification gate replace agent evaluation?

No. A verification gate is narrow: it handles the moment retrieved sources conflict. It does not tell you whether your agent is drifting better or worse over time, and it does not catch the case where sources quietly agree and are wrong together. You still need evals on a golden set and observability across the whole run. The gate is one control, not the whole system.

What do AI agents do when sources disagree?

tsukumo

What do AI agents do when sources disagree? · tsukumo

Silent pick vs verification gate

Step	Silent pick	Verification gate
On conflicting sources	Picks the top-ranked doc	Surfaces the conflict
How it resolves	Similarity re-rank	Against a canonical source of truth
What you can review later	Final answer only	Which source won and why
A wrong answer looks like	A right answer	A flagged decision

Most agents hide it when their sources disagree

What do AI agents do when their sources disagree?

Why is hidden source disagreement dangerous?#

What does a verification gate look like?#

How do you know which source the agent trusted?#

Does this replace evaluation?#

How we think about it#

How we run a 9-agent growth team on wrai.th (and what broke)

We benchmarked the token cost of rereading docs on our own repo

The canonical-doc layer the 7 agent-memory types miss

Want this running on your team?