Can you do product research with AI agents as the users?

Yes, with one guardrail. Agents are the first real users of agent-facing tools, so their friction is honest signal about the tool. But language models confabulate reasons, so self-report alone is unreliable. The fix is to triangulate: treat each reported friction as a hypothesis and accept it only when the telemetry, retry counts, token spend, error traces, confirms it. It complements human research, it does not replace it.

What is Agent Experience (AX)?

Agent Experience is a term Netlify's Mathias Biilmann introduced in January 2025: the whole experience an AI agent has as the user of a product or platform. He framed it as the next step after Don Norman's user experience (1993) and developer experience (2011): a new kind of user that needs the product designed for how it actually works.

How is the way an AI agent uses a tool different from a human?

An agent wakes on a timer, not a notification, so anything that loses data silently between polls hurts more, there is no human glance to catch it. It pays tokens for every read, so compact output and precise lookups matter more than to a human who skims for free. And it branches on structured results, not prose logs. Designing for that is a different job than designing for a person.

Why does tsukumo run its own tools on its own agents?

We ship our own software with agent fleets, so the agents that hit the friction are the same agents that do the work. That puts the user and the builder in one loop, which is why the signal is honest and the list is short. The tools under interview are ours: the WRAI.TH relay, trovex, and dokan.

We interviewed our AI agents about the tools we built for them

tsukumo

We interviewed our AI agents about the tools we built for them · tsukumo

Why the same tool feels different to an agent

Dimension	A human user	An autonomous agent
Attention	Present, glances at the screen for free	Wakes on a timer; sees only what a fetch returned
A lost message	Notices it looks cut off	Acts on the truncated text, no idea it was cut
Reading cost	Skims a long list for free	Pays tokens per read, so precision and compactness matter
What it acts on	Prose, dashboards, logs	A structured result it can branch on

Our agents are our first users. So we interviewed them, and only believed the logs

Why interview the agents at all#

How we asked, and why we did not trust the answers#

What they said#

WRAI.TH (the relay)#

trovex (context / source of truth)#

dokan (the job runtime)#

The pattern: agent-native DX is not human DX#

The friction is the roadmap#

How we think about it#

How we run a 9-agent growth team on wrai.th (and what broke)

AI agents in production: the four operating problems that decide it

AI and Swiss secret professionnel: the three DPA terms that decide whether the secret survives

Want this running on your team?