How do I measure the ROI of AI coding agents?

Against your own baseline, on outcomes. Ask whether the team ships more of the work that matters, at held quality, with the same people, and whether your cost per unit of work went down. Avoid single productivity percentages; they're usually invented. A handful of honest signals beats one impressive-looking number.

What AI productivity metrics should I ignore?

Lines of AI-written code, suggestion-acceptance rate, percent of code written by AI, and raw commit or PR counts. All measure activity, all are easy to inflate, and none tells you whether anything valuable shipped or whether quality held.

Is 'percent of code written by AI' a good metric?

No. It rewards volume, not value, and it's trivially gamed. A team can raise it while shipping more low-quality code that costs more to review and fix. Measure outcomes shipped at held quality instead.

Why can't a vendor just give me a single ROI number for AI?

Because an honest one comes from your codebase, your baseline, and your definition of the right work, not a benchmark. Anyone quoting a clean universal percentage is selling, not measuring. The real read is a few signals tracked against where you started.

Measuring AI's impact, without vanity metrics

tsukumo

Measuring AI's impact, without vanity metrics · tsukumo

What a vendor counts vs what your business feels

Question	Vanity metric	Honest signal
What it tracks	Activity (lines, accepts, commits)	Outcomes shipped at held quality
Easy to inflate	Yes, trivially	No, it's gated on quality
Survives a bad quarter	Can still look great	Falls when the work slips
Comes from	A vendor benchmark	Your baseline, your codebase

Measuring AI's impact in production, honestly (no vanity metrics)

The metrics to ignore#

The honest question#

Signals worth watching#

The one number that's actually real#

Why there's no single ROI figure#

How we measure it#

AI agents in production: the four operating problems that decide it

Your AI agent didn't fail the deploy. It stopped itself.

How we run a 9-agent growth team on wrai.th (and what broke)

Want this running on your team?