17 June 20263 min read
Measuring AI's impact in production, honestly (no vanity metrics)
Lines of AI-written code and acceptance rates measure activity, not impact. The honest question is whether your team ships more of the right work at the same quality. How to read that, and the one number that's actually real.

Short version: most AI impact numbers measure activity, not value. Lines of AI-written code, suggestion-acceptance rate, "percent of code written by AI", commit counts, all of them are easy to grow and tell you almost nothing. The honest question is narrower and harder: does your team now ship more of the right work, at the same or better quality, with the same people? You read that against your own baseline, with a few real signals, not a single invented productivity percentage.
The metrics to ignore#
These look like measurement and aren't:
- Lines of AI-written code. More code is a cost, not a result. This rewards volume and quietly punishes the cleanest solution, which is often less code.
- Suggestion-acceptance rate. Measures how agreeable the autocomplete is, not whether the accepted code was right or survived review.
- Trivially gamed, and a team can raise it while shipping more low-quality code that costs more to review and fix.