# When “Faster” Doesn’t Mean Sooner

[Jirka Koutny](https://www.strv.com/blog/authors/jirka-koutny)  
Backend Engineering Manager

---

## TL;DR

AI makes it easier to produce code. That often increases change volume and scope, which shifts the bottleneck to review, testing, security and integration. If you don’t redesign how confidence is created, you don’t get faster delivery. You get a larger queue of work waiting to be trusted.  

**A useful question:** When we say “AI makes us faster,” do we mean typing or do we mean shipping?

> “The meaning of a word is its use in the language.”

## Why AI Isn’t Improving Your Numbers  

A specific kind of discomfort is showing up in presales and planning conversations lately. AI can generate code quickly, so the story seems obvious: projects should get faster and cheaper. Surprisingly, many teams are seeing the opposite. [Estimates don’t move. Sometimes they rise.](https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/)

This is [the AI acceleration paradox](https://survey.stackoverflow.co/2025/ai). The AI acceleration paradox is what happens when AI speeds up code generation but overloads downstream steps (review, testing and integration), leaving delivery time unchanged or worse. In my experience, it often starts with one deceptively simple misunderstanding.  

## What “Faster” Changes Downstream

A common assumption goes like this: **Dev delivers faster, but not in lower quality, so QA shouldn’t change much.**  

The counter-argument is subtler but crucial: **Dev delivers faster, but not less, meaning QA has more to verify in the same calendar time.**  

Both statements can be true at the same time, depending on what we mean by *faster*. When AI reduces coding friction, teams produce more: more changes, more iterations, a wider scope, and more combinations of things that can break. Even if individual changes are fine, the total surface area that needs verification grows.  

This is where the paradox starts. We optimize a local step (code generation) and unintentionally overload downstream ones: review, QA, security, integration.  

## Where AI Actually Shifts Time in the Delivery Pipeline

AI speeds up writing code. It doesn’t automatically speed up shipping software. That's the part that matters.  
Clients don’t pay us for code. They pay us for delivery: software that solves their problems and keeps solving them. That has very little to do with how quickly we can produce code in isolation.  

It’s like widening one stretch of road while leaving all the intersections untouched. Cars move faster for a moment, then they queue. In software, that queue is work waiting for confidence: pull requests waiting for review, tests waiting to be written or fixed, security checks waiting and releases waiting for sign-off.  

## Where the Bottleneck Moves

The Theory of Constraints forces a simple question: where is the system actually constrained?  

Throughput is limited by the narrowest point. Speed up something else, and you don’t increase throughput. You create inventory in front of the constraint.  

For many teams, AI has changed the shape of the pipe. Code can appear quickly, sometimes very quickly. The work that follows often can't scale at the same rate: review capacity, test coverage, integration stability, security validation and the act of proving "this does what we think it does."  

## The Cost of “Almost Right” AI Code  

The expensive failures aren’t the obvious ones. If the model generated nonsense, you’d reject it immediately. The cost comes from almost-right code. It looks plausible, reads well and even passes a quick glance; it seems ready to commit and release. Under the surface, it contains subtle issues that take real effort to uncover.  

That creates a verification tax. Typing time goes down. Reading, reasoning, testing and debugging times go up. If you measure end-to-end time instead of perceived speed, what you “saved” during implementation often reappears later as verification and correction work.  

There’s also no free ride on trust:  
- If people don’t trust the output, they verify more.  
- If people trust too much, defects leak, and you pay later.  

Either way, the system pays. Only the timing changes.  

## Different Teams, Different Definitions of "Faster"

This is where delivery starts to feel political, even when nobody is being political.  

Leadership hears “AI makes us faster” and expects shorter timelines. Developers see more options and produce more changes. Reviewers and QA see a flood of diffs. Security sees more surface area and more risk. Seniors become the emergency services, pulled into judgment calls and subtle failures because they're the people others rely on when the system is uncertain.  

## How to Make AI Speed Show Up in Delivery

The fix is to redesign where confidence is created. Slowing AI adoption won't work. Neither will push the team harder. Traditional QA treated quality like a checkpoint at the end. That worked when code production was expensive and slow. If AI makes code production cheap, quality has to be engineered earlier and scaled through automation. Most teams treat this as a QA department problem. It's actually a delivery-system design problem.  

Here’s what it looks like when AI helps the system, not just the coder.  

## Limit the Queue

If PRs are piling up, that's inventory, and inventory is cost.  

A blunt but effective rule: if the review queue exceeds a threshold (say five PRs), block new code generation until the backlog is cleared. It sounds aggressive, but it stabilizes lead time by preventing the “start more work while blocked” spiral.  

## Make Verification Explicit

If the AI writes the code, the AI should write the tests.  

Generate unit/integration tests from the PR diff and requirements. Push toward test-driven generation. Reject PRs that don’t come with passing tests. This is a workflow rule, not a preference.  

## Scale Verification with Agentic Testing

Classic UI automation becomes brittle under high change volume. Agentic testing scales better:  
- Dynamic test generation from changes and requirements  
- Self-healing suites when selectors change  
- Exploratory agents that hunt regressions outside scripted paths  

In parallel, add an AI "first-pass" reviewer that flags missing tests, risky patterns and compliance issues and summarizes changes for humans. That frees humans to focus on architecture and correctness.  

## Move Quality Earlier

Quality can't be an afterthought anymore; it has to be designed in, adapted and respected more than ever. QA work starts earlier: shaping acceptance criteria so they're testable, adding constraints directly into prompts, validating inputs, not only debugging outputs.  

The line between manual and automated testing is fading. What replaces it is a model where automation and agentic tooling do the heavy lifting, while human judgment serves as the ultimate quality gate.  

## Keep Judgment Human

Complex logic still needs human judgment. The best model is *human orchestrator plus AI velocity*: generate, review, iterate, until it’s actually right. That’s also how you avoid turning senior engineers into a permanent constraint.  

## Make Security Part of the Flow

As output volume grows, security and compliance can become constraints too. Policy checks and dependency scanning can’t be optional “later” steps anymore.  

## Measure Flow, Not Output

If you keep tracking output metrics like lines of code, AI will make you feel great while delivery stays stuck. Lines of code measure activity, not progress.  

Better gauges are flow gauges:  
- How long does work wait for review?  
- How many times does it bounce back?  
- How often does CI break?  
- How much gets rewritten shortly after merge?  
- How frequently do defects escape?  
- What's the quality of fixed defects?  

If you want a simple “AI reality dashboard,” watch PR review cycle time, churn or rework rate, defect escape rate and flow efficiency. They expose the invisible tax that local productivity metrics miss.  

## Questions to Ask Before Scaling AI Tooling

Before you buy another license bundle, ask:  
- What is our bottleneck today: coding, review, tests, security, integration or skillset gaps?  
- Are we increasing throughput, or just increasing inventory?  
- Where is confidence created in our process, early or at the end?  
- Do we have explicit "proof" rules (tests, gates) or implicit "trust" habits?  
- Which metrics describe flow, not activity?  

## Four Failure Modes to Watch For

- If you don’t limit WIP, AI will amplify your queues.  
- If you don’t scale verification, speed turns into rework.  
- If you don’t redesign confidence, your best people become the constraint.  
- If you don’t communicate enough, you might be part of a bottleneck.  

## The Question to End With

AI can increase local speed dramatically. But if you don’t change the surrounding process (how you verify, how you review, how you manage work-in-progress), you can end up with a system that produces more change and delivers less certainty.  

Ask the harder question. Did AI reduce our end-to-end time to value, or did it just increase the volume of work waiting to be trusted?  

---

**Don't miss anything**