CI Feedback Latency tracking
CI Feedback Latency is the time from when an agent pushes a commit to when CI produces a result (pass or fail) that the agent can act on.
- ·ITS (Iterations-to-Success) is tracked with a target of 1-3
- ·CPI (Cost-per-Iteration) is tracked with a target under $0.50
- ·CI feedback latency is tracked as a metric (time from push to CI result)
- ·Metrics are broken down per team and per repository
- ·Cost tracking includes model API costs, CI compute costs, and runner costs per iteration
Evidence
- ·ITS dashboard showing iteration count distribution per PR
- ·CPI dashboard showing cost per CI iteration
- ·CI feedback latency chart with P50, P95, P99 breakdowns
What It Is
CI Feedback Latency is the time from when an agent pushes a commit to when CI produces a result (pass or fail) that the agent can act on. For humans, a 15-minute CI run is annoying but manageable - a developer can context-switch to something else and return when CI is done. For agents, a 15-minute CI feedback loop is an architectural problem. An agent waiting for CI feedback is either idle (wasting time) or guessing at fixes without feedback (producing lower quality code). Either outcome degrades the agent workflow significantly.
The reason CI feedback latency matters so much more for agents than for humans is the iteration structure. A human developer runs CI once, waits, and takes action. An agent running at ITS of 2-3 iterates: push, wait for CI, read result, push fix, wait for CI again. If each CI run takes 15 minutes and the agent iterates 3 times, that's 45 minutes of latency before a PR is ready for review. If CI takes 2 minutes, the same workflow completes in 6 minutes. The 7x latency difference directly translates to agent throughput.
At L3, tracking CI feedback latency separately from other metrics is important because it's a controllable variable that has an outsized impact on everything else. Teams that invest in CI speed - through better caching, test parallelization, incremental test selection, and faster runner hardware - see immediate, measurable improvements in ITS, in agent throughput, and in the development experience for human developers as a side benefit.
The target at L3 is to understand the current distribution and drive toward the L4 target of under 5 minutes. The measurement itself - tracking p50, p90, and p99 CI run duration for agent-triggered runs - is the starting point. Teams often discover that their CI latency distribution is bimodal: a fast path for small changes (2-3 minutes) and a slow path for changes touching shared infrastructure (20-30 minutes). The slow path is where agent workflows get stuck, and it's the highest-priority optimization target.
Why It Matters
- Directly multiplies agent throughput - every minute reduction in CI latency is a minute reduction in agent iteration time, multiplied by ITS; teams with 5-minute CI run 3x faster agent workflows than teams with 15-minute CI
- Reduces agent errors from guessing - agents that wait a long time for CI feedback are more likely to make additional changes while waiting, compounding errors; fast CI keeps the agent's reasoning tightly coupled to real test feedback
- Makes high-ITS less costly - even if ITS is 4-5 on complex tasks, fast CI means the total iteration time is still acceptable; the combination of tracking both metrics reveals the actual throughput ceiling
- Improves human developer experience as a side effect - CI speed investments benefit human developers equally; the business case for CI infrastructure investment is stronger when it serves both human and agent workflows
- Reveals test suite health problems - slow CI latency often traces to specific slow tests or test dependencies; tracking latency by test suite segment identifies which tests are the bottleneck and should be candidates for optimization or parallelization
Getting Started
6 steps to get from here to the next level
Common Pitfalls
Mistakes teams actually make at this stage - and how to avoid them
How Different Roles See It
Bob is seeing that his team's agent workflows are slower than expected. Individual agents take 30-45 minutes to produce a PR ready for review, when he expected 10-15 minutes. The agents are running but the throughput isn't materializing.
What Bob should do - role-specific action plan
Sarah is investigating why some developers' agents are more productive than others. She's collected ITS data and notices that the developers with high ITS tend to work in a specific area of the codebase. She suspects CI latency might be part of the story.
What Sarah should do - role-specific action plan
Victor has a 2-minute CI fast path for his agent workflows and runs full CI only on PR merge. His agents get nearly instant feedback and consistently achieve ITS of 1-2 on well-specified tasks. He considers fast CI as important as good context for agent productivity.
What Victor should do - role-specific action plan
Further Reading
5 resources worth reading - hand-picked, not scraped
From the Field
Recent releases, projects, and discussions relevant to this maturity level.