PR throughput per dev
PR throughput per developer is the first meaningful per-developer productivity metric for AI-assisted development: how many pull requests does a developer merge per week, on averag
- ·DORA metrics are tracked consistently with a dashboard
- ·AI tool license count vs. active usage rate is measured
- ·PR throughput per developer is tracked
- ·AI acceptance rate (% of AI suggestions accepted) is measured per tool
- ·Metrics are reviewed in team retrospectives at least monthly
Evidence
- ·DORA metrics dashboard with current data
- ·License utilization report (licenses purchased vs. active users)
- ·PR throughput chart showing per-developer breakdown
What It Is
PR throughput per developer is the first meaningful per-developer productivity metric for AI-assisted development: how many pull requests does a developer merge per week, on average? At L2 (Guided), this becomes the primary signal for tracking whether AI tools are changing developer output. It's not a perfect metric - PRs vary enormously in size and complexity - but it's the most directly measurable output signal available without sophisticated instrumentation.
Before AI tools, typical developer throughput in a well-functioning team is 3-6 PRs per week, depending on the codebase, review culture, and task complexity. With active AI tool usage at L2, teams consistently see this number move to 6-10 PRs per week for high-usage developers. At L3 and L4 with agent-heavy workflows, throughput can reach 15-30 PRs per week per developer. These numbers aren't universal - they depend heavily on PR size convention, codebase maturity, and CI speed - but the direction and magnitude of the shift is consistent across teams that have tracked it carefully.
PR throughput is the gateway metric to a richer productivity picture. By itself, it tells you output rate but not output quality. A developer running agents that produce 20 PRs per week of mediocre, heavily-edited code might have lower net output than a developer producing 8 carefully crafted PRs that merge cleanly. This is why PR throughput is always paired with PR cycle time (how long from PR open to merge?), review burden (how much effort do reviewers spend on each PR?), and defect rate (how often does the code break in production?). Together, these four metrics give a much more complete picture.
The reason PR throughput is the right starting metric at L2 is that it requires no new instrumentation. GitHub, GitLab, and Bitbucket all expose this data directly. A simple query or dashboard plugin will give you per-developer weekly PR counts going back as far as your git history. This makes it possible to compute a pre-AI baseline from historical data even if you didn't instrument anything before AI adoption began.
Why It Matters
- Directly observable - PR throughput requires zero new tooling to track; it's the most accessible productivity signal available and can be computed retroactively from git history
- Reveals the AI impact immediately - teams that add AI tools and then measure PR throughput almost always see an inflection point at the month when adoption ramped up; this is the clearest data point available for AI ROI conversations
- Per-developer granularity - reporting average throughput across the team hides enormous variance; per-developer tracking reveals who has adopted effective AI workflows (high throughput) and who hasn't (unchanged throughput), enabling targeted intervention
- Creates healthy benchmarks - once the team knows the distribution of PR throughput, the benchmark becomes the high-performer's output; developers can see that 10 PRs/week is achievable for their codebase and are motivated to find the AI workflows that get them there
- Leading indicator for capacity planning - if developers are producing PRs faster, the bottleneck moves to review, to CI, or to planning; tracking PR throughput early reveals where the next constraint is before it becomes a crisis
Getting Started
6 steps to get from here to the next level
Common Pitfalls
Mistakes teams actually make at this stage - and how to avoid them
How Different Roles See It
Bob's team has been using AI tools for six months. He wants to show the CTO that the tools are producing results, but his engineering managers are giving him conflicting reports about productivity impact. Some say the team is shipping faster, others say they're not sure.
What Bob should do - role-specific action plan
Sarah has been asked to design a developer productivity scorecard. She's been given several options: PR throughput, lines of code, story points, commit frequency. She needs to pick the right metrics for a fair and useful scorecard.
What Sarah should do - role-specific action plan
Victor's personal PR throughput has tripled since he adopted parallel agent workflows. He produces 20-25 PRs per week, up from 7-8. He wants to help the rest of the team reach similar levels, but he knows that simply showing his numbers will seem unbelievable or intimidating.
What Victor should do - role-specific action plan
Further Reading
5 resources worth reading - hand-picked, not scraped