Maturity Matrix

Green = auto-merge → auto-deploy

Green = auto-merge → auto-deploy is the L4 delivery pattern where a pull request that passes all required CI checks and satisfies all policy criteria is automatically merged and th

  • ·Green-classified PRs auto-merge and auto-deploy without human intervention
  • ·Team throughput exceeds 50 PRs per day
  • ·Canary or progressive deployment is automated (no manual rollout decisions)
  • ·Auto-deploy includes automated rollback on error rate threshold breach
  • ·Merge queue wait time is under 10 minutes

Evidence

  • ·Auto-merge and auto-deploy logs for Green PRs
  • ·PR throughput dashboard showing 50+ per day
  • ·Canary deployment configuration with automated promotion/rollback rules

What It Is

Green = auto-merge → auto-deploy is the L4 delivery pattern where a pull request that passes all required CI checks and satisfies all policy criteria is automatically merged and then automatically deployed to production without requiring any human to click a button. The entire path from "agent completes task" to "feature in production" is automated. Humans intervene only when automation fails or when the change is explicitly flagged as requiring review.

This is not the same as "simple CD" from L1. Simple CD automates the mechanics but has no safety system. Green = auto-merge → auto-deploy at L4 has: policy-based merge rules that define exactly what "green" means (which checks, which approvals, which review criteria), a merge queue that handles concurrency, a staged deployment pipeline with automated health checks, and automated rollback if post-deploy metrics degrade. The automation is safe because it's surrounded by policy.

The trigger condition "green" requires precise definition. At a minimum: all required CI checks pass, no unresolved review comments, policy-defined approval requirements met (which may be zero for auto-approved PR categories), and no conflict with queued PRs. Some teams add additional signals: security scan clean, change is below a size threshold, PR was produced by an approved agent identity. The policy file is the source of truth for what "green" means.

At L4, auto-merge → auto-deploy is the default path, not the exception. The majority of PRs (especially agent-generated ones) flow through without human intervention. The team's attention shifts from "merging and deploying PRs" to "watching dashboards and responding to anomalies." This is a fundamental change in how engineering time is spent.

Why It Matters

  • Eliminates the merge-to-deploy gap - without auto-merge and auto-deploy, approved PRs sit waiting for a human to merge them; at 50+ PRs/day this creates a perpetual backlog of "approved but not shipped" work with real business cost
  • Removes the deploy as a coordination event - manual deploys require scheduling, communication, and attention from multiple people; auto-deploy eliminates this overhead and makes every merge a silent, routine production update
  • Enables L5 throughput - 1000+ merges/week (Stripe scale) is simply not achievable with any human-touch merge or deploy process; auto-merge → auto-deploy is the prerequisite for that scale
  • Closes the agent feedback loop - agents producing PRs get real production feedback (via monitoring, error rates, feature flags) rather than waiting for human-orchestrated deploys; this feedback improves future agent outputs
  • Reduces time-to-user for fixes - a bug fix that passes CI and policy review should be in production within minutes, not hours or days; auto-merge → auto-deploy makes this automatic

Getting Started

6 steps to get from here to the next level

Common Pitfalls

Mistakes teams actually make at this stage - and how to avoid them

How Different Roles See It

B
BobHead of Engineering

Bob's team has the technical prerequisites for auto-merge (merge queue, policy rules, CD pipeline with gates) but the CTO is concerned about "removing humans from the deploy loop." Bob needs to make the case that auto-merge → auto-deploy with good policy and automated rollback is safer than the current manual process, not riskier.

What Bob should do - role-specific action plan

S
SarahProductivity Lead

Sarah has been measuring the gap between "PR merged" and "code in production" and finds it averages 4 hours on her team. Most of that gap is a human waiting to execute the deploy. For agent-produced code, this 4-hour gap is particularly problematic because agents can't observe the production feedback they need to validate their work.

What Sarah should do - role-specific action plan

V
VictorStaff Engineer - AI Champion

Victor has already configured auto-merge for low-risk PRs in his repositories and it works flawlessly. He wants to extend auto-merge → auto-deploy to the team's main production service, which is higher stakes. He needs a rollback mechanism that's reliable enough to trust with automated production deploys.

What Victor should do - role-specific action plan