Maturity Matrix

PEV loop: Plan → Execute → Verify

The PEV loop - Plan, Execute, Verify - is the fundamental operating model for working with AI agents at high maturity.

  • ·Agentic Engineer role combines orchestration, supervision, and architecture responsibilities
  • ·PEV (Plan, Execute, Verify) loop is the standard workflow for all engineering tasks
  • ·Non-coder contributors can produce software changes via agent interfaces
  • ·Agentic Engineer career ladder exists with defined progression criteria
  • ·Non-coder contribution rate is tracked as an organizational capability metric

Evidence

  • ·Agentic Engineer role description with orchestration and supervision responsibilities
  • ·PEV loop documentation and adoption evidence in team workflows
  • ·Non-coder contributor logs showing software changes via agent interfaces

What It Is

The PEV loop - Plan, Execute, Verify - is the fundamental operating model for working with AI agents at high maturity. It provides a structured three-phase framework that applies at every scale: a single developer delegating one task to one agent, a fleet manager coordinating five parallel agents, and an Agentic Engineer operating an organizational-scale orchestration system. The loop's power is in its universality: the same three questions apply at every level of complexity. Plan: what exactly needs to be done, and what does the agent need to know to do it? Execute: how does the work get done with appropriate supervision? Verify: how do we know the output meets the requirements?

The Plan phase is where most of the work happens that determines quality. A well-planned task has: a clear scope (what's in and what's explicitly out), the context the agent needs (architecture, patterns, constraints), success criteria specified in terms the agent can verify (tests pass, specific behavior is present, specific file changes are made), and an explicit statement of what "done" looks like. The Plan phase is not a few seconds of writing a prompt - it is 10-20 minutes of structured thinking about the task, analogous to a tech lead doing careful task design before assigning work to a developer.

The Execute phase is not passive waiting - it is structured supervision. The developer (or orchestration system) monitors agent progress at defined intervals, intervenes when agents encounter decision points that require human judgment, and maintains the broader context that the agent can't hold in its context window. The Execute phase at the organizational level involves routing tasks to appropriate agent types, managing dependencies between parallel workstreams, and handling the exceptions that automated systems escalate.

The Verify phase closes the loop by validating that the Execute phase achieved what the Plan phase specified. This is not just running the tests (though that's part of it). At L5, Verify includes: automated quality checks calibrated to the specific task type, intent alignment validation (does the output match what was actually intended, not just what was formally specified?), side effect checking (what did the agent change beyond the explicit task scope?), and feedback capture (what in the Plan or Execute phase would have prevented any defects found in Verify?). The feedback from Verify feeds back into the Plan phase for the next iteration, making the loop truly cyclic rather than linear.

Why It Matters

The PEV loop creates discipline and reproducibility in agentic workflows that ad-hoc agent use cannot provide:

  • Makes quality predictable - teams with PEV discipline produce consistently high-quality AI outputs; teams without it produce highly variable outputs that depend on individual developer experience; PEV is what converts AI potential into reliable engineering practice
  • Identifies where quality problems originate - the three-phase structure makes it easy to diagnose failures: defects from poor planning (scope unclear, context missing), execution failures (insufficient supervision, wrong task routing), or verification gaps (tests didn't catch the real requirement, intent alignment wasn't checked); each diagnosis points to a specific improvement
  • Enables organizational-scale automation - at L5, the PEV loop can be partially automated: automated plan templates for standard task types, automated execution with built-in supervision checkpoints, automated verification using test suites and quality checks; the loop structure is what makes automation possible because it defines clear phase boundaries
  • Creates a common language for AI-augmented work - when all developers use the same three-phase mental model, communication improves: "I'm in plan phase for this feature" means something specific, "this failed at verify" points to a specific phase, "the plan was underspecified" is an actionable diagnosis
  • Scales from individual to organizational use - the same framework works for a single developer with one agent and for an Agentic Engineer designing a system that runs thousands of agent tasks per week; universal applicability makes it worth the investment in internalizing deeply
Tip

When an agent task fails, identify which phase the failure originated in. Most failures trace back to the Plan phase (underspecified context, unclear scope) rather than the Execute phase (agent behavior) or Verify phase (checking criteria). This means that improving Plan quality is the highest-leverage intervention for improving overall agent task success rates.

Getting Started

6 steps to get from here to the next level

Common Pitfalls

Mistakes teams actually make at this stage - and how to avoid them

How Different Roles See It

B
BobHead of Engineering

Bob's team has high AI adoption and generally good agent usage, but the variance in output quality is still significant. Some developers produce excellent agent outputs consistently; others have boom-and-bust cycles: great results on some tasks, expensive messes on others. Bob suspects the inconsistency comes from inconsistent planning and verification practices.

What Bob should do - role-specific action plan

S
SarahProductivity Lead

Sarah is designing the L4 to L5 transition curriculum and needs to explain what changes when developers move from fleet management to agentic engineering. PEV is the answer - L4 developers use agent fleets with individual-level PEV loops; L5 Agentic Engineers design and operate organizational-scale PEV systems. But she needs to make this distinction concrete for people who haven't experienced both levels.

What Sarah should do - role-specific action plan

V
VictorStaff Engineer - AI Champion

Victor has been using PEV implicitly for months - he just hasn't named it. When he reflects on his most effective agent workflows, they all have the same structure: careful task specification before launch, structured check-ins during execution, and a checklist-driven review of completed outputs. He hasn't articulated this to his colleagues, who don't have the same structure and produce less consistent results.

What Victor should do - role-specific action plan