Maturity Matrix

Span of control = how many agents you can effectively supervise

Span of control is a management concept from organizational science: how many direct reports can one manager effectively supervise? The answer for human teams is typically 5-9, wit

  • ·Developer role is formally defined as "manager of agent fleet"
  • ·Span of control is measured: how many parallel agents each developer effectively supervises
  • ·Performance evaluation includes agent supervision effectiveness (not just personal code output)
  • ·Span of control target is defined per role (e.g., 3-5 agents for standard developers, 5-10 for senior)
  • ·Agent supervision training is part of standard developer onboarding

Evidence

  • ·Updated role descriptions defining developer as agent supervisor
  • ·Span of control metrics dashboard
  • ·Performance review criteria including agent supervision effectiveness

What It Is

Span of control is a management concept from organizational science: how many direct reports can one manager effectively supervise? The answer for human teams is typically 5-9, with 7 as a common optimum. The same concept applies to agent fleet management: how many AI agents can one developer effectively supervise simultaneously? The answer turns out to be in the same range - typically 3-7 - and for the same underlying reasons.

The parallel between human team management and agent fleet management is not superficial. In both cases, the limiting factor is the supervisor's attention and cognitive bandwidth. A manager with 12 direct reports cannot give each person the direction, feedback, and support they need; quality degrades and people go off track. A developer with 8 parallel agents cannot review each agent's output carefully, catch errors promptly, and course-correct quickly enough; quality degrades and agents go off track. The cognitive load of maintaining a mental model of "where is each unit, what do they need, what decision needs to be made next" scales with the number of units, and eventually exceeds the supervisor's capacity.

The 3-7 range for agents specifically reflects two constraints. The lower bound (3) represents the minimum parallelism that actually delivers a throughput multiplier worth the coordination overhead. Running 2 agents has limited advantage over sequential work once you account for the attention cost. The upper bound (7) reflects the maximum number of simultaneous agent states that most developers can hold in working memory while maintaining adequate supervision quality. Some developers with highly developed task management systems and high-context, well-defined tasks can operate at 7-10; most developers working on moderately complex tasks with moderate context quality are most effective at 4-6.

The practical span of control for a given developer on a given day depends on several variables: task complexity (simple, well-defined tasks support higher spans), context quality (agents with excellent context make fewer disruptive decisions), review cycle discipline (structured 15-minute cycles support higher spans than ad-hoc monitoring), and the developer's experience with fleet management (experienced fleet managers can handle more agents than those new to the practice).

Why It Matters

Understanding span of control provides a principled framework for structuring AI-augmented work:

  • Sets realistic throughput expectations - teams that expect developers to run unlimited parallel agents will be disappointed; teams that plan around 4-6 agents per developer get accurate velocity projections
  • Identifies where productivity gains plateau - the throughput benefit of adding agents is not linear; the first 3 provide high additional value, the next 2-3 provide moderate additional value, beyond 7 the marginal value typically falls below the marginal supervision cost
  • Guides task design decisions - if you want to increase a developer's effective span, the most direct intervention is increasing context quality and task specificity so that agents need less supervision per task; this is a better investment than trying to increase raw supervision capacity
  • Informs hiring and team structure - an organization of 10 developers with spans of 5 agents per developer can sustain 50 simultaneous agent workstreams; this changes how you think about the relationship between headcount and development capacity
  • Creates a growth metric for developers - increasing your effective span of control is a concrete, measurable developer skill; a developer who moves from a span of 3 to a span of 6 has demonstrably become a more effective fleet manager; this is a new dimension of developer growth that the traditional role structure doesn't capture
Tip

Track your actual span of control over a two-week period. Log how many agents you started, how many you were supervising simultaneously at peak, and how often you had to clean up agent work that went significantly wrong. This is your real span of control, not your theoretical maximum. Start from this baseline rather than aspirational targets.

Getting Started

6 steps to get from here to the next level

Common Pitfalls

Mistakes teams actually make at this stage - and how to avoid them

How Different Roles See It

B
BobHead of Engineering

Bob is doing sprint planning and trying to estimate velocity for the upcoming quarter. His team has 8 developers operating at various levels of AI fluency. He's trying to figure out how many parallel agent workstreams the team can sustain and what the actual throughput multiplier is compared to pure human development.

What Bob should do - role-specific action plan

S
SarahProductivity Lead

Sarah is designing the organizational metrics dashboard for AI adoption. She wants to include a "productivity" metric but is struggling to define what to measure. She's been asked to demonstrate that the AI tooling investment is delivering value, and stakeholder expectations are high.

What Sarah should do - role-specific action plan

V
VictorStaff Engineer - AI Champion

Victor runs at a span of 6-7 agents effectively and has developed the most sophisticated fleet management workflow on the team. He's been asked to help two other developers increase their effective span from 3 to 5. He's not sure how to coach this transition because most of what he does has become intuitive.

What Victor should do - role-specific action plan