Back to Development
developmentL3 SystematicContext Engineering

MCP servers: architecture, ownership, SLA

Model Context Protocol servers are the infrastructure layer of context engineering at L3 - dedicated services that give AI agents structured, real-time access to your organization's knowledge.

  • ·MCP servers provide structured context (architecture, ownership, SLAs) to agents
  • ·Context is organized across at least 3 of the 5 levels: System, Code, Org, Historical, Operational
  • ·Token budget management is implemented (agents receive context within defined token limits)
  • ·Context sources are versioned and tested for correctness
  • ·Context budgeting policy defines priority order when token limits are reached

Evidence

  • ·MCP server configuration files listing active context sources
  • ·Token budget configuration in agent settings
  • ·Context coverage audit showing 3+ context levels populated

What It Is

The Model Context Protocol (MCP) is an open standard that defines how AI agents discover and consume structured context from external services. An MCP server exposes "tools" - discrete capabilities that agents can call to retrieve information or take actions. In the context of coding workflows, MCP servers might expose: database schemas, API contract definitions, deployment status, team ownership graphs, incident history, or architecture documentation.

At L2, context is static - it lives in CLAUDE.md files and is written once, updated occasionally. At L3, context becomes dynamic: agents query MCP servers at runtime to retrieve the current state of the world. Instead of a static CLAUDE.md entry saying "the payments service owns these endpoints," an MCP server returns live ownership data from your service registry. Instead of a CLAUDE.md describing the database schema, an MCP server returns the actual schema from the running database.

Stripe is the canonical example: in their production agent workflows, agents have access to over 400 MCP tools covering every aspect of the Stripe platform - API definitions, internal service contracts, deployment pipelines, team ownership, and more. When an agent modifies a payments API endpoint, it can query the MCP server to see which teams own dependent services, what the current SLA commitments are, and what the deployment history looks like. This is qualitatively different from static context: the agent is reasoning with live organizational knowledge.

At L3, organizations treat MCP servers as internal infrastructure. They have defined owners, SLA targets (availability, latency), security postures (authentication, authorization, audit logging), and operational runbooks. The MCP server is not a prototype or a developer convenience - it's a production service on which agent workflows depend.

Why It Matters

MCP servers unlock a category of agent capability that static context files cannot provide:

  • Real-time accuracy - agents work with current state, not stale documentation that may be months out of date
  • Structured data instead of prose - database schemas, API contracts, and ownership graphs are queryable data structures, not paragraphs the agent has to parse
  • Organizational scope - MCP servers can aggregate context across the entire organization, not just a single repository
  • Programmatic access patterns - agents can retrieve only the context they need for a specific task, reducing context window pressure
  • Audit trail - every MCP query is logged, creating an audit trail of what context agents consulted when making decisions

The difference between L2 and L3 context engineering is the difference between a static map and a live navigation system. The static map (CLAUDE.md) is better than nothing. The live navigation system (MCP) tells you where the traffic jams are right now.

Tip

Start your MCP server journey with a read-only, non-sensitive data source - your team's service registry or API contract catalog. This lets you test the infrastructure pattern and demonstrate value without taking on security complexity at the outset.

Getting Started

6 steps to get from here to the next level

Common Pitfalls

Mistakes teams actually make at this stage - and how to avoid them

How Different Roles See It

B
BobHead of Engineering

Bob's team has successfully deployed CLAUDE.md files across most repositories (L2). Developers are reporting good results, but senior engineers are raising a new problem: the context in CLAUDE.md files keeps going stale. The service ownership section was accurate six months ago; since then, three service owners have changed and two services were deprecated. Manually updating CLAUDE.md isn't keeping up with organizational change.

What Bob should do - role-specific action plan

S
SarahProductivity Lead

Sarah has been tracking agent suggestion quality and sees a pattern: suggestions about the current state of the system (who owns what, what's deployed, what the current schema looks like) are consistently worse than suggestions about how to write code. The coding quality has improved with CLAUDE.md; the operational knowledge quality hasn't.

What Sarah should do - role-specific action plan

V
VictorStaff Engineer - AI Champion

Victor has been manually providing operational context in every agent session: the current schema migration state, which services are currently degraded, what the team's sprint focus is. He's essentially acting as a human MCP server. He knows this doesn't scale and has been researching the MCP protocol to understand what building a real MCP server would take.

What Victor should do - role-specific action plan