DORA metrics (if at all)
At L1 (Ad-hoc), most engineering teams track DORA metrics inconsistently or not at all.
- ·DORA metrics (deployment frequency, lead time, change failure rate, MTTR) are not tracked, or tracked inconsistently
- ·No AI-specific metrics exist
- ·Team acknowledges the need for AI-specific metrics beyond traditional DORA
- ·Basic deployment frequency is at least known (even if not dashboarded)
Evidence
- ·Absence of metrics dashboard or inconsistent/manual tracking
- ·No AI-specific fields in existing metrics systems
What It Is
At L1 (Ad-hoc), most engineering teams track DORA metrics inconsistently or not at all. DORA - the four key metrics from the DevOps Research and Assessment program - are deployment frequency, lead time for changes, change failure rate, and mean time to restore. These metrics were designed to measure software delivery performance for traditional human-driven development. Teams at L1 may have heard of them, may have a dashboard that pulls some of the data, but the metrics are rarely reviewed in retrospectives, rarely acted on, and often incomplete because the underlying tooling isn't set up to capture them reliably.
The "if at all" qualifier is honest. Many engineering teams at this level are in one of two positions: either they've never implemented DORA tracking because they're focused on feature delivery and haven't had the bandwidth for measurement infrastructure, or they have a partial DORA dashboard that was set up by a past initiative and is now ignored. Both situations share the same consequence - no reliable signal about whether the team is delivering software effectively.
DORA metrics were conceived before AI-assisted development existed. They measure the output of human developers working in a traditional code-review-and-merge cycle. They remain useful at L2 and L3 as a baseline, but at L1 they're a symptom of a broader problem: the team doesn't yet have a culture of measurement. Without measurement, there's no feedback loop for improvement, and without a feedback loop, AI adoption will be unguided and its impact invisible.
The goal at L1 is not to achieve perfect DORA tracking - that's a distraction. The goal is to recognize that the absence of measurement is itself a significant problem, and to take the first step toward establishing any consistent signal about delivery performance. Even imperfect DORA data is vastly better than no data.
Why It Matters
- No measurement means no improvement signal - teams that don't track deployment frequency don't know if they're deploying more or less than they were six months ago; AI adoption decisions are made on gut feel rather than evidence
- DORA baselines enable before/after comparisons - when AI tools are introduced, teams with existing DORA data can immediately measure impact; teams without baselines can never prove (or disprove) that the investment paid off
- Deployment frequency predicts everything else - teams that deploy frequently have shorter lead times, lower failure rates, and faster recovery; DORA is a proxy for overall development health, not just a CI/CD metric
- Leadership will ask for ROI - at some point, the CTO or CFO will ask what the AI tooling investment produced; teams with DORA baselines have an answer; teams without them have nothing to show
- Measurement creates accountability - tracking lead time makes slow review cycles visible; tracking change failure rate makes test quality visible; measurement surfaces problems that were previously invisible and therefore unaddressed
Getting Started
6 steps to get from here to the next level
Common Pitfalls
Mistakes teams actually make at this stage - and how to avoid them
How Different Roles See It
Bob leads a 40-person engineering organization where some teams have informal DORA tracking and most don't. He's invested in GitHub Copilot licenses and is getting pressure from leadership to show impact. When he asks his engineering managers what the team's lead time is, he gets different answers from every manager - because they're all measuring different things.
What Bob should do - role-specific action plan
Sarah has been asked to evaluate whether the team's AI tool investments are working. She pulls the data she can find - PR counts, commit frequency, some velocity metrics from Jira - but nothing is connected to outcomes. She can tell that some developers are using AI tools more than others, but she can't tell whether that usage is translating into faster delivery.
What Sarah should do - role-specific action plan
Victor is the team's most advanced AI user and is frustrated that he can't make the case for the AI patterns he's pioneered. He can feel the productivity improvement in his own work - he ships more and context-switches less - but he has no data to back up the claim when he advocates for team-wide adoption.
What Victor should do - role-specific action plan
Further Reading
5 resources worth reading - hand-picked, not scraped
Metrics