What Are DORA Metrics? A Beginner's Guide to Measuring Software Delivery Performance
When I first heard about DORA metrics at Salesken, I was skeptical. We already had velocity charts, sprint burndowns, and Jira dashboards. Why did we need four more numbers? Then I actually measured them — and realized our sprint velocity had been hiding a 9-day lead time and a 35% change failure rate. DORA gave me the honest picture that velocity never could.
In 2018, Google Cloud acquired DevOps Research and Assessment (DORA), the research program that would reshape how engineering teams measure performance. That research distilled software delivery into four key metrics, a framework so effective that it became the industry standard for assessing delivery excellence. Yet many engineering leaders still ask: "What are DORA metrics, and why should I care?"
If you're a VP of Engineering, CTO, or engineering manager struggling to justify DevOps investments or understand whether your team's delivery is truly improving, this guide is for you.
What Does DORA Stand For?
DORA is an acronym for DevOps Research and Assessment. It originated from research by Dr. Nicole Forsgren, Jez Humble, and Gene Kim—the same team behind the bestselling book "Accelerate," which revolutionized how we think about software delivery performance.
The DORA framework measures how well an organization delivers software and operates systems. Unlike vanity metrics that look impressive but don't correlate with business outcomes, DORA metrics directly link to organizational performance, customer satisfaction, and team stability.
Why Do DORA Metrics Matter?
Before DORA, teams relied on vague measures: "We ship when we're ready" or "Our velocity is high." Without standardized metrics, it was impossible to:
- Compare delivery performance across teams
- Identify bottlenecks objectively
- Track whether process improvements actually worked
- Demonstrate the value of DevOps investments to leadership
- Spot teams at risk of burnout or instability
DORA metrics solve this problem by providing clear, measurable signals of delivery health that correlate with business impact.
The 4 DORA Metrics Explained
DORA consists of four key metrics. Together, they tell a complete story about your team's delivery capability and operational stability.
1. Deployment Frequency
What it measures: How often your team deploys code to production.
Why it matters: Deployment frequency is a proxy for speed and team confidence. Teams that deploy frequently typically have:
- Better continuous integration practices
- Faster feedback loops
- Smaller, less risky changes
- More agile responses to production issues
How to measure it: Count production deployments per day, week, or month across your team or service. A high-performing team typically deploys multiple times per day. Lower performers might deploy monthly or less frequently.
Real-world context: Deployment frequency isn't about moving fast recklessly—it's about moving fast safely through smaller, more frequent releases.
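If your deployment tooling exposes timestamps, a first-pass measurement needs nothing more than a list of deploy dates. Here is a minimal sketch in Python; the function name and the windowing rule are illustrative choices, not part of any DORA specification:

```python
from datetime import date

def deploys_per_week(deploy_dates: list[date]) -> float:
    """Average production deployments per week over the observed window."""
    if not deploy_dates:
        return 0.0
    # A window shorter than a week still counts as one week,
    # so a single deploy never divides by zero.
    span_days = (max(deploy_dates) - min(deploy_dates)).days
    weeks = max(span_days / 7, 1)
    return len(deploy_dates) / weeks
```

In practice you would pull `deploy_dates` from your CI/CD system's API rather than hand-entering them; the calculation itself stays this simple.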
2. Lead Time for Changes
What it measures: The time from when code is committed to when it runs in production.
Why it matters: Lead time reveals how quickly your team can respond to:
- Customer requests and feature demands
- Security vulnerabilities
- Critical bugs
- Market opportunities
Fast lead time (hours to a day) indicates mature CI/CD, while slow lead time (weeks or months) suggests manual bottlenecks, approval processes, or testing challenges.
How to measure it: Track the elapsed time from commit to production deployment. For high performers, this is measured in hours. For lower performers, it might be weeks or months.
Real-world context: This metric captures the entire flow—code review, testing, approval, and deployment—giving you visibility into where delays occur.
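Given commit and deploy timestamp pairs joined from Git and your CI/CD system, lead time reduces to a duration calculation. A hedged sketch follows; using the median rather than the mean is my own choice here, since it resists skew from the occasional long-lived branch:

```python
from datetime import datetime
from statistics import median

def lead_time_hours(changes: list[tuple[datetime, datetime]]) -> float:
    """Median hours from commit to production deploy.

    Each pair is (commit_time, deploy_time) for one change."""
    deltas = [(deploy - commit).total_seconds() / 3600
              for commit, deploy in changes]
    return median(deltas)
```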
3. Change Failure Rate
What it measures: The percentage of changes that result in degraded service, require rollback, or cause incidents.
Why it matters: A high change failure rate signals:
- Insufficient testing practices
- Deployments that are too large or risky
- Lack of observability
- Production environments that differ from test environments
Lower failure rates (0-15% for high performers) indicate robust quality practices and confidence in releases.
How to measure it: Track failed deployments as a percentage of total deployments. Failed deployments include those that caused incidents, required rollbacks, or degraded customer-facing services.
Real-world context: This metric rewards teams for shipping responsibly. Going faster doesn't help if you're breaking production constantly.
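The arithmetic here is trivial; the hard part is deciding up front what counts as a failed deployment. A minimal sketch, assuming you have already tagged each deploy with a boolean from your incident tracker:

```python
def change_failure_rate(deploy_failed: list[bool]) -> float:
    """Percentage of deployments that caused an incident,
    required a rollback, or degraded customer-facing service."""
    if not deploy_failed:
        return 0.0
    return 100 * sum(deploy_failed) / len(deploy_failed)
```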
4. Mean Time to Recovery (MTTR)
What it measures: The average time from when a production incident is detected until the service is restored.
Why it matters: MTTR measures operational resilience. It reveals:
- How quickly your team detects problems
- How prepared you are to fix them
- Whether you have good incident response processes
- The maturity of your monitoring and alerting
High-performing teams recover from incidents in under an hour. Lower performers might take days.
How to measure it: For each production incident, record the time from detection to resolution. Calculate the average across all incidents in a period (month or quarter).
Real-world context: MTTR isn't about preventing all incidents—that's unrealistic. It's about recovering fast when things go wrong, minimizing customer impact.
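With detection and restoration timestamps exported from your incident tool, MTTR is a mean of durations. A sketch under that assumption; the tuple layout is mine, not a standard incident schema:

```python
from datetime import datetime

def mttr_minutes(incidents: list[tuple[datetime, datetime]]) -> float:
    """Mean minutes from detection to restoration.

    Each pair is (detected_at, restored_at) for one incident."""
    if not incidents:
        return 0.0
    total_seconds = sum((restored - detected).total_seconds()
                        for detected, restored in incidents)
    return total_seconds / len(incidents) / 60
```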
DORA Metrics Cheat Sheet
Here's a quick reference for each metric:
| Metric | High Performer | Low Performer | Measures |
|---|---|---|---|
| Deployment Frequency | Multiple times per day | Monthly or less | Speed & confidence |
| Lead Time for Changes | Less than 1 day | 1 month or more | Cycle time & flow |
| Change Failure Rate | 0-15% | 45%+ | Quality & stability |
| Mean Time to Recovery | Less than 1 hour | 1+ day | Operational resilience |
How to use this table:
- Find your team's current performance level for each metric
- Identify which metrics need improvement
- Focus on metrics that directly impact your business (e.g., if security is a concern, focus on lead time; if reliability matters most, focus on MTTR)
Beyond the Metrics: Implementation Challenges
Many teams measure DORA metrics but struggle to improve them. Why? Because the metrics alone don't tell you what to fix. They're signals, not solutions.
A team might have high deployment frequency but high change failure rate—which suggests they're deploying small changes frequently, but those changes are breaking production. The fix isn't "stop deploying"—it's "improve your testing practices."
This is where the challenge lies: translating metrics into action. Teams need visibility not just into the numbers, but into:
- Which services are struggling with deployment frequency
- Which code changes are causing failures
- Which teams are experiencing alert fatigue
- Which bottlenecks are slowing lead time
The Common Pitfall: Measuring Without Context
Many organizations implement DORA metrics dashboards but struggle with the next question: "Now what?" A dashboard showing that deployment frequency has improved from 1x/week to 2x/week is good news. But without context, it's incomplete.
Context questions that matter:
- Did we sacrifice quality to increase frequency? (Check change failure rate)
- Are we deploying faster because we removed testing? (Check for quality regressions)
- Is faster deployment coming at the cost of team stability? (Survey team satisfaction)
- Which teams are high performers? Which need support? (Segment metrics by team/service)
The answer to each question reveals whether improvement is genuine progress or short-term optimization with hidden costs.
Real-World Example: The Deployment Frequency Trap
A startup measures 5 deployments per week and celebrates. Then they investigate deeper:
- Deployment frequency: 5x/week (medium performer level)
- Change failure rate: 45% (low performer level)
- Lead time for changes: 3 days (moderate)
- MTTR: 8 hours (medium performer level, but long enough to hurt customers)
The story revealed: they're deploying frequently and small, which is good. But many of those deployments break production. When incidents occur, the team struggles to recover quickly because they lack good monitoring and runbooks. They then spend time firefighting instead of improving processes.
The solution wasn't "deploy less frequently." It was "improve testing, add monitoring, and create better incident response playbooks." Once they addressed those, the change failure rate dropped to 12%, MTTR to 45 minutes, and the team was healthier despite the same deployment frequency.
DORA Metrics in the Real World: Who's Using Them?
Google: Google's internal teams use DORA metrics extensively, and Google Cloud now runs the annual State of DevOps research behind the framework.
Stripe: Publicly credits its ability to move fast to strong deployment infrastructure and incident response practices.
Spotify: Uses DORA-aligned metrics to measure engineering effectiveness across hundreds of engineers.
GitHub: Uses variants of DORA metrics in their internal DevOps processes.
Slack: An early adopter whose engineering team has written about optimizing lead time specifically.
These organizations didn't choose DORA because it sounded good. They chose it because the metrics directly correlate with their business outcomes: faster feature delivery, higher reliability, and better team morale.
Understanding DORA Benchmarks
One of DORA's most valuable contributions is the benchmark data. Google's research classified teams into performance tiers:
High Performers (Top Quartile)
- Deployment Frequency: Multiple times per day (on demand)
- Lead Time for Changes: Less than 1 day
- Change Failure Rate: 0-15%
- Mean Time to Recovery: Less than 1 hour
- Organizational Impact: These teams typically show:
- Higher profitability
- Better market responsiveness
- Lower employee turnover
- Faster achievement of organizational goals
Medium Performers
- Deployment Frequency: 1-7 times per week
- Lead Time for Changes: 1 day - 1 week
- Change Failure Rate: 15-45%
- Mean Time to Recovery: 1 hour - 1 day
- Organizational Impact: Steady but not rapid delivery. Some process bottlenecks exist.
Low Performers
- Deployment Frequency: Monthly or less
- Lead Time for Changes: 1 month or more
- Change Failure Rate: 45%+
- Mean Time to Recovery: 1 day or more
- Organizational Impact: Slow delivery, quality concerns, potential team dissatisfaction
Important caveat: These benchmarks are industry averages, not absolute targets. A financial services firm operating under strict regulatory constraints might have legitimate reasons for slower deployment frequency. Context matters. What matters more is: are you improving your own metrics over time?
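One way to turn these bands into a quick self-assessment is to rate each metric against the cutoffs above and let the weakest metric set the overall tier, since a single severe bottleneck tends to dominate delivery in practice. A rough sketch; the numeric cutoffs mirror the tables above, but the weakest-link rule is my own simplification:

```python
def performance_tier(deploys_per_week: float, lead_time_days: float,
                     failure_rate_pct: float, mttr_hours: float) -> str:
    """Classify a team as High/Medium/Low by its weakest DORA metric."""
    def rate(value, high_cutoff, low_cutoff, higher_is_better):
        # 2 = high band, 1 = medium band, 0 = low band
        if higher_is_better:
            return 2 if value >= high_cutoff else 0 if value <= low_cutoff else 1
        return 2 if value <= high_cutoff else 0 if value >= low_cutoff else 1

    scores = [
        rate(deploys_per_week, 7, 0.25, True),   # daily vs. monthly or less
        rate(lead_time_days, 1, 30, False),      # < 1 day vs. 1 month or more
        rate(failure_rate_pct, 15, 45, False),   # 0-15% vs. 45%+
        rate(mttr_hours, 1, 24, False),          # < 1 hour vs. 1+ day
    ]
    return ["Low", "Medium", "High"][min(scores)]
```

Used on the startup from the earlier example (5 deploys/week, 3-day lead time, 45% failure rate, 8-hour MTTR), the weakest-link rule lands on "Low", which matches the story: frequency alone wasn't the problem.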
How AI Agents Automate DORA Tracking
Historically, measuring DORA metrics required manual work: engineers instrumenting code, building dashboards, and aggregating data from multiple tools (Git, CI/CD, incident management, monitoring).
AI agents are changing this. Modern agentic platforms can:
- Autonomously track deployment frequency from Git and CI/CD logs
- Calculate lead time by analyzing commit-to-deployment pipelines
- Monitor production incidents and flag failed deployments
- Alert teams when DORA metrics trend in the wrong direction
- Provide root-cause analysis when metrics degrade
The benefit: your team gets continuous, real-time visibility into delivery performance without the overhead of manual measurement. Engineers focus on improving the metrics instead of collecting them.
Getting Started with DORA Metrics
If you're new to DORA metrics, start here:
- Establish baselines: Measure each metric for your team as-is, today. Don't worry about high-performer benchmarks yet.
- Identify the constraint: Which metric is your biggest drag? Is it deployment frequency? Lead time? Start there.
- Make one improvement: Choose one process change that directly addresses your weakest metric. Example: if lead time is slow because of manual approvals, automate the checks those approvals exist to enforce, so changes need less manual gatekeeping.
- Measure the impact: Re-measure after 2-4 weeks. Did that one change improve the metric?
- Iterate: Pick the next constraint and repeat.
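Step 4 is just a before/after comparison, but it is worth normalizing the direction, since lower is better for lead time and MTTR while higher is better for deployment frequency. A small helper, purely illustrative:

```python
def improvement_pct(baseline: float, current: float,
                    lower_is_better: bool = True) -> float:
    """Percent improvement of `current` over `baseline` for one metric.

    Positive means the metric moved in the right direction."""
    if baseline == 0:
        return 0.0
    change = (baseline - current) / baseline * 100
    return change if lower_is_better else -change
```

For example, cutting lead time from 72 hours to 48 hours reads as roughly a 33% improvement, and doubling deployment frequency from 2/week to 4/week reads as 100%, so both trends point the same way on a dashboard.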
Conclusion
DORA metrics provide engineering leaders with a language for talking about delivery performance. They move the conversation from "We need to be faster" to "Our deployment frequency is 5x per month—let's get to daily." From vague to specific. From intuition to data.
Whether you're justifying DevOps investment, building a culture of continuous improvement, or benchmarking against industry standards, DORA metrics are an essential framework. Combined with modern tooling and observability, they become a real-time dashboard of your team's delivery health.
Start measuring today. The insights will surprise you.
Related Reading
- DORA Metrics: The Complete Guide for Engineering Leaders
- DORA vs SPACE Metrics: Which Framework Should You Use?
- Deployment Frequency: The DORA Metric That Reveals Your True Engineering Velocity
- Change Failure Rate: The DORA Metric That Reveals Your Software Quality
- Mean Time to Recovery: The Complete Guide to Faster Incident Resolution
- Lead Time for Changes: How to Measure and Optimize