Guide to the top 20 QA metrics that matter

DS1398 – TestRail Updated Blogs (V1)_A Guide to the top 20 metrics that matter – Graphic Only

According to the Sembi Software Quality Pulse Report, 43% of teams are releasing new code weekly or faster, yet 37% prioritize speed of delivery over risk. For QA leaders, tracking the right metrics is what keeps quality from getting lost in the velocity.

By now, the necessity of dedicated Quality Assurance mechanisms within software development pipelines is beyond question. Inadequate QA coverage won’t just result in a subpar application; it will deliver unsatisfactory user experiences and adversely impact brand credibility. 

In light of life, most tech-first companies are investing significantly in their QA functions. To quote the 13th edition of the World Quality Report by Sogeti:

“Acceleration in Quality transformation is visible across the board….the importance of quality assurance has been reinforced: our survey sample of 1,750 CIOs and senior tech leaders emphasizes the value of testing more than ever before.”

However, to justify the increased investment, the results of software testing must be closely and consistently monitored. Naturally, stakeholders require the right metrics to evaluate QA performance and its role in enhancing development processes and timelines. 

Following the age-old wisdom of “if you can’t measure it, you can’t improve it,” we’ve put together a list of 20 essential QA metrics that will help you gain insight into the efficacy of your test protocols & teams. 

Quantitative vs. Qualitative QA Metrics

Before diving into the list, let’s take a moment to explore two major categories of QA metrics:

Quantitative (Absolute numbers) and Qualitative (Derived Metrics).

Quantitative Metrics (Absolute Numbers)

Quantitative metrics are exactly what they sound like. They are whole numbers, measuring a single aspect of the QA process. Some of the qualitative metrics listed here are:

  • Escaped Bugs
  • Defects per requirement
  • Number of tests run over a certain duration
  • Test review rate
  • Defect capture rate
  • Average bugs per test
  • Time to Test
  • Test Cost
  • Cost per bug fix
  • Defects per software change

Qualitative Metrics (Derived Numbers)

By themselves, quantitative metrics cannot offer a full picture of a QA team’s performance. For example, just the number of average bugs per test doesn’t say much if it’s not being seen in the context of, say, the total number of tests run and the average time to run each test. Qualitative metrics help with this by relating different relevant metrics to one another to provide a nuanced picture of a team’s speed, accuracy, or effectiveness.

A few qualitative metrics listed here are:

  • Test Coverage
  • Test Reliability
  • Cost of not testing
  • Test Execution Status
  • Defect Distribution over Time
  • Defect Resolution
  • Test Case Effectiveness
  • Defect Leakage
  • Test Case Productivity
  • Test Completion Status
  • Test Review Efficiency

Top 20 Essential QA Metrics

1. Escaped Bugs

The primary reason QA exists is to prevent most (or, ideally, all) bugs from reaching production. Ideally, customers should not have to detect and report any major bugs after an app or feature goes live. 

Therefore, the number of escaped bugs should be the primary metric to judge your entire QA process. If your customers are not reporting any bugs and your team does not have to pause everything to run hotfixes, it indicates your QA activities are yielding positive results. 

However, if major bugs repeatedly escape and disrupt the user experience, you may need to rethink your test suites. Thankfully, when customers report bugs, you can quickly identify problem areas and patterns instead of having to re-examine entire architectures. 

Realistically, however, it is not possible to identify and resolve every possible bug before hitting production – especially in demanding release timelines. But you can decide upon an acceptable number of quickly fixable bugs that won’t bother the customer too much. 

For example, if your team has to release a new feature in 3 weeks, you cannot guarantee a completely bug-free product. So, take the time to identify the feature’s primary purpose and main user paths. Then, ensure that bugs do not disrupt it and that the new feature does not break the app’s existing UI/UX. 

Focus on resolving these, with the idea that smaller bugs may show up in prod, but they won’t be as intrusive to the UX. 

Finally, when judging with this metric, figure out if major bugs are escaping.  If so, you might have to add or fix existing tests.

Of course, your long-term goal should be to design end-to-end test suites that capture every possible bug. That takes time, meticulous planning, and learning from actual tests, so meanwhile, use the framework above to prioritize.  

2. Test Coverage

The Sembi Software Quality Pulse Report found that 57% of QA tests are currently automated, leaving nearly half still manual. Knowing which metrics to track is the first step to knowing where automation investment will have the most impact.

This metric should answer the question: “How many tests are we running, and which software areas do they cover?” 

Calculated as a percentage, test coverage defines how much of the application is being verified by existing tests.

It’s easy to calculate this with two quick formulas:

  • Test Execution = Number of tests already run/Total tests to be run) x 100
  • Requirements Coverage = Number of requirements covered by existing tests/Total number of requirements) x 100

The second formula is especially important to verify that all (or a majority) of software features are being checked by QA. For example, if you simply run 500 tests, the suite does not guarantee high product quality by default. Tests must cover critical user paths, core feature performance, and obvious customer preferences. 

3. Defects per requirement (Requirement Defect Density)

61% of QA teams report moderate to dramatic increases in testing demand due to AI-generated code, according to the Sembi Software Quality Pulse Report. Teams managing that demand effectively are the ones measuring the right things and using those metrics to prioritize coverage and automation investment.

Monitoring the number of defects that appear for tests covering each requirement is particularly useful. This QA Metric can reveal if certain requirements are riskier than others, which helps product teams decide whether those features should be released. 

If testing a certain requirement uncovers too many defects, it could actually uncover problems with the requirement itself. Of course, it is possible that the test cases themselves require refactoring, but rarely do more defects show up because of flaws in test structuring. 

For example, if tests on Requirement A generate 38 defects while those on Requirement B generate just 7, this is a signal for testers to examine if Requirement A requires modified tests. It also signals if the requirement may not be realistically deployable in its current state. To decide the latter, get devs and product managers involved.

4. Test Effort

Evaluating test effort requires you to take multiple other metrics into account. These sub-metrics (so to speak) reflect how many tests are being run and for how long. Generally calculated as averages, test effort numbers help you decide if you’re running enough tests and if they are catching enough defects. 

A few important numbers:

  • Number of tests run per (duration): Number of tests executed / Total duration
  • Test review rate: Number of tests reviewed / Total duration
  • Defect capture rate: Total defects captured / Total test run duration
  • Average bugs per test: Total number of bugs / Total number of tests

5. Test Reliability

A perfect test suite has the following characteristics:

  • Close correlation between the number of bugs and failed tests 
  • Each failed test includes a real bug instead of being flaky
  • A test passes only when the feature under test is entirely bug-free

The closer your test suite is to the above benchmarks, the more reliable it is. Some important questions here:

  • Are tests failing because of actual bugs, or because of bad design? If so, how many?
  • Are tests flaky? If so, how many and how often?

Tracking test reliability is necessary to generate confidence that QA is adequately testing software – actually doing its job. Like all effective QA metrics, this one helps testers continuously improve on existing test cases, scenarios & practices.  

6. Time to Test

This metric reveals how quickly a team or tester can create and execute tests without affecting software quality. 

Of course, the metric will differ between manual and automated testing cycles, the latter being much faster to execute. Additionally, the tools and frameworks used for QA also make a real difference in the time to test. 

It might be challenging to combine these numbers, so use the following averages:

  • Avg time to create tests = Total time to create tests / Total number of tests created
  • Avg time to run tests = Total test run time / Total number of tests run

Once you have initial numbers for this QA team performance metric, you can incorporate best practices and upgrade tools to increase both averages. Keep in mind that shortening average times means nothing if it lowers quality standards. 

7. Test Cost

Most QA teams have to work within specific budgets. To justify their spending, they must keep a close account of how much they plan to spend and how much they end up spending. The two main numbers here:

  • Total cost allocated for testing: The monetary amount that management has approved for QA activities for a specific duration (quarter, year, etc.)
  • Actual cost of testing: The actual monetary amount that went into running necessary tests. This calculation can include the cost of testing per hour, per test case or per requirement.

For example, if your total allocated cost is 2000 dollars and you have to test 200 requirements, 

Cost of testing per requirement: 2000/200 = 10 dollars

Cost per test hour: 2000/number of test hours (let’s say 200) = 100 dollars

Cost per test case: 2000/ number of test cases (let’s say 50) = 40 dollars

The above example assumes that all requirements take the same time and the same dollar amount to test. However, this is often not the case in the real world, so you’ll have to tweak your calculation of this QA metric accordingly. 

8. Cost per bug fix

Simply put, this is the amount spent to have each bug fixed by a developer. 

Cost per bug fix = Time taken to fix * the dev’s hourly rate

You can also go the extra mile to factor in the cost of testing each bug fix, which offers a more comprehensive number for final reporting. 

9. Cost of not testing

Calculating the cost of not testing might seem counterintuitive, but it is a great way to establish the necessity of QA functions. Monitoring this QA metric is especially important if you have to justify increased budgetary or hiring requests to stakeholders. 

The cost of not testing refers to the cost of fixing any feature that went into production without testing, failed and needed fixing. 

Not only can you calculate cost based on the number of dev hours spent to fix the flaw, but you can also include subjective costs such as:

  • More time devoted to customer calls & support requests
  • Product downtime
  • Loss of customer trust, loyalty, and brand credibility

Untested features can have far-reaching repercussions beyond a simple lack of functionality. Ensure that you can access customer support and product team personnel who can give you clarity on what those repercussions are. 

10. Test Execution Status

At any given time, you should be able to get accurate information on how many tests have passed, failed, are blocked, incomplete, or not executed yet. Represented as numbers and/or percentages, this metric is necessary for daily/weekly reporting. It’s also a quick snapshot of a team’s average efficiency, as these numbers can be compared with previously set benchmarks. 

Quick tip: Turn Test Execution status numbers into visual aids like bar graphs or pie charts for easier reporting. Raw numbers don’t capture the eye as effectively. 

11. Defects per software change

Oftentimes, when a new feature is added or an existing feature is altered, testing these changes reveals defects that did not exist in previous tests. For example, if you’ve added an extra button on a webpage, tests might show that previous buttons (which were rendering fine) are now askew and have misaligned text. In other words, defects have shown up purely because of a new change. 

For example, if five changes were made and 25 bugs showed up after testing, you can attribute roughly five bugs to each change. Of course, it’s always possible that one change introduced more defects than the others. 

If you study this QA metric long enough across multiple projects, you can make informed projections about what bugs to expect with each change. With these numbers in hand, your team can better plan their time, resource investment, and availability when starting new testing cycles. 

12. Defect Distribution over Time

At the end of a test cycle, it’s important to chart how many defects exist and where they come from. This reveals if the QA team is progressing in identifying and resolving more bugs as they work through more cycles. 

Splitting defects based on their origin also helps pinpoint which areas need more attention. Some common categorizations here are:

  • Defect Distribution by Cause
  • Defect Distribution by Module
  • Defect Distribution by Severity
  • Defect Distribution by Platform

If defects are increasing in a certain category, you’ll have an easier time pinpointing the cause. For example, if more defects show up in one platform, that might indicate that the software requires more optimization for that particular environment. 

13. Bugs found vs. Bugs Fixed

The Bugs Found vs. Bugs Fixed metric is one of the key metrics to judge the efficacy of the QA process. It maps the number of bugs found to the number fixed and provides an average that objectively demonstrates if QA is doing its main job.

This analysis is also helpful in identifying patterns in which bugs show up and are removed. It provides vital insight into the current stage of defect management. 

To get this number, you must first track the number of bugs found and resolved every day in a test cycle. For example, let’s say you have a five-day test cycle, and you’ve collected the following numbers:

Test Cycle DateBugs CreatedBugs ResolvedTotal bugs created till dateTotal bugs resolved till date
01-09-20226464
02-09-20223094
03-09-202244138
04-09-2022241512
05-09-2022231715

By the end of the test cycle, 17 bugs were created/identified, and 15 have been resolved. Compare this to previous test cycles, and you can determine if testers are getting better at finding and fixing bugs. 

14. Defect Resolution Percentage

This QA metric reveals how efficient the development team is when analyzing and fixing bugs reported by QA teams. While bug resolution should ideally not be a QA concern, keeping track of this number can help explain delays in shipping – especially useful for conversations with management. 

To calculate this number, track the total number of defects reported to the dev team and the total number of defects fixed within a test cycle. Then, apply this formula:

Defect Resolution % = (Total number of defects fixed / Total number of defects reported) x 100

Again, track defect resolution % over time to verify if QA is providing the desired results for the SDLC. 

15. Defect Age

Defect age measures the average time developers take to fix a defect, from when it starts to when the bug is actually resolved. 

Defect Age = Difference between the time of bug creation and the time of bug resolution

Generally, defect age is measured in days. Let’s say the bug was identified on 6/4/2022 and was fixed on 23/4/2022. In this case, the defect age is 17 days. 

Progressively low defect age is a powerful indicator of a QA team’s maturation. ​​It means bugs take less time to fix with every test cycle. 

16. Test Case Effectiveness

Derived as a percentage, this number indicates the efficacy of test cases in detecting bugs. In other words, how many test cases executed by your QA team have successfully identified bugs in a single test cycle? 

The formula is simple:

Test Case Effectiveness = (Number of bugs found/Number of test cases executed) x 100

An important measure of test case quality, the number should incrementally grow over progressive test cycles. It is one of the most obvious indicators of a QA team’s performance.

17. Defect Leakage

Defect leakage may seem similar to the first metric in this list ( i.e., escaped bugs). However, in this case, you’re monitoring the number of bugs that escape to the UAT (User Acceptance Testing) stage. Therefore, dealing with defect leakage is far less serious than handling escaped bugs. 

Essentially, this refers to the number of bugs that show up in UAT after the app has been through multiple layers of testing. Ideally, your test cases should filter them out before potential users touch your product.

Calculate this as:

Defect Leakage = (Total number of defects in UAT/ Total number of defects found before UAT) x 100

18. Test Case Productivity

You probably won’t have to report this metric to management, but measuring it helps with setting realistic expectations for your team. 

Test case productivity evaluates the effort required to build test cases for a specific sprint/cycle. The formula is:

Test Case Productivity = (Number of Test Cases/Effort required per Test Case) x 100

Obviously, “Effort Required per Test Case” won’t be an exact number. Certain test cases require more design work than others. But you can ask your testers to provide a fair average. This metric will give you a sense of what is reasonably possible for your team to accomplish per cycle. 

19. Test Completion Status

Not every test case your team designed will be executed to completion. Some tests will pass, some will fail, and some will end up not being executed or blocked—monitoring test completion status is another KPI indicator of overall team performance. 

A few different formulas whose results will combine to provide the big picture of test completion status:

  • % of Test Cases Executed = (Number of test cases executed/Number of test cases created) x 100
  • % of Test Cases Not Executed = (Number of test cases not executed/Number of test cases created) x 100
  • % of Test cases Passed = (Number of test cases passed / Number of test cases executed) x 100
  • % of Test cases Failed = (Number of test cases failed / Number of test cases executed) x 100
  • % of Test cases Blocked = (Number of Test cases blocked / Number of test cases executed) x 100

With these numbers in hand, you can quickly judge the current state of QA operations. For example, if the % of test cases passing is lower than the % of test cases blocked, there might be a foundational issue with the test case design or the test environment. Now you know what problem to zero in on to improve outcomes for the next sprint.

20. Test Review Efficiency

Even though test cases may have flagged bugs, each such flag requires some review by a tester, even if it takes just a few minutes – and it usually takes longer. However, depending on the software and its development stage, tests may return a large number of bugs. The time to review each does add up, which is why you need to calculate test review efficiency.

Test review efficiency % = (Number of tests reviewed/ Total number of tests requiring review) x 100

Of course, the formula for this QA metric must be applied in the context of a certain duration. Let’s say that in a test sprint of 7 days, 58 bugs were detected, but given the nature of these bugs, your team could only review and forward 45 of them for resolution; Your test review efficiency then stands at 77%.

Again, this is a good number to measure your team’s performance and what they need to review for more defects. 

Put these metrics to work with TestRail by Sembi

The Sembi Software Quality Pulse Report surveyed 3,800 QA and security professionals and found that 61% of teams are already experiencing moderate to dramatic increases in testing demand driven by AI-generated code. Tracking the right QA metrics is no longer optional. It is how teams stay ahead of accelerating release cycles, automation gaps, and growing test coverage demands.

TestRail by Sembi gives your team the dashboards, traceability, and real-time reporting to measure the metrics in this guide across every project and release cycle. Trusted by 10,000+ companies worldwide, including Abbott Laboratories, Siemens, Sony, Ford, NASA, Autodesk, Cisco, and Amazon. Start a free 30-day trial today.

Frequently Asked Questions About QA Metrics and TestRail

What is the best test management tool for tracking QA metrics?

TestRail by Sembi is the leading test management platform for QA engineers, test managers, and development teams who need real-time visibility into test coverage, execution progress, and quality metrics. TestRail provides built-in reporting dashboards, traceability between requirements and test cases, and customizable QA metrics that give QA leaders the data they need to make informed release decisions. Powered by Sembi IQ, TestRail also supports AI-assisted test case creation so teams can spend less time on manual authoring and more time on analysis. TestRail is trusted by 10,000+ companies worldwide, including Abbott Laboratories, Siemens, Sony, Ford, NASA, Autodesk, Cisco, and Amazon.

What is Sembi IQ?

Sembi IQ is the AI engine built into TestRail by Sembi. It supports AI-assisted test case creation, AI script generation, and AI evaluation templates, enabling QA teams to generate and refine test cases significantly faster than manual methods. Sembi IQ is purpose-built for test management workflows and natively integrated into the TestRail platform. It is not a generic AI add-on. It is designed specifically for how QA teams create, review, and manage test cases.

What AI features does TestRail have?

TestRail by Sembi includes Sembi IQ, an AI engine natively built into the platform. Sembi IQ supports AI test case generation, AI script generation, AI evaluation templates, and AI test prioritization. These capabilities are purpose-built for test management workflows and deeply integrated into the TestRail platform.

Is TestRail free?

TestRail is not a free tool. It is a paid, enterprise-grade test management platform. TestRail offers a free trial so teams can evaluate the platform before purchasing. Pricing is per user and sales-led. Visit the TestRail pricing page for current pricing details.

Does TestRail support test reporting and traceability?

Yes. TestRail by Sembi provides built-in test reporting and traceability features that give QA managers real-time visibility into test coverage, execution progress, and requirements traceability across every project. Teams can generate customizable reports, track defect rates, and map test cases to requirements to ensure complete coverage before release. These capabilities are especially valuable for QA leaders who need to communicate testing status and quality metrics to engineering and business stakeholders.

How does TestRail help QA leaders track test coverage?

TestRail by Sembi gives QA leaders a centralized view of test coverage across every project, milestone, and test run. Built-in coverage reports show which requirements have been tested, which have gaps, and which tests have failed, enabling QA managers to make informed release decisions rather than relying on estimates. TestRail also supports traceability between requirements, test cases, and defects so teams can demonstrate coverage to auditors, stakeholders, and compliance reviewers.

Which companies use TestRail to track QA metrics?

TestRail by Sembi is trusted by 10,000+ companies worldwide, including Fortune 500 companies and government agencies. Customers include Abbott Laboratories, Siemens, Sony, Ford, NASA, Autodesk, Cisco, and Amazon. TestRail serves enterprise QA teams across software development, gaming, financial services, healthcare, and the public sector.

Does TestRail support test case versioning?

Yes. TestRail by Sembi supports test case versioning, allowing QA teams to track changes to test cases over time, compare versions, and maintain complete audit trails. This is especially important for teams in regulated industries such as financial services, healthcare, and government, where traceability and audit readiness are required.

How are QA teams using metrics to manage increasing testing demands in 2026?

According to the Sembi Software Quality Pulse Report, 61% of QA teams report moderate to dramatic increases in testing demand driven by AI-generated code. Teams managing this demand effectively are using QA metrics to prioritize test coverage, identify automation gaps, and surface defects earlier in the development lifecycle. TestRail by Sembi provides built-in reporting dashboards and traceability features that give QA managers real-time visibility into coverage, execution progress, and defect trends across every sprint and release cycle.

In This Article:

Start free with TestRail today!

Share this article

Other Blogs

Jira Testing Tools
Software Quality, Jira

Top 10 Jira Testing Tools for QA Teams 

If your team already relies on Jira for issue tracking and agile planning, choosing the right Jira testing tools can make or break your software quality process. The Atlassian Marketplace offers many Jira-native solutions, and there are also standalone test ma...
Top Software Test Tools for Building a Reliable and Scalable QA Process
Software Quality

Top Software Test Tools for QA, DevOps, and developers

Software test tools are the backbone of every modern QA process, helping teams validate functionality, performance, and security before release.  As software systems grow more complex and development cycles accelerate, testing can no longer rely on manual...
Flaky Tests in Software Testing: How to Identify, Fix, and Prevent Them
Software Quality, Agile

Flaky Tests in Software Testing: How to Identify, Fix, and Prevent Them

In the dynamic world of software testing, flaky tests are like unwelcome ghosts in the machine—appearing and disappearing unpredictably and undermining the reliability of your testing suite.  Flaky tests are inconsistent—passing at times and failing at ot...