Understanding False Positives in Diagnostic Testing
A false positive occurs when a diagnostic test returns a positive result for someone who is actually healthy. In a two-by-two contingency table of test outcomes and true disease status, false positives sit alongside true positives (correct disease detection), true negatives (correct health confirmation), and false negatives (missed disease cases).
The frequency of false positives depends on two critical parameters: the specificity of the test (its ability to correctly identify healthy individuals) and the prevalence of the disease in the population being tested. A highly specific test produces fewer false positives, but even perfect specificity cannot prevent false positives entirely if prevalence is high—because more diseased individuals exist to test positive genuinely.
Calculating False Positives and Related Metrics
Three interconnected equations govern false positive calculations. The first two tell us how many individuals fall into each category when testing a population, while the third expresses test accuracy as a simple rate.
False Positives = (1 − Specificity) × (1 − Prevalence)
True Negatives = Specificity × (1 − Prevalence)
False Positive Rate = 1 − Specificity
Specificity— The proportion of healthy people correctly identified by the test, expressed as a decimal (0 to 1) or percentage (0–100%).Prevalence— The proportion of the disease in the tested population, expressed as a decimal (0 to 1) or percentage (0–100%).False Positive Rate— The probability that a positive test result occurs in a healthy individual; equivalent to one minus specificity.
The Relationship Between Specificity and False Positive Rate
The false positive rate has an elegant mathematical relationship with specificity: it is simply 100% − Specificity. This means a test with 95% specificity has a false positive rate of 5%—regardless of disease prevalence.
This independence from prevalence is counterintuitive but correct. Prevalence determines how many false positives occur in absolute numbers (via the first equation), but the rate at which positive tests are false among healthy people depends only on specificity. For example:
- A test with 90% specificity has a 10% false positive rate in all populations.
- In a disease-rare setting, this produces few false positives in absolute terms but still a 10% error rate among the healthy.
- In a high-prevalence setting, the same test generates more false positives overall but maintains the same 10% error rate.
True Negatives: The Complement of False Positives
True negatives represent healthy individuals correctly identified as disease-free. Their count is calculated as Specificity × (1 − Prevalence), making them the inverse of the false positive equation.
Understanding this distinction is crucial for interpreting test results. In screening programmes with low disease prevalence, the vast majority of negatives tests are true negatives—correctly reassuring people. However, positive tests in the same low-prevalence setting are more likely to be false positives. This phenomenon is why high-specificity tests are preferred in low-prevalence screening and why clinicians often request confirmatory testing after a positive screen in rare diseases.
Common Pitfalls When Working With False Positives
Avoid these frequent mistakes when calculating or interpreting false positive metrics.
- Confusing Prevalence with False Positive Rate — Prevalence alone does not determine the false positive rate. Two populations with identical disease prevalence but different test specificities will have different false positive rates. Always include specificity in your calculations.
- Forgetting That Rates Are Proportions Among Health — The false positive rate is the proportion of positive tests that are wrong among healthy individuals only—not among all positive tests. This is distinct from positive predictive value, which accounts for prevalence and tells you the probability a positive test is truly diseased.
- Applying Population-Level Equations to Individual Predictions — The formulas yield proportions or rates for populations. They don't predict whether any single positive test is a false positive; they estimate how many errors to expect across many tests.
- Neglecting Test Specificity Variation — Specificity can vary by equipment calibration, operator skill, or patient factors. Always verify the specificity figure used in your calculation comes from a reliable, relevant clinical study.