Understanding Pre-Test and Post-Test Probabilities
Before any diagnostic test, a patient carries a baseline likelihood of having a condition—this is the pre-test probability, also called prevalence. It reflects how common the disease is in the general population or patient demographic being evaluated.
After the test result is known, that probability shifts. A positive result on a sensitive test increases suspicion; a negative result on a specific test decreases it. The post-test probability is the revised likelihood after incorporating the test's performance.
These probabilities differ fundamentally from odds. If a condition affects 1 in 4 people, the probability is 25%, but the odds are 1:3 (or stated as odds of 0.33). Converting between them is essential for Bayesian calculations:
- Odds = Probability ÷ (1 − Probability)
- Probability = Odds ÷ (1 + Odds)
Post-Test Probability Calculation
The full calculation chain begins with prevalence and incorporates test accuracy metrics. Three pathways exist depending on what information you provide:
Pre-Test Odds = Prevalence ÷ (1 − Prevalence)
Post-Test Odds = Pre-Test Odds × Likelihood Ratio
Post-Test Probability = Post-Test Odds ÷ (1 + Post-Test Odds)
Where the likelihood ratio (LR) comes from:
LR+ = Sensitivity ÷ (1 − Specificity)
LR− = (1 − Sensitivity) ÷ Specificity
Prevalence— Baseline proportion of people with the condition (pre-test probability)Sensitivity— Proportion of diseased individuals with a positive test result (true positive rate)Specificity— Proportion of non-diseased individuals with a negative test result (true negative rate)LR+— Positive likelihood ratio; how much a positive test increases odds of diseaseLR−— Negative likelihood ratio; how much a negative test decreases odds of disease
Deriving Sensitivity, Specificity, and Likelihood Ratios
If you have a contingency table of test outcomes, you can extract all needed metrics. Begin with your four-cell confusion matrix:
- True Positives (TP): Disease present, test positive
- True Negatives (TN): Disease absent, test negative
- False Positives (FP): Disease absent, test positive
- False Negatives (FN): Disease present, test negative
From these, calculate:
- Sensitivity = TP ÷ (TP + FN) — catches diseased cases
- Specificity = TN ÷ (TN + FP) — rules out healthy individuals
- Prevalence = (TP + FN) ÷ (TP + FN + FP + TN) — overall disease frequency
A high-sensitivity test is excellent for ruling out disease; a high-specificity test is excellent for ruling in disease.
Interpreting Likelihood Ratios
Likelihood ratios quantify how much a test result shifts probability. An LR+ of 5 means a positive result is five times more likely in someone with the disease than without it. Conversely, an LR− of 0.2 means a negative result is five times more likely in someone without the disease.
Rough clinical thresholds:
- LR+ > 10 — Strong evidence for disease; minor shift in suspicion usually sufficient
- LR+ 5–10 — Moderate increase; useful in combination with other data
- LR+ 1–5 — Small increase; often insufficient alone
- LR− < 0.1 — Strong evidence against disease; excellent at ruling out
A likelihood ratio near 1.0 indicates the test provides almost no diagnostic value.
Practical Pitfalls and Caveats
Avoid these common mistakes when interpreting post-test probabilities.
- Don't Ignore Baseline Prevalence — A highly specific test applied to a rare disease still produces many false positives if prevalence is low. The post-test probability depends critically on the starting prevalence; ignoring it leads to overconfidence in positive results. Always anchor calculations to the correct population baseline.
- Confusing Sensitivity and Positive Predictive Value — Sensitivity (TP rate in the diseased) is a property of the test itself and doesn't change with prevalence. Positive predictive value (probability disease is present given a positive test) is what clinicians actually want and does shift with prevalence. The calculator computes post-test probability, which is equivalent to PPV—always verify you're interpreting the right metric.
- Applying Test Metrics from the Wrong Population — Sensitivity and specificity measured in a hospital cohort may differ substantially in primary care. If the validation study enrolled sicker or healthier patients than your population, those metrics don't apply directly. Seek evidence from your own clinical setting or adjust for demographic differences.
- Chaining Multiple Tests Naively — When two tests are applied sequentially, the second test's baseline prevalence is now the first test's post-test probability. Using pre-test population prevalence for both tests double-counts evidence. Recalculate the likelihood ratio or adjust expectations for conditional dependence between tests.