What is the Hypergeometric Distribution?
The hypergeometric distribution describes outcomes when sampling without replacement. Suppose your population contains a fixed number of items, some with a desired property (successes) and others without. When you draw a sample, the probability of success changes after each item is removed—unlike the binomial distribution where replacement keeps odds constant.
This distribution appears whenever items cannot be redrawn: selecting employees for a committee, inspecting parts from a batch, drawing cards from a deck, or evaluating defective units in manufacturing. The key constraint is sampling without replacement, making hypergeometric probability essential for finite-population scenarios.
The Hypergeometric Probability Formula
Three parameters fully define the hypergeometric distribution:
- N — total population size
- K — number of success states in the population
- n — sample size drawn
- k — number of successes observed in the sample
The probability mass function for exactly k successes is:
P(X = k) = C(K,k) × C(N−K, n−k) / C(N,n)
where C(a,b) = a! / (b!(a−b)!)
N— Total number of items in the populationK— Number of items with the desired property in the populationn— Number of items drawn in a single samplek— Number of successes (desired property items) in the drawn sample
Mean, Variance, and Distribution Properties
Two key descriptive statistics characterize any hypergeometric distribution:
Mean (Expected Value): The average number of successes in a sample is proportional to the success rate in the population.
μ = n × K / N
Variance: Variability depends on both the success proportion and the ratio of sample size to population size. Smaller populations or larger samples relative to the population reduce variance.
σ² = n × (K/N) × ((N−K)/N) × ((N−n)/(N−1))
Notice the correction factor (N−n)/(N−1) that accounts for sampling without replacement. As N grows very large, this factor approaches 1, and hypergeometric probabilities converge toward binomial probabilities.
Hypergeometric vs. Binomial Distribution
Both distributions calculate success probabilities over multiple trials, but differ fundamentally:
- Hypergeometric: Sampling without replacement. Each draw removes an item, shifting the population composition and success probability for the next draw.
- Binomial: Sampling with replacement (or infinite population). Success probability remains constant across all trials.
In practice, when sample size is much smaller than population size (rule of thumb: n < 5% of N), hypergeometric and binomial results are nearly identical, and binomial calculations become easier. For small populations or large samples relative to population, hypergeometric is the only correct choice.
Common Pitfalls When Using Hypergeometric Probabilities
Avoid these frequent mistakes when applying hypergeometric calculations:
- Confusing sampling with and without replacement — Hypergeometric assumes you cannot redraw the same item. If you're sampling with replacement, use binomial distribution instead. This distinction matters most when the population is small or the sample is large relative to N.
- Incorrectly counting success states in the population — Ensure K (successes in population) does not exceed N, and that K ≥ 0. Similarly, verify that n ≤ N and that the maximum possible successes in the sample cannot exceed min(K, n). Miscounting population successes invalidates all downstream calculations.
- Applying hypergeometric to large populations unnecessarily — When N is very large (thousands or millions) and n is relatively small, computational complexity increases without meaningful accuracy gain over binomial. Use binomial as a practical approximation in these scenarios, or limit precision expectations due to rounding in factorials.
- Misinterpreting cumulative probabilities — P(X ≥ k) means k or more successes. P(X ≤ k) means k or fewer. Ensure you select the correct cumulative direction. The tail direction matters for hypothesis testing and decision-making.