Understanding Normal Approximation to Binomial
The binomial distribution describes the probability of observing exactly k successes in N independent trials, each with success probability p. For small N, the binomial probabilities follow a stepped, discrete pattern. However, as N grows, this stepped distribution converges to a smooth, continuous curve that closely matches a normal (Gaussian) distribution centered at μ = Np.
This convergence is what the central limit theorem guarantees: the sum of many independent random events tends toward normality, regardless of the underlying distribution. By treating the binomial as normal, you avoid computing factorial-heavy binomial coefficients and instead use standard normal tables or software to find probabilities in seconds.
The approximation works best when both N × p ≥ 5 and N × (1 − p) ≥ 5, ensuring the binomial isn't too skewed toward either tail.
Key Formulas
Given N trials with success probability p, calculate the mean, variance, and standard deviation of the equivalent normal distribution. Then standardize your observation using the z-score formula to look up probabilities in the standard normal table.
μ = N × p
σ² = N × p × (1 − p)
σ = √(σ²)
z = (x − μ) / σ
N— Number of independent trials or occurrencesp— Probability of success on each trial (0 < p < 1)q— Probability of failure, equal to 1 − pμ— Mean of the normal distribution (expected number of successes)σ²— Variance of the binomial distributionσ— Standard deviation of the normal distributionx— Number of observed successes (the value you want probability for)z— Standardized score; the distance from the mean in units of standard deviation
When to Apply the Normal Approximation
Not every binomial problem benefits from normal approximation. The rule of thumb is straightforward: both N × p and N × (1 − p) must exceed 5. This ensures the binomial distribution is reasonably symmetric and close enough to normal shape.
For example:
- Fair coin, 100 tosses: N × p = 100 × 0.5 = 50 ✓ Use normal approximation
- Rare event (p = 0.01), 50 trials: N × p = 50 × 0.01 = 0.5 ✗ Sample too small; approximation fails
- Rare event (p = 0.01), 1000 trials: N × p = 1000 × 0.01 = 10 ✓ Large enough now
When p is very close to 0 or 1, or when N is small, stick with exact binomial calculations or Poisson approximation instead.
The Continuity Correction
Because the binomial is discrete (0, 1, 2, ... successes) and the normal is continuous, a small adjustment improves accuracy. Add or subtract 0.5 from your observed count before calculating the z-score:
- For P(X = k): Use P(k − 0.5 < X < k + 0.5)
- For P(X ≤ k): Use P(X < k + 0.5)
- For P(X < k): Use P(X < k − 0.5)
- For P(X ≥ k): Use P(X > k − 0.5)
For large N (say, N > 100), the 0.5 correction matters less, but including it is good practice and ensures consistency with statistical software defaults.
Common Pitfalls and Practical Advice
Avoid these frequent mistakes when using normal approximation to binomial probabilities.
- Forgetting the sample size check — Many skip the <em>N</em> × <em>p</em> ≥ 5 test and apply the approximation blindly. Always verify both conditions before trusting your result. If either is false, use exact binomial or Poisson instead.
- Omitting continuity correction — The 0.5 adjustment is small but systematic. Ignoring it introduces consistent bias, especially when <em>N</em> is moderate (10–50). Most statistical references and software apply it by default, so do the same for consistency.
- Confusing z-score direction — The formula is (x − μ) / σ, not (μ − x) / σ. A negative z-score means your observation is below the mean. Double-check the sign before looking up probabilities in the standard normal table.
- Using the approximation for extreme proportions — When <em>p</em> ≈ 0 or <em>p</em> ≈ 1 (e.g., defect rates of 1% or 99%), the binomial skews heavily. Large <em>N</em> helps, but the normal approximation still may underperform compared to exact methods or Poisson approximation for rare events.