Understanding the Central Limit Theorem
The central limit theorem describes how the distribution of sample means behaves. When you repeatedly draw samples from any population and calculate their means, those means themselves form a roughly normal (bell-shaped) distribution—even if the original population is skewed or non-normal. This remarkable property holds as long as your sample size is large enough.
This theorem underpins nearly all modern statistical inference. It allows researchers to make probability statements about sample means without knowing the population's exact shape. The larger your samples, the closer the sampling distribution approaches perfect normality, and the tighter the clustering around the true population mean.
Sample Statistics from Population Parameters
When working with the central limit theorem, you need two key relationships. The sample mean equals the population mean (assuming unbiased sampling), while the standard deviation of sample means—called the standard error—shrinks as your sample grows larger.
Sample Mean (x̄) = μ (population mean)
Standard Error (SE) = σ ÷ √n
μ— Population mean—the average of all values in your entire populationσ— Population standard deviation—measures spread in the full populationn— Sample size—the number of observations in your samplex̄— Sample mean—the average of your sample valuesSE— Standard error—the standard deviation of the sampling distribution of the mean
Applying the Central Limit Theorem in Practice
Suppose you know a city's average age is 60 years with a standard deviation of 35 years. You plan to survey 49 residents randomly. Using the theorem:
- Your sample mean will centre around 60 years
- The standard error becomes 35 ÷ √49 = 35 ÷ 7 = 5 years
- About 95% of repeated samples will have means between 50 and 70 years
This shrinking of variability with larger samples is why pollsters and quality engineers conduct sizeable surveys. A sample of 49 is far more stable than a sample of 9, where the standard error would be 35 ÷ 3 ≈ 11.7 years.
Key Conditions for Valid Application
The central limit theorem works reliably only when certain conditions are met:
- Random selection: Each observation must have an equal chance of being chosen, avoiding systematic bias
- Independence: One sample value shouldn't influence another—random sampling typically ensures this
- Adequate sample size: A minimum of 30 observations is the standard benchmark, though larger samples yield better normal approximation
- Population proportion: Your sample should not exceed 10% of the total population (or use 1000 as a practical maximum), preventing dependence issues in finite populations
Common Pitfalls When Using the Central Limit Theorem
Avoid these frequent mistakes when applying the theorem to your data analysis.
- Ignoring the sample size requirement — Using samples smaller than 30 risks non-normal sampling distributions, especially with skewed populations. If your data comes from a heavily skewed or bimodal source, consider aiming for 50+ observations to ensure the normal approximation holds reliably.
- Confusing sample standard deviation with standard error — The population standard deviation σ divided by √n gives the standard error—the spread of sample means, not individual data values. Misusing one for the other will lead to dramatically wrong confidence intervals and hypothesis tests.
- Assuming the theorem applies to any statistic — The central limit theorem primarily describes sample means and sums. It's less reliable for medians, extremes, or proportions from small samples. For non-mean statistics, verify distributional assumptions before proceeding.
- Overlooking non-random sampling — Convenience or stratified sampling can introduce bias. The theorem assumes truly random selection. If your sampling procedure favours certain subgroups, your sample mean may not converge to the true population mean, invalidating your inferences.