Understanding the Poisson Distribution
The Poisson distribution belongs to the family of discrete probability distributions. It answers: given a known average event frequency, what is the probability of observing exactly k events in the next interval?
Consider a call centre receiving an average of 8 calls per hour. What is the probability of exactly 5 calls in the next hour? Or at least 10? The Poisson distribution handles both scenarios elegantly.
This distribution applies when:
- Events are independent (one occurrence does not influence another)
- The average rate remains constant across intervals
- Two events cannot occur at exactly the same instant
Classic examples include website traffic per minute, insurance claims per month, typos per manuscript page, and equipment breakdowns per year.
The Poisson Formula
The probability of exactly x events occurring when the average rate is λ follows this formula:
P(X = x) = (e−λ × λx) ÷ x!
P(X < x) = Σ P(X = k) for k = 0 to x−1
P(X ≤ x) = Σ P(X = k) for k = 0 to x
P(X > x) = 1 − P(X ≤ x)
P(X ≥ x) = 1 − P(X < x)
λ (lambda)— The average number of events per interval; the rate parameterx— The exact number of occurrences you want to calculate probability fore— Euler's number, approximately 2.71828x!— Factorial of x (e.g., 5! = 5 × 4 × 3 × 2 × 1 = 120)
Key Properties and Characteristics
The Poisson distribution has elegant mathematical properties that make it powerful for modelling rare events:
- Mean: Equal to λ. If you average many observations, you get λ.
- Variance: Also equal to λ. This unique property (mean = variance) distinguishes Poisson from other distributions.
- Standard deviation: The square root of λ, or √λ.
As λ increases, the distribution becomes more symmetric and bell-shaped, approaching the normal distribution. For λ = 1, the distribution is heavily skewed right; for λ ≥ 10, it approximates normality.
The Poisson distribution is the limiting case of the binomial distribution when the number of trials is very large and the success probability is very small, but their product (n × p) remains fixed.
Common Pitfalls and Practical Guidance
Avoid these mistakes when applying Poisson analysis:
- Non-independence assumptions — The Poisson assumes each event is independent. If bus arrivals bunch together (dependent events), or disease cases cluster geographically, the distribution breaks down. Verify independence before analysis.
- Changing rates over time — λ must be constant. Traffic intensity differs between rush hour and midnight. Internet traffic varies by season. Always define your time interval carefully and confirm the rate is stable within it.
- Integer-only outcomes — Poisson only works for whole-number counts: 0, 1, 2, 3... You cannot have 2.7 defects or 5.3 calls. If your data contains decimals, a different distribution (like gamma or exponential) is appropriate.
- Zero-event impossibility — Avoid using Poisson when zero events are impossible by definition. For example, if you are tracking 'number of insurance claims per policy holder' and every policyholder must file at least one claim, the Poisson model is unsuitable.
Practical Worked Example
Suppose a manufacturing plant experiences an average of 2.4 equipment failures per month. What is the probability of exactly 3 failures next month?
Given:
- λ = 2.4 (average failures per month)
- x = 3 (desired number of failures)
Calculation:
P(X = 3) = (e−2.4 × 2.43) ÷ 3!
= (0.0907 × 13.824) ÷ 6
= 1.254 ÷ 6
≈ 0.209 or 20.9%
The calculator also instantly shows cumulative probabilities: the probability of at most 3 failures (35.8%), fewer than 3 failures (14.9%), more than 3 failures (64.2%), and at least 3 failures (85.1%). Maintenance teams can use this to plan spare-parts inventory and downtime budgets.