What is the Lognormal Distribution?
A lognormal distribution describes a continuous random variable whose natural logarithm is normally distributed. If Y ~ N(μ, σ²), then X = exp(Y) follows a lognormal distribution. This transformation ensures X takes only positive values, making it ideal for modeling quantities that cannot be negative: component lifespans, financial asset prices, rainfall amounts, and biological measurements.
The distribution is heavily right-skewed, with a long tail extending toward larger values. This asymmetry reflects real-world processes where most observations cluster near lower values but occasional extremes occur. The parameters μ (scale) and σ (shape) control the distribution's location and spread on the logarithmic scale, not on the original scale of X.
Lognormal Distribution Formulas
The probability density function and key descriptive statistics depend on the two parameters μ and σ, which represent the mean and standard deviation of ln(X) respectively. Below are the essential formulas for working with lognormal distributions.
f(x) = 1 / (x × σ × √(2π)) × exp(−(ln(x) − μ)² / (2σ²))
Mean = exp(μ + σ² / 2)
Median = exp(μ)
Mode = exp(μ − σ²)
Variance = [exp(σ²) − 1] × exp(2μ + σ²)
Skewness = [exp(σ²) + 2] × √[exp(σ²) − 1]
x— The argument value at which to evaluate the probability density functionμ— Scale parameter; the mean of the natural logarithm of Xσ— Shape parameter; the standard deviation of the natural logarithm of Xp— Quantile level (0 to 1) for which to find the corresponding X value
Calculator Modes and Usage
This tool offers six distinct calculation modes to handle different analytical needs:
- Probability density function (PDF): Enter μ, σ, and an x-value to find f(x), the height of the distribution curve at that point.
- Cumulative distribution function (CDF): Calculate F(x), the probability that a random observation falls at or below a given value.
- Quantile: Find the x-value corresponding to a specified cumulative probability (e.g., the 95th percentile).
- Descriptive statistics: Compute mean, median, mode, variance, standard deviation, and skewness for your parameter set.
- Tail probabilities: Determine P(X ≤ x), P(X < x), P(X ≥ x), or P(X > x) for risk and reliability assessments.
Begin by selecting your desired mode, entering μ and σ, then provide any additional arguments your mode requires.
Real-World Applications
The lognormal distribution excels at modeling inherently positive, skewed phenomena across diverse fields. In engineering and reliability, it represents component failure times and equipment time-to-failure in wear-out regimes. Financial analysts use it to model stock prices, returns, and asset valuations, since prices cannot be negative and exhibit proportional growth.
Environmental and hydrological data frequently follow lognormal patterns: daily rainfall accumulation, pollutant concentrations, and river discharge rates. In biology and medicine, lognormal distributions describe bacterial colony sizes, enzyme concentrations, and survival times post-treatment. Internet and network systems show lognormal characteristics in message sizes, download times, and request intervals. Industrial quality control leverages it for particle size distributions and defect frequencies.
Common Pitfalls and Practical Tips
Avoid these frequent mistakes when working with lognormal distributions.
- Confusing μ and σ with X statistics — Remember that μ and σ are the mean and standard deviation of ln(X), not of X itself. The actual mean of X is exp(μ + σ²/2), which is always larger than exp(μ). Confusing these leads to incorrect parameter estimates and wrong probability calculations.
- Assuming mode always exists — The mode exists only when σ > 1. For σ ≤ 1, the distribution is monotonically decreasing and has no mode. Always check your shape parameter before interpreting modal behaviour.
- Ignoring the right tail in risk assessment — The lognormal distribution's heavy right tail means extreme values carry non-negligible probability. When modeling financial losses or system failures, standard deviations and means can underestimate tail risk. Use quantiles (e.g., 99th percentile) for safety-critical applications.
- Parameter fitting from small samples — Estimating μ and σ from limited data introduces substantial uncertainty. Use at least 30-50 observations, and consider confidence intervals around your parameter estimates when making consequential decisions.