What Is Median Absolute Deviation?
Median absolute deviation measures the typical distance between each data point and the median of the dataset. It belongs to a family of robust statistics—tools designed to resist the influence of extreme values.
In contrast, standard deviation treats all deviations equally and can be pulled heavily by outliers. If you're working with real-world measurements that contain occasional errors or unusual observations, MAD offers a more trustworthy picture of variation.
The term "absolute" is crucial: it means we ignore whether a point lies above or below the median. We care only about how far away it is. The "median" step at the end ensures that even the deviations themselves are analysed robustly, creating a double shield against outliers.
Median Absolute Deviation Formula
To find MAD, you work through five straightforward steps. Begin with your raw dataset, sort it, and find the middle value (the median). Then subtract that median from every point, take absolute values to eliminate negative signs, sort the resulting deviations, and finally extract the median of those deviations.
MAD = median(|Xi − m|)
where:
m = median of the original dataset
Xi = each individual data point
MAD— The median absolute deviation—the final robust measure of spreadm— The median (middle value) of your datasetX<sub>i</sub>— Each individual observation in your dataset
Step-by-Step Worked Example
Suppose you recorded sprint times (in seconds) for six runners: 12, 16, 12, 11, 14, 15.
Step 1: Sort the data. Ordered times: [11, 12, 12, 14, 15, 16].
Step 2: Find the median. With six values, the median is the average of the 3rd and 4th: m = (12 + 14) ÷ 2 = 13 seconds.
Step 3: Subtract the median from each value.11 − 13 = −212 − 13 = −112 − 13 = −114 − 13 = 115 − 13 = 216 − 13 = 3
Step 4: Take absolute values. [2, 1, 1, 1, 2, 3].
Step 5: Find the median of the deviations. Sorted: [1, 1, 1, 2, 2, 3]. The median of these six values is (1 + 2) ÷ 2 = 1.5 seconds.
Common Pitfalls and Practical Considerations
When computing or interpreting median absolute deviation, watch for these frequent mistakes:
- Forgetting to sort before finding the median — The median is meaningless without ordering. Always arrange your dataset in ascending or descending order first, then locate the centre value (or average of the two central values if your dataset has an even count).
- Neglecting the absolute value step — If you skip taking absolute values of the deviations, negative and positive distances will partially cancel out, giving a misleading result. Always convert every deviation to its positive magnitude before proceeding.
- Confusing MAD with mean absolute deviation — Mean absolute deviation (average of absolute deviations) is different from median absolute deviation (median of absolute deviations). Both are robust, but MAD is more resistant to outliers because it uses the median rather than the mean in the final step.
- Over-relying on a single statistic — MAD describes spread but says nothing about location, skewness, or the shape of your distribution. Pair it with complementary measures like the median itself, quartiles, or a histogram for a complete picture.
When to Use Median Absolute Deviation
Choose MAD when your data are non-normally distributed, contain outliers, or come from a process prone to occasional extreme events. Environmental monitoring, financial data analysis, and medical measurements often fit this profile.
For example, if you measure pollutant levels in a river and one storm causes a spike, MAD will remain stable while standard deviation balloons. Similarly, in clinical trials, if one patient has an unusual response, MAD isolates the typical variation experienced by the rest.
You can report MAD as is, or multiply it by 1.4826 (a scaling constant) to make it comparable to standard deviation in normally distributed data. This adjustment allows seamless comparison with legacy analyses or industry benchmarks.