For data that follows a normal distribution (the bell curve), three numbers cover almost everything you need to know about probability. The 68-95-99.7 rule — also called the empirical rule — connects standard deviation directly to the percentage of data near the mean.
The rule
For any normal distribution with mean μ and standard deviation σ:
- About 68% of data falls within 1 SD of the mean (μ ± σ)
- About 95% of data falls within 2 SDs (μ ± 2σ)
- About 99.7% of data falls within 3 SDs (μ ± 3σ)
That's it. Memorize this and you can answer probability questions about normal data with mental math.
Worked example: SAT scores
SAT total scores have approximately mean 1050, SD 200.
- 68% of test-takers score between 850 and 1250 (1050 ± 200).
- 95% score between 650 and 1450 (1050 ± 400).
- 99.7% score between 450 and 1650 (1050 ± 600).
A 1450 is 2 SDs above the mean — top 2.5% nationally (since 95% are between ±2 SD, the upper tail beyond 2 SD is half of the remaining 5% = 2.5%).
A 1650 is 3 SDs above — top 0.15% (about 1 in 670 test-takers).
Worked example: adult heights
U.S. adult male heights ≈ mean 70 in, SD 3 in.
- 68% of men are between 67" and 73" (5'7" to 6'1").
- 95% are between 64" and 76" (5'4" to 6'4").
- 99.7% are between 61" and 79" (5'1" to 6'7").
So a 7-foot man (84 in) is more than 4 SDs above the mean — fewer than 1 in 30,000 men. Now you know why NBA players stand out.
Why these specific percentages?
The percentages come from integrating the normal distribution's bell-shaped probability density function. The integral from μ−σ to μ+σ works out to ~0.6827; from μ−2σ to μ+2σ to ~0.9545; from μ−3σ to μ+3σ to ~0.9973.
The "68-95-99.7" labels are approximations rounded for memorability. The exact values are 68.27%, 95.45%, 99.73%. Close enough for almost any practical use.
Z-scores: how many SDs from the mean
A z-score measures how many SDs a value is from the mean:
z = (x − μ) / σ
If your test score is 1450 and the mean is 1050 with SD 200, your z-score is (1450 − 1050) / 200 = 2.0. You're exactly 2 SDs above the mean.
Z-scores let you compare values across different distributions. A 90th-percentile score on the SAT (z ≈ 1.28) is statistically equivalent to a 90th-percentile score on the ACT (also z ≈ 1.28), even though the raw numbers are very different.
Tail probabilities
For more precise probabilities beyond 3 SDs, use a z-table or any statistics calculator. Some useful values:
- z = 1.28: 90th percentile (top 10%)
- z = 1.645: 95th percentile (top 5%)
- z = 1.96: 97.5th percentile (top 2.5%) — used for 95% confidence intervals
- z = 2.33: 99th percentile (top 1%)
- z = 2.575: top 0.5% — used for 99% confidence intervals
What "normal" actually means
A normal distribution is the bell-shaped curve that arises from many independent factors adding together. Heights, test scores, blood pressure, machine measurement errors — all tend toward normal because each is the sum of many small effects.
This is the Central Limit Theorem: averages of large samples are approximately normal regardless of the underlying distribution. Hence the rule's wide applicability.
Where the rule fails
Real-world data isn't always normal. Watch out for:
- Income: heavily right-skewed. The mean is much higher than the median because billionaires pull the tail. The 68-95-99.7 rule does not apply.
- House prices: right-skewed and bimodal in many markets.
- Stock returns: approximately normal in the middle but with fat tails — extreme moves happen more often than the rule predicts.
- Binary outcomes: coin flips, yes/no surveys — these are binomial, not normal (though they approach normal in large samples).
Always check whether your data is approximately normal before applying the rule. A histogram and quick bell-shape sanity check is enough.
Quality control: the "six sigma" idea
Manufacturing aims for processes that are at least 6 SDs from any defect threshold. At 6 SDs, the failure rate is about 3.4 per million — ultra-low. The "six sigma" methodology is named after this: drive variation low enough that 6 SDs fits comfortably between mean and any defect line.
Most processes operate around 3-sigma (one in 740 errors), which sounds good but means a major-airline operation pumping out millions of flights per year would have thousands of errors. Six-sigma is the aspiration.
Confidence intervals
The 68-95-99.7 rule underlies confidence intervals. A "95% confidence interval" is mean ± 1.96 × (standard error). The 1.96 comes from the rule — it's the z-score that captures 95% of the area.
Compute SDs from your data
Our standard deviation calculator computes the SD of any list of numbers. Pair that with the rule above to estimate where most of your data lies, and to spot potential outliers (anything beyond 2–3 SDs is statistically unusual under a normal model).