IB Maths AI HL Statistics Toolkit Paper 1 & 2 ~8 min read

Measures of Dispersion

A measure of central tendency only tells you where the centre of the data is β€” it doesn’t say whether the values are tightly clustered around the centre or spread out across a wide range. Two classes with the same mean of 60% can have wildly different distributions: in one class everyone is between 55 and 65, while in the other scores are between 30 and 90. That’s where measures of dispersion come in. You’ll meet four of them here: the range, interquartile range (IQR), variance, and standard deviation. Each one captures spread differently β€” and choosing the right one matters when the data has outliers.

πŸ“˜ What you need to know

Range and quartiles

The simplest measure of spread is the range β€” just the gap between the largest and smallest values. Quick to compute, but a single outlier can blow it up.

Range range = maximum value βˆ’ minimum value

To get a more robust spread, ignore the extreme 25% at each end. The middle 50% sits between the lower quartile Q1 and the upper quartile Q3, and the gap between them is the interquartile range.

Interquartile range IQR = Q3 βˆ’ Q1 in the formula booklet βœ“
quartiles divide an ordered dataset into four equal parts
Quartiles, range, and IQR on a number line 25% 25% 25% 25% min Q₁ Qβ‚‚ (median) Q₃ max IQR = Q₃ βˆ’ Q₁ (middle 50%) range = max βˆ’ min (all data)
Each quartile section contains 25% of the data. The IQR captures only the middle 50%, ignoring the outer extremes β€” that’s what makes it robust to outliers.

Variance and standard deviation

The range and IQR are quick spread measures, but they don’t use every value in the data. Standard deviation does β€” it measures the average distance from the mean. Variance is just its square (so the units are squared; less intuitive but useful in further calculations).

Variance β€” definition form Οƒ2 = Ξ£ fi(xi βˆ’ ΞΌ)2n not in the formula booklet
Variance β€” computational form (easier by hand) Οƒ2 = Ξ£ fixi2n βˆ’ ΞΌ2 “mean of the squares minus the square of the mean”
Standard deviation Οƒ = βˆšΟƒ2

You’re not expected to memorise these formulas β€” the IB expects you to use your GDC. But the computational form (mean of squares minus square of the mean) is far easier than the definition form when working by hand, so it’s worth knowing.

πŸ€” Why are variance units squared but SD’s aren’t?

The variance formula uses squared deviations: (xi βˆ’ ΞΌ)2. Squaring removes negative signs (so deviations don’t cancel out) and emphasises larger deviations. But it also squares the units. If your data is in cm, the variance is in cm2. Taking the square root undoes this and brings the spread measure back to cm β€” that’s why standard deviation is the one you report in real-world contexts.

🧭 Recipe β€” variance and standard deviation by hand

  1. Compute the mean ΞΌ = Ξ£x/n.
  2. Compute Ξ£x2 = sum of the squares of every value.
  3. Variance: Οƒ2 = Ξ£x2/n βˆ’ ΞΌ2.
  4. Standard deviation: take the square root of the variance.
  5. Always verify with GDC β€” type the data into statistics mode and confirm.

Range vs IQR vs SD β€” when to use which

MeasureUses all data?Affected by outliers?Best for
RangeNo (just two values)Yes β€” heavilyQuick rough check
IQRNo (middle 50%)No β€” robustSkewed data with outliers
VarianceYesYesFurther calculations
Standard deviationYesYesSymmetric data, no outliers

🧠 Memory aid β€” pairing centre with spread

Statisticians always pair a centre measure with a spread measure of the same “family”. Median goes with IQR β€” both ignore extremes. Mean goes with standard deviation β€” both use every value. Mixing them (e.g. quoting mean with IQR) is technically OK but unusual.

Worked examples

WE 1

Range and IQR

Find the range and interquartile range for the data set below.

42    28    67    51    64    42

Step 1: sort the data 28, 42, 42, 51, 64, 67 Step 2: range = max βˆ’ min = 67 βˆ’ 28 = 39 range = 39 Step 3: quartiles using GDC (or by hand) lower half {28, 42, 42} β†’ Q₁ = 42 upper half {51, 64, 67} β†’ Q₃ = 64 Step 4: IQR = Q₃ βˆ’ Q₁ = 64 βˆ’ 42 = 22 IQR = 22 IB exams expect GDC values for quartiles β€” different by-hand methods give slightly different answers.
WE 2

Variance and standard deviation by hand

Using the same data set as WE 1, find the variance and standard deviation. Give your answers to 3 significant figures where appropriate.

42    28    67    51    64    42

Step 1: mean (from WE 1 of previous topic) ΞΌ = 294/6 = 49 Step 2: Ξ£xΒ² = sum of squares 42Β² + 28Β² + 67Β² + 51Β² + 64Β² + 42Β² = 1764 + 784 + 4489 + 2601 + 4096 + 1764 = 15498 Step 3: variance σ² = Ξ£xΒ²/n βˆ’ ΞΌΒ² = 15498/6 βˆ’ 49Β² = 2583 βˆ’ 2401 = 182 variance σ² = 182 Step 4: SD = √variance Οƒ = √182 β‰ˆ 13.491 SD Οƒ β‰ˆ 13.5 (3 sf) GDC confirms: Οƒβ‚“Β² = 182, Οƒβ‚“ β‰ˆ 13.491. Always use the GDC in the actual exam.
WE 3

All four measures together

The midday temperatures (Β°C) on 8 consecutive days at a weather station are:

15,   18,   22,   19,   25,   20,   17,   24

Find (a) the range, (b) the IQR, (c) the variance, and (d) the standard deviation. Give answers to 3 sf.

sort the data 15, 17, 18, 19, 20, 22, 24, 25 (a) range = 25 βˆ’ 15 range = 10 Β°C (b) quartiles (n = 8, even) lower half {15,17,18,19} β†’ Q₁ = (17+18)/2 = 17.5 upper half {20,22,24,25} β†’ Q₃ = (22+24)/2 = 23 IQR = 23 βˆ’ 17.5 IQR = 5.5 Β°C (c) mean and Ξ£xΒ² ΞΌ = 160/8 = 20 Ξ£xΒ² = 225+324+484+361+625+400+289+576 = 3284 σ² = 3284/8 βˆ’ 20Β² = 410.5 βˆ’ 400 variance σ² = 10.5 Β°CΒ² (d) SD Οƒ = √10.5 β‰ˆ 3.24 SD Οƒ β‰ˆ 3.24 Β°C variance units = Β°CΒ²; SD units = Β°C β€” note the difference.
WE 4

Compare two datasets

Two basketball teams record their scores (points) in 5 games each:

Game12345
Team A4045505560
Team B4849505152

Compare the spread of scores using both range and standard deviation. Which team is more consistent?

Team A mean = 250/5 = 50 range = 60 βˆ’ 40 = 20 Ξ£xΒ² = 1600+2025+2500+3025+3600 = 12750 σ² = 12750/5 βˆ’ 50Β² = 2550 βˆ’ 2500 = 50 Οƒ = √50 β‰ˆ 7.07 Team B mean = 250/5 = 50 (same!) range = 52 βˆ’ 48 = 4 Ξ£xΒ² = 2304+2401+2500+2601+2704 = 12510 σ² = 12510/5 βˆ’ 2500 = 2502 βˆ’ 2500 = 2 Οƒ = √2 β‰ˆ 1.41 compare A: range 20, Οƒ β‰ˆ 7.07 B: range 4, Οƒ β‰ˆ 1.41 Team B is much more consistent same mean, very different spread β€” exactly why a centre measure alone is never enough.
WE 5

Outlier effect β€” range vs IQR

The wait times (in minutes) at a clinic on Monday are:

12,   14,   15,   16,   18,   20,   22,   23

On Tuesday, one extra patient with a complex case had a wait of 60 minutes, making the dataset:

12,   14,   15,   16,   18,   20,   22,   23,   60

Compare the range and IQR for both days, and comment on which is more robust to the outlier.

Monday (n = 8) range = 23 βˆ’ 12 = 11 Q₁ = (14+15)/2 = 14.5; Q₃ = (20+22)/2 = 21 IQR = 21 βˆ’ 14.5 = 6.5 Tuesday (n = 9, includes 60) range = 60 βˆ’ 12 = 48 (jumped from 11!) lower half {12,14,15,16}: Q₁ = 14.5 upper half {18,20,22,23}: Q₃ = 21 wait β€” with 9 values exclude middle: Q₁ = 14.5, Q₃ = 22.5 IQR = 22.5 βˆ’ 14.5 = 8 compare range: 11 β†’ 48 (quadrupled by one outlier) IQR: 6.5 β†’ 8 (barely changed) IQR is much more robust to the outlier when data has outliers, always report the IQR rather than the range.
WE 6

Compare two players β€” same mean, different SD

Two students each throw 5 darts at a target. Their distances (in cm) from the bullseye are:

Alice:   2,   3,   4,   5,   6
Bob:   1,   3,   4,   5,   7

(a) Show that both have the same mean distance.
(b) Calculate the standard deviation for each.
(c) State which is the more consistent player.

(a) means Alice: (2+3+4+5+6)/5 = 20/5 = 4 Bob: (1+3+4+5+7)/5 = 20/5 = 4 βœ“ (b) Alice’s SD Ξ£xΒ² = 4+9+16+25+36 = 90 σ² = 90/5 βˆ’ 4Β² = 18 βˆ’ 16 = 2 Οƒ = √2 β‰ˆ 1.41 cm Bob’s SD Ξ£xΒ² = 1+9+16+25+49 = 100 σ² = 100/5 βˆ’ 16 = 20 βˆ’ 16 = 4 Οƒ = √4 = 2 cm Alice Οƒ β‰ˆ 1.41, Bob Οƒ = 2 (c) lower SD = more consistent (c) Alice is more consistent same mean, different spread β€” SD reveals what the mean hides.

πŸ’‘ Top tips

⚠ Common mistakes

Next up β€” Frequency Tables. Real datasets are rarely small enough to write out one by one. Frequency tables let you compress repeated values into compact summaries β€” and you’ll learn how to recover the mean, median, mode, range, IQR, variance, and standard deviation directly from a table, both for ungrouped and grouped data.

Need help with Statistics?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session β†’