IB Maths AA HL Topic 4 — Statistics & Probability Paper 1 & 2 ~7 min read

Measures of Dispersion

Two data sets can have the same mean but look completely different — one tightly clustered, one wildly scattered. Measures of dispersion (range, IQR, variance, standard deviation) capture that spread, each in a slightly different way.

šŸ“˜ What you need to know

Range and quartiles

Range
max āˆ’ min
full span; very sensitive to outliers
Interquartile range
IQR = Q3 āˆ’ Q1
middle 50%; ignores extremes

To find quartiles by hand: sort the data, find the median, then find the median of each half (excluding the median itself for an odd-sized data set). On a GDC, just use 1-Var Stats.

Variance and standard deviation

Variance (definition) σ²  =  Ī£ fi(xi āˆ’ μ)²n
Variance (computational form) σ²  =  Ī£ fixi²n  āˆ’  μ²
Standard deviation σ  =  √(σ²)
You don’t need to memorise these formulas — the GDC’s 1-Var Stats reports σ directly. The formulas help you understand what the GDC is doing under the hood.

Which measure to use

MeasureUsesAffected by outliers?
Rangequick rough idea of spreadyes — strongly
IQRmiddle 50%; pairs with medianno — resistant
Standard deviationspread about the mean (uses every value)yes
Variancesame as SD, but squared units (less interpretable directly)yes

🧭 Recipe — find dispersion measures

  1. Sort the data in ascending order.
  2. Range = last value āˆ’ first value.
  3. Quartiles: split sorted data at the median; Q1 = median of lower half, Q3 = median of upper half. (For odd n, exclude the median itself from each half.)
  4. IQR = Q3 āˆ’ Q1.
  5. SD and variance: use 1-Var Stats on your GDC. To check by hand: σ² = Ī£(xi āˆ’ μ)²/n; σ = √σ².

Worked examples

WE 1

Range and IQR — odd-sized data

Find the range and the interquartile range of the data set:   14, 5, 9, 22, 12, 16, 3, 18, 10, 20, 7.

Step 1: Sort (n = 11) 3, 5, 7, 9, 10, 12, 14, 16, 18, 20, 22 Step 2: Range = max āˆ’ min 22 āˆ’ 3 = 19 Step 3: Median = 6th value = 12 Step 4: Quartiles (exclude median for n odd) Lower half: 3, 5, 7, 9, 10 → Q₁ = 7 (3rd value) Upper half: 14, 16, 18, 20, 22 → Qā‚ƒ = 18 (3rd value) Step 5: IQR = Qā‚ƒ āˆ’ Q₁ 18 āˆ’ 7 = 11 Range = 19; IQR = 11 middle 50% spans only 11 units, while full range is 19 — most data is centrally clustered
WE 2

Range and IQR — even-sized data

Find the range, median, and interquartile range of the data set:   10, 18, 25, 14, 30, 12, 22, 28, 16, 20.

Step 1: Sort (n = 10) 10, 12, 14, 16, 18, 20, 22, 25, 28, 30 Step 2: Range 30 āˆ’ 10 = 20 Step 3: Median (n even → avg of 5th and 6th) Median = (18 + 20)/2 = 19 Step 4: Quartiles Lower half: 10, 12, 14, 16, 18 → Q₁ = 14 (3rd value) Upper half: 20, 22, 25, 28, 30 → Qā‚ƒ = 25 (3rd value) Step 5: IQR 25 āˆ’ 14 = 11 Range = 20; Median = 19; IQR = 11 for even n, lower half is the first 5 values, upper half is the last 5
WE 3

Variance and standard deviation by hand

Find the variance and standard deviation of the data set:   7, 9, 10, 12, 13, 15. Give your standard deviation to 3 s.f.

Step 1: Mean μ = (7 + 9 + 10 + 12 + 13 + 15)/6 = 66/6 = 11 Step 2: Deviations from the mean āˆ’4, āˆ’2, āˆ’1, 1, 2, 4 Step 3: Square and sum 16 + 4 + 1 + 1 + 4 + 16 = 42 Step 4: Variance = sum/n σ² = 42/6 = 7 Step 5: SD = √(variance) σ = √7 ā‰ˆ 2.6458 Variance = 7; SD ā‰ˆ 2.65 deviations are symmetric around the mean — a sign of clean by-hand data
WE 4

Variance and SD using the computational formula

Use σ² = (Ī£x²)/n āˆ’ μ² to find the variance and standard deviation of:   12, 15, 18, 20, 22, 25, 28, 30. Give your answers to 3 s.f.

Step 1: Mean Ī£x = 12+15+18+20+22+25+28+30 = 170 μ = 170/8 = 21.25 Step 2: Ī£x² 144 + 225 + 324 + 400 + 484 + 625 + 784 + 900 = 3886 Step 3: Apply formula σ² = 3886/8 āˆ’ 21.25² = 485.75 āˆ’ 451.5625 = 34.1875 Step 4: SD σ = √34.1875 ā‰ˆ 5.8470 Variance ā‰ˆ 34.2; SD ā‰ˆ 5.85 computational form is faster when the mean isn’t a whole number
WE 5

Find missing values from mean and range

Five numbers in ascending order are 4, 9, x, 14, y. Their mean is 11 and range is 14. Find x and y.

Step 1: Use the range to find y range = y āˆ’ 4 = 14 → y = 18 Step 2: Use the mean to find x (4 + 9 + x + 14 + 18)/5 = 11 45 + x = 55 → x = 10 Step 3: Verify ordering 4 < 9 < 10 < 14 < 18 āœ“ x = 10, y = 18 range gives the largest value; mean closes the system
WE 6

Compare two data sets with the same mean

The pulse rates of two groups of 7 athletes (in beats per minute) are recorded:
Group A: 64, 68, 70, 72, 74, 76, 80    Group B: 60, 64, 70, 72, 74, 80, 84.
(a) Find the mean of each group. (b) Find the IQR of each group. (c) Find the standard deviation of each group. (d) Comment on which group is more consistent.

(a) Means A: 504/7 = 72;   B: 504/7 = 72 (identical) (b) IQR — both have median 72 A: lower half 64, 68, 70 → Q₁ = 68; upper 74, 76, 80 → Qā‚ƒ = 76; IQR = 8 B: lower 60, 64, 70 → Q₁ = 64; upper 74, 80, 84 → Qā‚ƒ = 80; IQR = 16 (c) SD using GDC A: σ ā‰ˆ 4.90 (variance 24) B: σ ā‰ˆ 7.78 (variance ā‰ˆ 60.6) (d) Same mean, but A has smaller IQR and SD Group A is more consistent — values cluster more tightly around the mean moral: two groups can have identical averages but very different consistency

šŸ’” Top tips

⚠ Common mistakes

Next: Frequency Tables. When data has lots of repeats, frequency tables compress everything into a compact form. The mean, median, mode, and SD all generalise — but you weight each value by its frequency. For grouped data, you use mid-interval values to estimate the mean.

Need help with Statistics & Probability?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session →