IB Maths AI SLTopic 4 — Statistics ToolkitPaper 1 & 2Mean, median, mode~7 min read
Measures of Central Tendency
Three “averages” describe where the centre of a data set sits: the mode (most common), the median (middle), and the mean (sum ÷ count). Each behaves differently — especially when there is an outlier — so always think about which one best represents the data.
📘 What you need to know
Mode = value that occurs most often. A data set can have one mode, several modes, or no mode.
Median = middle value when the data is in order. For an even count, the median is the midpoint of the two middle values.
Meanx̄ = sum of valuescount of values. Symbol x̄ (“x-bar”) or μ.
Outliers affect the mean strongly but barely move the median or mode.
Same units as the data: if heights are in cm, all three averages are in cm.
GDC computes all three from a list. Always know the by-hand method too — Paper 1 has no calculator.
The three averages
Mode — just look for the most common value. Useful for qualitative data (“most popular flavour”) where mean and median don’t apply.
Median — sort the data, then pick the middle.
• Odd count n: median is the value in position n + 12.
• Even count: median is the mean of the two middle values, in positions n2 and n2 + 1.
Mean — sum every value, divide by how many. Uses every piece of data, which is why a single huge outlier can pull it sideways.
For the data 2, 3, 4, 4, 5, 6, 7, 9, 14 the three averages all land on different values. Mode (red) sits at the tallest stack, median (orange) at the middle value, mean (teal) is the balance point — pulled right by the outlier 14.
The mean formulax̄ = 1nΣxi = sum of all valuesn
🧠Recipe — find mode, median, mean by hand
Mode: scan the list; pick whichever value appears most. If two appear the same maximum number of times, list both (bimodal).
Order the data from smallest to largest. Essential before finding the median.
Median: if n is odd, the middle entry; if even, the average of the two middle entries.
Mean: add every value, then divide by n.
Sanity check: all three averages should sit between the smallest and largest values.
Spotting outliers: a single value far from the rest can drag the mean noticeably while leaving the median untouched. When a question features an obvious extreme value, expect to discuss this.
Worked examples
WE 1
Mode, median, mean from a small set
Eight students took a test (max 20). Their marks were:
12, 18, 14, 20, 18, 16, 18, 14
Find the mode, median, and mean.
Step 1 — order12, 14, 14, 16, 18, 18, 18, 20Mode = most common value18 appears 3 times → mode = 18Median: n = 8 (even) → mean of 4th & 5th(16 + 18) / 2 = 17Mean: sum / nsum = 130, n = 8x̄ = 130 / 8 = 16.25mode = 18 · median = 17 · mean = 16.25all three sit between 12 (min) and 20 (max), so the sanity check passes.
Order9, 9, 12, 12, 15, 18, 20Mode — check most frequent9 appears 2 times12 appears 2 times→ two modes (bimodal): 9 and 12Median: n = 7 (odd) → 4th valuemedian = 12MeanxÌ„ = 95 / 7 ≈ 13.6modes 9 & 12 · median 12 · mean ≈ 13.6if two (or more) values tie for “most common”, quote them all. Saying “no mode” or picking one only is wrong.
WE 3
Median of an even-count data set
The monthly rainfall (mm) of a town for 8 months was:
12, 15, 11, 14, 18, 13, 16, 11
Find the median and mean.
Order11, 11, 12, 13, 14, 15, 16, 18Median: n = 8 (even) → mean of 4th & 5th(13 + 14) / 2 = 13.5Meansum = 110, n = 8xÌ„ = 110 / 8 = 13.75median = 13.5 mm · mean = 13.75 mmfor even n, the median lies between two data values — it doesn’t have to be a value that actually appears in the data. Keep the same units.
WE 4
Finding a missing value given the mean
The mean of six numbers is 14. Five of the numbers are:
10, 18, 12, 9, 20
Find the sixth number.
Use: mean × n = total sumtotal = 14 × 6 = 84Sum of the five known values10 + 18 + 12 + 9 + 20 = 69Sixth value = total − known sum84 − 69 = 15sixth number = 15“reverse mean” problems use mean × n = total. This trick comes up a lot — memorise it.
WE 5
Outliers — choose the right average
A small bakery has seven employees. Their weekly hours worked are:
20, 22, 24, 21, 23, 22, 80
The 80 is the owner. (a) Find the mean and median. (b) Which better represents a “typical employee”?
(a) Order20, 21, 22, 22, 23, 24, 80Median: n = 7 → 4th valuemedian = 22Meansum = 212, n = 7xÌ„ = 212 / 7 ≈ 30.3mean ≈ 30.3 h · median = 22 h(b) The 80 is an outlierit pulls the mean up to 30.3 (above every staff value!)the median = 22 sits with the restmedian better represents a typical employeewhenever ONE value is far from the rest, the median is the safer “centre”. The mean tells you about the total workload, not the typical person.
WE 6
Combined mean of two groups
Class A has 20 students with a mean test mark of 14. Class B has 30 students with a mean test mark of 18. Find the mean mark of all 50 students combined.
Use: total = mean × n for each classClass A total: 20 × 14 = 280Class B total: 30 × 18 = 540Combined sum & counttotal sum = 280 + 540 = 820total count = 20 + 30 = 50Combined meanx̄ = 820 / 50 = 16.4combined mean = 16.4the answer is NOT (14 + 18) / 2 = 16 — that ignores the different group sizes. Always convert each mean back to a total first, then add and divide.
💡 Top tips
Always order before finding the median. Forgetting to sort is the #1 error.
“mean × n = total” unlocks “find the missing value” and “combined mean” problems.
Outlier present? The median is usually the fairer “typical value”; the mean tells you about the total.
Qualitative data: only the mode works (you can’t average colours).
GDC: use 1-Var Stats (TI) or “One-Variable Statistics” (Casio) — saves time on Paper 2.
âš Common mistakes
Finding the median without ordering: the “middle of the list as written” is meaningless. Sort first.
Writing “mode = 0” when no value repeats: better answer is “no mode” or “every value is a mode”.
Averaging two means directly when groups have different sizes (WE 6). Convert to totals first.
Off-by-one when locating the median: position is n + 12 for odd n — not n2.
Rounding the mean too early: keep full precision until the final step.
Next up: Measures of Dispersion — range, interquartile range, standard deviation, and variance. Knowing the centre is only half the picture; you also need to describe how spread out the data is. Same data, different spreads, very different stories.
Need help with AI SL Statistics?
Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.