IB Maths AA SL Topic 4 β€” Statistics Toolkit Paper 1 & 2 ~10 min read

Interpreting Data

Knowing how to find the mean, median, and SD is one thing β€” knowing which one to use and how to write a comparison answer is what wins exam marks. This note pulls the whole Statistics Toolkit together into a clear decision-making guide.

πŸ“˜ What you need to know

Which statistic should I use?

The toolkit has three “centres” (mean, median, mode) and three “spreads” (range, IQR, standard deviation). Picking between them is mostly about whether your data is clean (symmetric) or messy (outliers, skewed).

Measure of central tendency β€” which one?

Use this…When…Why
MeanData is roughly symmetric, no outliersUses every value β€” most “complete” answer
MedianData has outliers or is skewedOutliers don’t drag it off course
ModeData is qualitative (words/categories)It’s the only “average” that works for words

Measure of dispersion β€” which one?

Use this…When…Why
RangeQuick rough checkEasy, but ruined by a single outlier
IQRData has outliers or is skewedIgnores the top and bottom 25%
Standard deviationData is roughly symmetric, no outliersUses every value β€” pairs naturally with the mean
🧠

Memory trick: “Mean & SD travel together. Median & IQR travel together.”

Both pairs use a measure of the centre + a measure of the spread around that centre. Pick one pair based on whether the data is clean (mean & SD) or messy (median & IQR). Don’t mix and match β€” quote the matching pair.

If a question doesn’t tell you whether the data has outliers, look for clues β€” the word “skewed”, an outlier mentioned in a previous part, or a box plot with a long whisker. When in doubt, the median & IQR is the safer choice.

Is smaller or bigger better?

Once you’ve picked the right measure, ask: in the context of the question, do you want a bigger or smaller value?

What you’re measuringSmaller is betterBigger is better
Time to finish a taskβœ“
Test scoresβœ“
Sales / revenueβœ“
Errors / faultsβœ“
Wait timeβœ“
Customer satisfactionβœ“

And what about the spread?

For spread, smaller is usually better β€” it means the data is more consistent, more reliable, more predictable. A small SD or IQR shows the values cluster tightly around the centre. A large spread shows wild variation.

πŸ“

The two questions to ask

(1) Is a bigger or smaller centre preferred here? (2) Is a bigger or smaller spread preferred? Answer both β€” and your comparison will hit the marks every time.

Which graph should I use?

Different graph types suit different data β€” pick the one that shows what the question is actually asking about.

πŸ“Š Box plot

Use for: ungrouped data when comparing two data sets quickly, or showing the 5-number summary.
Best at: showing range, IQR, median, and outliers in one image.

πŸ“ˆ Cumulative frequency graph

Use for: grouped continuous data when you need to find the median, quartiles, or percentiles.
Best at: showing running totals, finding “how many below x” type answers.

πŸ“‰ Histogram

Use for: grouped continuous data with equal class widths, when you want to see the shape of the distribution.
Best at: showing modal class, symmetry, skewness.

πŸ”΅ Scatter diagram

Use for: ungrouped data with two variables (bivariate data), when looking for a relationship.
Best at: showing correlation between two variables.

How to write a comparison answer

“Compare the two data sets” is one of the most common Paper 2 commands. The pattern for full marks is always the same.

The 3-step comparison structure

1
Compare centre

Quote a measure of central tendency for both groups. Say which one is bigger and by how much. Then link it to context: “On average, group A’s wait time is shorter than group B’s…”

2
Compare spread

Quote a measure of dispersion for both groups. Say which has more variation. Link to context: “Group A’s IQR is smaller, so wait times at A are more consistent.”

3
Connect to context

Always end with what this means in real-world terms. Don’t just say “the median is bigger” β€” say “shoppers at store A spent more on average than at store B”.

What full-mark vs no-mark answers look like

βœ— No marks

“The median of A is bigger than B. The IQR of B is smaller.”

βœ“ Full marks

“A’s median wait time (24 min) is greater than B’s (20 min), so on average people waited longer at A. B’s IQR (13 min) is smaller than A’s (15 min), so wait times at B were more consistent.”

The bad answer just lists numbers. The good answer (1) names the actual statistic with values, (2) compares them, (3) links to context (wait times, people, on average). Same maths β€” totally different mark.
The comparison sentence pattern
“[Group A’s] [statistic] ([value]) is [bigger/smaller] than [Group B’s] ([value]),
so [context-linked conclusion].”

Beyond the maths β€” context matters

Sometimes a “compare the data sets” question wants a bit more than just numbers. Look out for opportunities to mention:

πŸ“

Don’t over-claim

If a data set is small, has outliers, or comes from a biased sample, your conclusion should reflect that. Saying “this proves shop A is better” from a sample of 10 customers is too strong β€” say “based on this sample, shop A appeared to perform better, but a larger sample would be needed to confirm”.

Worked examples

WE 1

Choose the right measure of centre and spread

For each scenario, state which measure of central tendency and which measure of dispersion would be most appropriate.

(a) The salaries at a small company, where the owner earns much more than everyone else.

(b) The heights of 50 randomly chosen 16-year-olds (no outliers).

(c) The favourite ice-cream flavour of 20 students.

Identify whether each set has outliers, is symmetric, or qualitative.part (a) Owner’s salary is an outlier β€” drags the mean. Median & IQRpart (b) Random sample, no outliers β€” symmetric. Mean & standard deviationpart (c) Flavours = words = qualitative data. Mode   (no spread measure for qualitative data) match the centre and spread to the data type β€” outliers? skewed? words?
WE 2

Compare two distributions in context

The box plots below show waiting times in minutes at HealthHut and FitFirst surgeries. Compare the two distributions in context.

0 10 20 30 40 50 60 Waiting time (minutes) HealthHut FitFirst
Apply the 3-step comparison structure: centre β†’ spread β†’ context.step 1 β€” centre Read medians off box plots: HealthHut median = 20 min, FitFirst median = 24 min HealthHut’s median (20) is smaller than FitFirst’s (24). Patients are seen quicker at HealthHut on averagestep 2 β€” spread IQR for each: HealthHut IQR = 30 βˆ’ 15 = 15 min FitFirst IQR = 31 βˆ’ 18 = 13 min FitFirst’s IQR is smaller. Wait times are more consistent at FitFirststep 3 β€” context Wrap it together: “On average HealthHut sees patients faster (lower median), but FitFirst is more predictable (smaller IQR).” HealthHut quicker, FitFirst more consistent always use BOTH centre AND spread, BOTH linked to context
WE 3

Choose the most appropriate graph

For each scenario, state the most appropriate graph type.

(a) Showing whether height and weight of 50 students are related.

(b) Showing the shape of the distribution of 200 reaction times (grouped).

(c) Comparing test scores in two different classes (raw data).

(d) Estimating the median height from grouped data.

Match the question to the chart that’s best at showing it.part (a) Two variables (height & weight) β€” looking for a relationship. Scatter diagrampart (b) Grouped continuous data β€” wanting to see distribution shape. Histogrampart (c) Two raw data sets β€” comparing centre and spread. Box plots (drawn on the same axis)part (d) Need to estimate median from grouped data. Cumulative frequency graph match the chart to the question β€” there’s usually one obviously right answer
WE 4

Justify your choice when outliers exist

The number of books read by 8 students last summer: 3, 4, 4, 5, 6, 7, 8, 50.

(a) Calculate the mean and median.   (b) Which is more representative of a typical student? Justify your answer.

50 is clearly an outlier. We need to see how each measure responds.part (a) Mean: (3+4+4+5+6+7+8+50) Γ· 8 = 87/8 β‰ˆ 10.9 Sort: 3, 4, 4, 5, 6, 7, 8, 50 β†’ middle two: 5, 6 Median: (5+6)/2 = 5.5 Mean β‰ˆ 10.9,   Median = 5.5part (b) Most students read 3–8 books β€” the value 50 is extreme. Mean of 10.9 is misleading: nobody read 10.9 books, and 7 of 8 read fewer. Median of 5.5 sits in the middle of the typical range. Median is more representative β€” outlier doesn’t drag it when an outlier exists, the mean lies β€” the median tells the truth
WE 5

Write a full comparison with sample-size caveat

Two restaurants are reviewed. Restaurant A’s last 5 ratings (out of 10): 9, 9, 10, 9, 10. Restaurant B’s last 50 ratings: mean = 7.8, SD = 1.2.

Compare the two and comment on the reliability of the comparison.

Both samples roughly symmetric β€” use mean & SD. But sample sizes differ wildly!step 1 β€” centre A mean: (9+9+10+9+10) Γ· 5 = 47/5 = 9.4 B mean: 7.8 A’s mean (9.4) is bigger than B’s (7.8) β€” A scored higher on average.step 2 β€” spread A’s SD (using GDC): Οƒ β‰ˆ 0.49 B’s SD: 1.2 A’s SD is smaller β†’ A’s ratings are more consistent.step 3 β€” context & reliability In context: A averages higher and is more consistent than B. BUT β€” A’s sample is only 5 reviews, B has 50. A’s small sample may not be representative. A appears better, but A’s sample size is too small for a reliable comparison always check sample size before drawing big conclusions β€” top mark answer!

πŸ’‘ Top tips

⚠ Common mistakes

πŸŽ‰ You’ve finished the Statistics Toolkit! You can now collect data, summarise it with the right measures, choose the right graph, and write a marker-friendly comparison. Next up in Topic 4 is regression and correlation β€” analysing the relationship between two variables.

Need help with Interpreting Data?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session β†’