IB Maths AA HL
Topic 4 β Statistics & Probability
Paper 1 & 2
~7 min read
Interpreting Data
All the tools assembled β averages, dispersion, box plots, histograms, cumulative graphs β boil down to one practical skill: which one to use, and what to say about the result. The answer depends on whether outliers are present, whether the data is symmetric, and what real-world claim you’re trying to support.
π What you need to know
- Outliers present β use median + IQR (both resistant).
- Roughly symmetric, no outliers β use mean + SD (both use every value).
- Qualitative data β use the mode; a mean or median doesn’t apply.
- Bigger / smaller is “better” depends on the context: smaller times for races, bigger marks for tests.
- Smaller spread = more consistent; bigger spread = more variable.
- Choosing a graph: bar chart (qualitative), histogram (continuous grouped), CF graph (positions and percentiles), box plot (compare two distributions), scatter (bivariate).
- Always relate findings to context β “B has a smaller IQR” is a half-answer; “B’s commute times are more predictable” is the full one.
- Consider the sampling method when commenting on reliability or generalisability.
Choosing the right statistical measure
| Situation | Average to use | Spread to use |
|---|
| Symmetric, no outliers | mean | standard deviation |
| Skewed or has outliers | median | interquartile range |
| Qualitative / categorical data | mode | range (or comparison of frequencies) |
Pair them correctly: mean β standard deviation; median β IQR. Mixing them (like “median + SD”) is technically valid but unusual.
Choosing the right graph
| Graph type | Best for⦠|
|---|
| Bar chart | qualitative or discrete data β gaps between bars |
| Histogram | continuous grouped data β shape of distribution |
| Cumulative frequency graph | continuous grouped data β medians, quartiles, percentiles, “how many β€” |
| Box plot | quick centre-and-spread comparison; ideal for two or more distributions |
| Scatter plot | bivariate data β relationship between two variables |
Framework for comparing two distributions
Step 1: Centre
compare averages
“X has a higher/lower mean (or median) than Y”
Step 2: Spread
compare dispersion
“X is more/less consistent than Y”
Then add step 3: relate to context. Translate “smaller IQR” into “more predictable journey times”, “consistent quality”, “tighter exam scores”, or whatever the scenario calls for.
π§ Recipe β interpret and compare
- Check for outliers in the data β if present, prefer median and IQR.
- Pick paired measures: mean+SD or median+IQR.
- Compare the centres: which group has the higher/lower average?
- Compare the spreads: which group is more consistent?
- State the conclusion in context β translate numbers into a real-world claim.
- Comment on reliability: was the sample large enough, randomly chosen, free of bias?
Worked examples
WE 1Choose the most representative average
The annual salaries at Company X have a mean of $58,000 and a median of $44,000. State which measure is more representative of a typical employee’s earnings, and justify your answer.
Step 1: Compare the two values
Mean ($58k) > median ($44k)
Step 2: Diagnose
A larger mean than median indicates the distribution is right-skewed
β a few high earners (e.g., directors) are pulling the mean up
Step 3: Decision
Median is resistant to outliers; mean is not
Median ($44k) is more representative of a typical employee
salary data almost always uses median for this reason
WE 2Compare two athletes using mean and SD
Two long-jumpers have the following recent results (in metres):
Athlete A: mean = 6.85, SD = 0.20 Athlete B: mean = 7.05, SD = 0.40.
Compare the two athletes in context.
Step 1: Compare means (centres)
B’s mean (7.05 m) > A’s mean (6.85 m)
β B jumps further on average
Step 2: Compare SDs (spreads)
A’s SD (0.20 m) < B’s SD (0.40 m)
β A is more consistent
Step 3: Context
B better for peak performance (higher average)
A more reliable (less variable from jump to jump)
B jumps further on average; A is more consistent
“better” is context-dependent β depends on whether the team values peaks or reliability
WE 3Compare two groups using median and IQR
The commute times of two groups of workers are summarised below.
Group A: median = 28 min, IQR = 12 min Group B: median = 24 min, IQR = 25 min.
Compare the two groups in context.
Step 1: Compare medians
B’s median (24 min) < A’s median (28 min)
β B has shorter typical commute
Step 2: Compare IQRs
A’s IQR (12 min) < B’s IQR (25 min)
β A’s commute times are more consistent
Step 3: Context
B is faster on average, but A’s commutes are more predictable
B has a shorter typical commute; A is more predictable
median+IQR pairing suggests the data has outliers or skew β quite possible for commute times
WE 4Compare two box plots
Two factories assemble the same product. Box plots of assembly times (in minutes) are summarised:
Factory P: Min=10, Q1=15, Med=20, Q3=25, Max=35.
Factory Q: Min=8, Q1=14, Med=22, Q3=32, Max=45.
Compare the two factories’ performance.
Step 1: Compare medians
P: 20 min; Q: 22 min
β P faster on average
Step 2: Compare IQRs
P: 25 β 15 = 10; Q: 32 β 14 = 18
β P more consistent (middle 50% spans only 10 min vs 18 min)
Step 3: Compare ranges
P: 35 β 10 = 25; Q: 45 β 8 = 37
β Q has wider total spread
Step 4: Context
Factory P assembles products faster on average and more consistently
all three measures (median, IQR, range) point the same way β strong conclusion
WE 5Match the graph type to the data
For each scenario, state the most appropriate graph type. (a) Showing how 100 students answered “what’s your favourite ice cream flavour?”. (b) Heights of 50 students grouped into 5 cm intervals. (c) Investigating whether hours studied predicts exam scores. (d) Quickly comparing the player heights of two basketball teams.
(a) Qualitative data β bar chart (with gaps)
(b) Continuous grouped data β histogram (or CF graph if quartiles needed)
(c) Bivariate data β scatter plot
(d) Quick centre-and-spread comparison β side-by-side box plots
(a) bar chart; (b) histogram; (c) scatter plot; (d) box plots
“box plots” are unbeatable for at-a-glance comparison of two or more distributions
WE 6Quality-control interpretation
A factory uses two processes to make a part with target weight 100 g. Sampling gives:
Process A: mean = 100.1 g, SD = 0.4 g Process B: mean = 99.9 g, SD = 1.5 g.
(a) Comment on which process is closer to the target on average. (b) Comment on consistency. (c) Recommend which process to use for quality-controlled production and justify.
(a) Distance from target
A: |100.1 β 100| = 0.1 g
B: |99.9 β 100| = 0.1 g
β both equally close to the target on average
(b) SDs
A’s SD (0.4) βͺ B’s SD (1.5)
β A’s outputs cluster more tightly around their mean
(c) Recommendation
For quality control, consistency matters most
A’s smaller SD means fewer parts deviate from target
Use Process A β both processes hit the target on average, but A is much more consistent
when both means are equal, SD becomes the deciding factor
π‘ Top tips
- Pair the measures correctly: mean+SD or median+IQR β never mean+IQR or median+SD without a reason.
- For comparison questions, always cover BOTH centre and spread β half-answers lose marks.
- End every comparison with a contextual sentence that translates the numbers into a real-world claim.
- “Consistent” maps to small spread; “varies a lot” maps to large spread.
- If asked which is “better”, decide what “better” means in the context (e.g., faster races, more accurate measurements).
β Common mistakes
- Stating numerical differences without context β e.g., “A’s median is 5 less” rather than “Process A is 5 minutes faster on average”.
- Using mean + SD when outliers are present β both get distorted; switch to median + IQR.
- Concluding from centre alone when the spreads tell a different story.
- Picking a graph without considering data type β qualitative data needs a bar chart, not a histogram.
- Forgetting that “better” is context-dependent β bigger SD isn’t bad if reaching extremes is desirable (e.g., a creative class might value variety).
That closes the Statistics Toolkit. From sampling to interpreting, you now have the full vocabulary for describing data: where it sits, how it spreads, how to visualise it, and how to compare two distributions in plain language. The probability sub-section starts next β building from sample spaces and Venn diagrams up to conditional probability, independence, and Bayes’ theorem.
Need help with Statistics & Probability?
Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.
Book Free Session β