IB Maths AI SL Topic 4 — Statistics Toolkit Paper 1 & 2 1.5 × IQR rule ~6 min read

Outliers

An outlier is a data value far outside the rest. The IB uses a strict definition: a value is an outlier if it lies more than 1.5 × IQR from the nearest quartile. Whether to keep or remove an outlier depends on whether it is a genuine extreme value or just a recording error.

๐Ÿ“˜ What you need to know

The 1.5 × IQR rule visualised

Picture the box plot. The box covers the middle 50% of the data (Q1 to Q3). Then extend invisible “fences” 1.5 IQRs further out on each side. Anything that escapes those fences is an outlier.

Box plot with outlier fences (data: 12, 35, 40, 42, 45, 47, 50, 52, 55, 120) 020406080100120 lower fence 22 Q₁ โˆ’ 1.5 IQR upper fence 70 Q₃ + 1.5 IQR Q₁ = 40 median 46 Q₃ = 5235 55 12 120 outlier outlierIQR = 12
The box covers Q1 to Q3 (IQR = 12). Orange fences sit 1.5 IQR beyond each quartile: 40 − 18 = 22 and 52 + 18 = 70. The whiskers end at 35 and 55 (the most extreme non-outliers), and the values 12 and 120 fall beyond the fences — both flagged as outliers (red ×).
The outlier rule x is an outlier  ⇔  x < Q1 − 1.5 IQR   or   x > Q3 + 1.5 IQR
 
where  IQR = Q3Q1

๐Ÿงญ Recipe — identify outliers

  1. Order the data and find Q1 and Q3. Use the GDC on Paper 2.
  2. Compute IQR = Q3Q1.
  3. Compute the two fences: lower = Q1 − 1.5 IQR  ยท  upper = Q3 + 1.5 IQR.
  4. List any data values below the lower fence or above the upper fence.
  5. Decide: keep (genuine extreme) or remove (clear error / typo). Justify with the context.
Strict inequality: the rule says more than 1.5 IQR. A value exactly on the fence (e.g. x = Q1 − 1.5 IQR) is NOT an outlier — it’s the borderline case.

Worked examples

WE 1

Identify outliers from given quartiles

A data set has Q1 = 12 and Q3 = 24. Determine which of the following values are outliers:

3,  15,  18,  28,  50

Step 1 โ€” IQR IQR = 24 โˆ’ 12 = 12 Step 2 โ€” fences lower = 12 โˆ’ 1.5 ร— 12 = 12 โˆ’ 18 = โˆ’6 upper = 24 + 1.5 ร— 12 = 24 + 18 = 42 Step 3 โ€” check each value 3 โˆˆ [โˆ’6, 42] โ†’ not outlier 15, 18, 28 โ†’ all inside โ†’ not outliers 50 > 42 โ†’ outlier โœ“ only 50 is an outlier don’t be fooled by 3 โ€” it looks small, but it’s still above the lower fence of โˆ’6. Compare to the FENCE, not to the median.
WE 2

Full workflow from raw data

The 5 km running times (min) of 11 club athletes are:

35,  38,  40,  42,  44,  45,  47,  48,  50,  52,  95

Identify any outliers.

Step 1 โ€” quartiles (n = 11) median = 6th value = 45 lower 5: 35, 38, 40, 42, 44 โ†’ Qโ‚ = 40 upper 5: 47, 48, 50, 52, 95 โ†’ Qโ‚ƒ = 50 Step 2 โ€” IQR IQR = 50 โˆ’ 40 = 10 Step 3 โ€” fences lower = 40 โˆ’ 15 = 25 upper = 50 + 15 = 65 Step 4 โ€” flag values outside [25, 65] 95 > 65 โ†’ outlier outlier: 95 min 95 minutes for 5 km is extremely slow โ€” likely a recording error or a walker. Decide context before removing it.
WE 3

Two outliers (low AND high)

The annual incomes of 10 employees (in $1000s) are:

12,  35,  40,  42,  45,  47,  50,  52,  55,  120

Identify any outliers.

Step 1 โ€” quartiles (n = 10) median = (45+47)/2 = 46 lower 5: 12, 35, 40, 42, 45 โ†’ Qโ‚ = 40 upper 5: 47, 50, 52, 55, 120 โ†’ Qโ‚ƒ = 52 Step 2 โ€” IQR & fences IQR = 12 lower = 40 โˆ’ 18 = 22 upper = 52 + 18 = 70 Step 3 โ€” values outside [22, 70] 12 < 22 โ†’ outlier 120 > 70 โ†’ outlier outliers: 12 and 120 an outlier set can be one-sided, two-sided, or empty. Always check BOTH fences โ€” students often forget to test for low outliers.
WE 4

Keep or remove? — justify the decision

Seven recent house sale prices on a quiet street are recorded (in $000s):

240,  260,  275,  290,  305,  320,  750

(a) Show that 750 is an outlier.   (b) State whether 750 should be removed, with justification.

(a) Quartiles (n = 7, odd) median = 4th value = 290 lower 3: 240, 260, 275 โ†’ Qโ‚ = 260 upper 3: 305, 320, 750 โ†’ Qโ‚ƒ = 320 IQR = 60 upper fence = 320 + 90 = 410 750 > 410 โ†’ outlier โœ“ (b) Decision 750k is a plausible mansion price โ€” not an error keep 750 โ€” it is a genuine extreme value, not a typo “Keep / remove” decisions need a CONTEXT-based reason. “It’s plausibly real” โ†’ keep. “Clearly a typo / impossible value” โ†’ remove.
WE 5

Find the smallest integer that is an outlier

A data set has Q1 = 18 and Q3 = 26. Find the smallest integer value that would be classified as an upper outlier.

Step 1 โ€” IQR & upper fence IQR = 26 โˆ’ 18 = 8 upper fence = 26 + 1.5 ร— 8 = 26 + 12 = 38 Step 2 โ€” outlier means STRICTLY greater than 38 38 itself: NOT an outlier (on the boundary) 39: 39 > 38 โœ“ smallest integer outlier = 39 trap: the value AT the fence is not an outlier. The smallest integer above 38 is 39.
WE 6

Comprehensive — full Paper-2 style question

A commuter records her daily journey times (min) over 11 working days:

12,  17,  18,  19,  21,  22,  24,  25,  26,  28,  50

(a) Find Q1, Q3 and the IQR.   (b) Identify any outliers.   (c) Suggest whether the outlier should be removed.

(a) Quartiles (n = 11, odd) median = 6th value = 22 lower 5: 12, 17, 18, 19, 21 โ†’ Qโ‚ = 18 upper 5: 24, 25, 26, 28, 50 โ†’ Qโ‚ƒ = 26 IQR = 26 โˆ’ 18 = 8 Qโ‚ = 18 ยท Qโ‚ƒ = 26 ยท IQR = 8 (b) Fences lower = 18 โˆ’ 12 = 6 upper = 26 + 12 = 38 12 not < 6 โ†’ not outlier 50 > 38 โ†’ outlier โœ“ outlier: 50 min (c) Decision 50 min is plausible โ€” a bad-traffic day keep 50; it is a genuine value commuting times can spike for legitimate reasons (accidents, delays). Unless told it was a recording error, this outlier represents real-world variability and should be kept.

๐Ÿ’ก Top tips

โš  Common mistakes

Next up: Box & Whisker Diagrams. You’ve already seen the box plot in action; now you’ll draw and read them formally — from the five-number summary (min, Q1, median, Q3, max), with outliers shown separately. Box plots are the fastest way to compare two distributions at a glance.

Need help with AI SL Statistics?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session →