IB Maths AA HL Topic 4 — Statistics & Probability Paper 1 & 2 ~7 min read

The Normal Distribution

A normal distribution is the bell-shaped curve described by N(μ, σ²) — symmetric about the mean μ, with spread controlled by the variance σ². Real-world variables like heights, weights and exam scores are modelled this way, provided the data is symmetrical with one mode. The 68-95-99.7 rule gives quick percentage estimates within ±1, ±2, ±3 standard deviations of the mean.

📘 What you need to know

The N(μ, σ²) notation

If you read X ~ N(μ, σ²), the first parameter is always the mean and the second is the variance. To get the standard deviation, take the square root of the second parameter — this is the single biggest source of slip-ups in this topic.

Reading the parameters X ~ N(μ, σ²)   ⇒   mean = μ,   variance = σ²,   SD = σ = √(σ²)

The 68-95-99.7 empirical rule

The shape of every normal curve is the same — only the location and width change. Because of this, the percentage of the population within a fixed number of standard deviations of the mean is always the same, regardless of μ or σ:

The 68-95-99.7 Rule μ−3σ μ−2σ μ−σ μ μ+σ μ+2σ μ+3σ 68% 95% 99.7% ★ ~99.7% within ±3σ ★ ~95% within ±2σ ★ ~68% within ±σ
Total area under any normal curve is 1. The percentage within ±k standard deviations is the same for every N(μ, σ²).
Tail probability shortcut: the area outside ±2σ is roughly 5% (≈ 2.5% in each tail); outside ±3σ is roughly 0.3% (≈ 0.15% in each tail). Useful for “more than”/”less than” estimates without a GDC.

When to use a normal model

ScenarioNormal model?Reason
Adult heights or weights in a countryYESlarge population, symmetric, single peak
Mass of randomly selected applesYESbiological measurement, bell-shaped
Rolling a single fair dieNOdiscrete; uniform, not bell-shaped
Daily rainfall (mm)NOspike at zero; right-skewed
Lifespan of LED bulbsNOtypically right-skewed (long tail)
Output of a random number generatorNOuniform, no single mode

Two non-negotiable conditions for normal modelling: the variable should be roughly symmetric and have a single mode. A long tail or a spike at zero is a deal-breaker.

🧭 Recipe — answer empirical-rule questions

  1. Read μ and σ² from the notation; take the square root for σ.
  2. Sketch the curve with marks at μ, μ±σ, μ±2σ, μ±3σ.
  3. Locate the value in question — how many σ above/below the mean is it?
  4. Apply 68% / 95% / 99.7% for the central region; subtract from 1 for tails.
  5. Use symmetry to split a tail in half if you only need one side.

Worked examples

WE 1

Read parameters from N(μ, σ²)

The random variable T ~ N(75, 64). State the mean, variance and standard deviation of T.

Identify each parameter First slot is the mean: μ = 75 Second slot is the variance: σ² = 64 Square-root for SD σ = √64 = 8 μ = 75; σ² = 64; σ = 8 never report 64 as the standard deviation — it’s the variance
WE 2

Apply the 68-95-99.7 rule

Adult male heights (in cm) in a population are modelled by H ~ N(178, 49). (a) State the standard deviation. (b) Approximately what percentage of men are between 164 cm and 192 cm tall? (c) Approximately what percentage are taller than 199 cm?

(a) σ = √49 = 7 (b) Locate the bounds in σ-units from the mean 164 = 178 − 14 = μ − 2σ 192 = 178 + 14 = μ + 2σ → within ±2σ → ~95% (c) 199 = 178 + 21 = μ + 3σ ~99.7% within ±3σ → ~0.3% outside By symmetry, half of that is in the upper tail → P(H > 199) ≈ 0.15% (a) σ = 7; (b) ~95%; (c) ~0.15% always count σ-multiples first, THEN apply 68/95/99.7
WE 3

Compare two normal curves

Two random variables are A ~ N(50, 4) and B ~ N(50, 25). (a) State the SD of each. (b) Describe how the two curves compare in shape. (c) State the mode of B.

(a) Square-root each variance σ_A = √4 = 2; σ_B = √25 = 5 (b) Compare Both curves are centred at μ = 50 (same mean) B has the larger SD → B is wider and shorter A has the smaller SD → A is narrower and taller (c) Normal is symmetric → mode = mean Mode of B = 50 σ_A = 2, σ_B = 5; B is wider and shorter; mode of B = 50 same mean = same horizontal position; bigger σ = squashed flatter
WE 4

Identify which scenarios suit a normal model

For each random variable, state with reason whether a normal distribution is appropriate.

(a) Daily rainfall (in mm) at a weather station.
(b) The mass of randomly selected apples from a single orchard.
(c) The score on one roll of a fair six-sided die.
(d) The lifespan of a brand of LED bulb (which has a long right tail).

(a) Rainfall — NO many days have zero rain → spike at 0; not symmetric (b) Apple mass — YES biological measurement: symmetric, bell-shaped, single mode (c) Die score — NO discrete, uniform — not bell-shaped or continuous (d) LED lifespan — NO stated long right tail → not symmetric (a) No; (b) Yes; (c) No; (d) No two checks: symmetric? single peak? both must pass for normal modelling
WE 5

Use symmetry to find a probability

The random variable X ~ N(60, σ²). (a) Find P(X > 60). (b) Given that P(X < 50) = 0.2, find P(50 < X < 70).

(a) Symmetry about the mean P(X > 60) = 0.5 (b) 50 and 70 are symmetric about 60 (each 10 away) By symmetry: P(X > 70) = P(X < 50) = 0.2 P(50 < X < 70) = 1 − P(X < 50) − P(X > 70) = 1 − 0.2 − 0.2 = 0.6 (a) 0.5; (b) P(50 < X < 70) = 0.6 σ wasn’t needed — symmetry alone solved it
WE 6

Real-world: egg masses + empirical rule

The mass M (in grams) of a randomly chosen egg from a particular flock is modelled by M ~ N(62, 16).

(a) State the mean and standard deviation of M. (b) State two assumptions needed to use this model. (c) Approximately what percentage of eggs have mass between 58 g and 66 g? (d) An egg is classified as “extra large” if its mass exceeds 70 g. Approximately what percentage are extra large?

(a) Read parameters μ = 62 g; σ² = 16, so σ = √16 = 4 g (b) Assumptions • distribution of egg masses is symmetrical • distribution is bell-shaped (single mode) (c) 58 = 62 − 4 = μ − σ; 66 = 62 + 4 = μ + σ Within ±σ → ~68% (d) 70 = 62 + 8 = μ + 2σ ~95% within ±2σ → ~5% outside Half in each tail → P(M > 70) ≈ 2.5% (a) μ = 62, σ = 4; (c) ~68%; (d) ~2.5% always state assumptions explicitly — they’re often a separate mark

💡 Top tips

⚠ Common mistakes

Next: Calculations with Normal Distribution. Now that you can read parameters and sketch the curve, the GDC’s NormCdf and InvNorm functions will give you exact (rather than empirical-rule-approximate) probabilities for any range — even when the bounds aren’t whole multiples of σ.

Need help with Statistics & Probability?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session →