IB Maths AA HL Topic 4 — Statistics & Probability Paper 2 ~7 min read

Calculations with Normal Distribution

For a normal distribution, the GDC’s NormCdf function gives P(a < X < b) directly — input the bounds, the mean, and the standard deviation. P(X = k) is always 0 for any continuous variable, and one-sided tails like P(X < b) or P(X > a) use a very negative or very large number for the missing bound. Inverse normal goes the other way: given a probability, find the threshold value.

📘 What you need to know

Why P(X = k) = 0

For a continuous variable, the probability of getting any single exact value is zero. Picture the bell curve: a single value is a vertical line at x = k, and a line has zero width — so it encloses zero area. This is why you only ever compute probabilities for ranges.

Strict-or-weak doesn’t matter: because P(X = a) = 0, swapping < for ≤ adds nothing. P(a < X < b) and P(aXb) give identical answers — you don’t need to convert.

Range probabilities P(a < X < b)

GDC call for a finite range P(aXb)  =  NormCdf(a, b, μ, σ)

Different calculators name this Normal CD, NCD, or Normal Cdf. They all want four inputs in roughly the same order: lower bound, upper bound, mean, standard deviation. Some calculators ask for SD before mean — check the order.

One-sided tails P(X < b) and P(X > a)

The NormCdf function only takes a finite range, so for one-sided tails you fake the missing endpoint with a very large or very negative number. The normal density is essentially 0 beyond ±4σ, so any value far enough out works.

Lower tail P(X < b)
NormCdf(−10⁹⁹, b, μ, σ)
use a very negative lower bound
Upper tail P(X > a)
NormCdf(a, 10⁹⁹, μ, σ)
use a very large upper bound

Most calculators have a button labelled −1E99 / 1E99 or similar — that’s the conventional placeholder for “infinity”.

Inverse normal: from probability to threshold

If the question gives you a probability and asks for the value a, use the InvNorm function instead. Given P(X < a) = p, InvNorm(p, μ, σ) returns a.

Going backwards: probability → value P(X < a) = p  ⇒  a = InvNorm(p, μ, σ)

If the question gives the upper-tail probability instead — i.e. P(X > a) = p — convert first: P(X < a) = 1 − p, then apply InvNorm with 1 − p.

Sanity-check trick: if p < 0.5, the answer should be below the mean; if p > 0.5, above. Sketch the curve and check before pressing buttons.

🧭 Recipe — solve any normal probability question

  1. Identify μ and σ — square-root the variance if needed.
  2. Sketch the curve and shade the region you want.
  3. Translate the wording into bounds for NormCdf.
  4. For one-sided tails, use ±10⁹⁹ as the missing bound.
  5. For inverse questions, use InvNorm — and convert “greater than” probabilities to “less than” using 1 − p.

Worked examples

WE 1

Single value and a basic range

The random variable X ~ N(100, 144). Find: (i) P(X = 100). (ii) P(80 < X < 110).

(i) Single value on a continuous distribution P(X = 100) = 0 (ii) σ = √144 = 12 Apply NormCdf with finite bounds P(80 < X < 110) = NormCdf(80, 110, 100, 12) = 0.74988… (i) 0; (ii) 0.750 (3 sf) always square-root the variance before passing it to NormCdf
WE 2

Lower tail — banana weights

The weights of bananas at a market are modelled by X ~ N(150, 25²) grams. Find P(X < 130).

μ = 150, σ = 25 (already SD, not variance) Lower tail → use −10⁹⁹ for lower bound P(X < 130) = NormCdf(−10⁹⁹, 130, 150, 25) = 0.21185… P(X < 130) ≈ 0.212 (3 sf) 130 is below the mean (150), so the answer must be less than 0.5 — sanity check passes
WE 3

Upper tail — battery lifetimes

The lifetimes of a brand of battery are modelled by T ~ N(48, 9) hours. Find P(T > 52).

σ = √9 = 3 (variance was given, not SD) Upper tail → use 10⁹⁹ for upper bound P(T > 52) = NormCdf(52, 10⁹⁹, 48, 3) = 0.09121… P(T > 52) ≈ 0.0912 (3 sf) 52 is above the mean (48), so the upper-tail prob < 0.5 — checks out
WE 4

Mixed inclusive/strict range — coffee wait times

Coffee shop wait times are modelled by W ~ N(4, 0.81) minutes. Find P(2.5 ≤ W < 5).

σ = √0.81 = 0.9 Strict vs weak doesn’t matter for continuous → use both bounds directly P(2.5 ≤ W < 5) = NormCdf(2.5, 5, 4, 0.9) = 0.81895… P(2.5 ≤ W < 5) ≈ 0.819 (3 sf) no need to convert 2.5 ≤ W to 2.5 < W — same answer for continuous variables
WE 5

Inverse normal — bulb lifespans

The lifespans of a brand of LED bulb are modelled by X ~ N(2000, 250²) hours. The manufacturer wants to find the time a below which only 10% of bulbs fail. Find a.

Translate: P(X < a) = 0.10 Apply InvNorm directly a = InvNorm(0.10, 2000, 250) = 1679.61… Round to 3 sf a ≈ 1680 hours a ≈ 1680 hours (3 sf) p = 0.10 < 0.5, so the answer is below the mean (2000) — consistent
WE 6

Multi-part: distribution setup, inverse normal, and empirical-rule check

The masses of newborn babies at a hospital are normally distributed with mean 3.4 kg and standard deviation 0.5 kg. The hospital classifies a baby as “unusually large” if its mass exceeds the threshold m, where only 2.5% of babies exceed this mass.

(a) State the distribution of the mass X. (b) Find the value of m. (c) Approximately what percentage of babies have mass between 2.4 kg and 4.4 kg?

(a) μ = 3.4, σ = 0.5, so σ² = 0.25 X ~ N(3.4, 0.25) (b) P(X > m) = 0.025 — convert to lower-tail form P(X < m) = 1 − 0.025 = 0.975 m = InvNorm(0.975, 3.4, 0.5) = 4.37998… m ≈ 4.38 kg (c) 2.4 = 3.4 − 1.0 = μ − 2σ; 4.4 = 3.4 + 1.0 = μ + 2σ Within ±2σ → ~95% (empirical rule) (a) X ~ N(3.4, 0.25); (b) m ≈ 4.38 kg; (c) ~95% always convert “greater than” into “less than” form before InvNorm

💡 Top tips

⚠ Common mistakes

Next: Standardisation of Normal Variables & z-Values. The standard normal distribution Z ~ N(0, 1) lets you compare values from completely different normal distributions (e.g. exam scores from different subjects). The z-value tells you how many standard deviations a value lies above or below the mean.

Need help with Statistics & Probability?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session →