IB Maths AA HL Topic 5 — Calculus Paper 1 & 2 HL ~10 min read

First Principles Differentiation

“From first principles” means deriving f′(x) directly from the limit definition — no power rule, no shortcuts. It’s a Paper-1 favourite “show that” question because it tests whether you really understand WHY the derivative exists. The technique is mechanical: substitute, expand, simplify, cancel h, take the limit. The challenge is staying sharp on the algebra and keeping limh → 0 written until the last line.

📘 What you need to know

The definition and what it means

Limit definition of the derivative f′(x) = limh → 0 f(x+h) − f(x)h

Geometrically, this is the slope of the secant line connecting two nearby points on the curve, in the limit as those two points merge into one — the tangent.

x y y = f(x) A (x, f(x)) B (x+h, f(x+h)) x x+h h secant (slope = [f(x+h)−f(x)]/h) tangent (slope = f'(x)) As h → 0, secant (red, dashed) rotates into the tangent (green) at A
The first-principles formula computes the slope of the secant AB. As B slides toward A (i.e., as h → 0), the secant becomes the tangent at A — whose slope is f′(x).
Why “lim” matters: setting h = 0 directly gives 0/0, which is undefined. The limit lets us say what value the quotient approaches WITHOUT actually putting h = 0. The algebraic trick (cancelling the h) is what makes the limit computable.

🧭 Recipe — differentiate from first principles

  1. Substitute into the formula: write f′(x) = limh → 0 [f(x+h) − f(x)] / h with the specific f from the question. Keep limh → 0 attached.
  2. Expand f(x+h): polynomials — multiply out; (x+h)n — binomial theorem; surds — multiply by the conjugate; reciprocals — common denominator.
  3. Subtract f(x) in the numerator. All the terms without an h should cancel — constants disappear.
  4. Factor h out of the numerator, then cancel it against the h in the denominator. The expression left should still depend on h, but no longer be 0/0.
  5. Take the limit by setting h = 0 in what’s left. State the final derivative as the concluding line — this is where limh → 0 finally drops off.

Worked examples

WE 1

Quadratic with linear and constant terms

Prove, from first principles, that the derivative of f(x) = 4x² + 5x − 1 is 8x + 5.

Step 1 — substitute into the limit definition f'(x) = lim h→0 [f(x+h) − f(x)] / h = lim h→0 [(4(x+h)² + 5(x+h) − 1) − (4x² + 5x − 1)] / h Step 2 — expand f(x+h) 4(x+h)² = 4(x² + 2xh + h²) = 4x² + 8xh + 4h² 5(x+h) = 5x + 5h ⇒ f(x+h) = 4x² + 8xh + 4h² + 5x + 5h − 1 Step 3 — subtract f(x); constants and “no-h” terms cancel f(x+h) − f(x) = 8xh + 4h² + 5h Step 4 — factor out h, cancel with denominator = h(8x + 4h + 5) ÷ h: f'(x) = lim h→0 (8x + 4h + 5) Step 5 — let h → 0 = 8x + 0 + 5 = 8x + 5 ✓ ∴ f'(x) = 8x + 5 notice the constant “−1” never appears in f(x+h) − f(x): every term WITHOUT an h cancels. The technique always sweeps constants away.
WE 2

Cubic with two terms

Prove, from first principles, that the derivative of f(x) = 2x³ − 3x² is 6x² − 6x.

Step 1 — set up f'(x) = lim h→0 [2(x+h)³ − 3(x+h)² − (2x³ − 3x²)] / h Step 2 — expand using (x+h)³ = x³ + 3x²h + 3xh² + h³ and (x+h)² = x² + 2xh + h² 2(x+h)³ = 2x³ + 6x²h + 6xh² + 2h³ 3(x+h)² = 3x² + 6xh + 3h² f(x+h) = 2x³ + 6x²h + 6xh² + 2h³ − 3x² − 6xh − 3h² Step 3 — subtract f(x); 2x³ and −3x² cancel f(x+h) − f(x) = 6x²h + 6xh² + 2h³ − 6xh − 3h² Step 4 — factor h, cancel = h(6x² + 6xh + 2h² − 6x − 3h) ÷ h: f'(x) = lim h→0 (6x² + 6xh + 2h² − 6x − 3h) Step 5 — let h → 0 = 6x² + 0 + 0 − 6x − 0 = 6x² − 6x ✓ ∴ f'(x) = 6x² − 6x every term that survives the limit has NO h in it. Every h-term contributes 0 once h → 0 — that’s how the surviving terms get isolated.
WE 3

Reciprocal function — common denominator technique

Prove, from first principles, that the derivative of f(x) = 1/x is −1/x².

Step 1 — set up f'(x) = lim h→0 [1/(x+h) − 1/x] / h Step 2 — combine the numerator over a common denominator 1/(x+h) − 1/x = [x − (x+h)] / [x(x+h)] = −h / [x(x+h)] Step 3 — divide by h (i.e. multiply by 1/h) f'(x) = lim h→0 [−h / (x(x+h))] · (1/h) = lim h→0 −1 / [x(x+h)] Step 4 — let h → 0 = −1 / [x(x+0)] = −1 / x² ✓ ∴ f'(x) = −1/x² no polynomial expansion needed — the key move is creating a common denominator. The h in the numerator (after combining fractions) cancels the h in the outer denominator.
WE 4

Square root — conjugate (rationalisation) technique

Prove, from first principles, that the derivative of f(x) = √x is 1/(2√x).

Step 1 — set up f'(x) = lim h→0 [√(x+h) − √x] / h Step 2 — multiply num and den by the conjugate (√(x+h) + √x) num: (√(x+h) − √x)(√(x+h) + √x) = (x+h) − x = h den: h · (√(x+h) + √x) Step 3 — simplify f'(x) = lim h→0 h / [h · (√(x+h) + √x)] Step 4 — cancel h = lim h→0 1 / [√(x+h) + √x] Step 5 — let h → 0 = 1 / [√x + √x] = 1 / (2√x) ✓ ∴ f'(x) = 1/(2√x) conjugate trick: (a − b)(a + b) = a² − b² turns “√(x+h) − √x” into “(x+h) − x = h” in the numerator. Every surd problem in first principles uses this move.
WE 5

Quartic — binomial theorem on (x+h)⁴

Prove, from first principles, that the derivative of f(x) = x⁴ is 4x³.

Step 1 — set up f'(x) = lim h→0 [(x+h)⁴ − x⁴] / h Step 2 — binomial expansion of (x+h)⁴ (x+h)⁴ = x⁴ + 4x³h + 6x²h² + 4xh³ + h⁴ Step 3 — subtract x⁴ (x+h)⁴ − x⁴ = 4x³h + 6x²h² + 4xh³ + h⁴ Step 4 — factor h, cancel with denominator = h(4x³ + 6x²h + 4xh² + h³) ÷ h: f'(x) = lim h→0 (4x³ + 6x²h + 4xh² + h³) Step 5 — let h → 0 = 4x³ + 0 + 0 + 0 = 4x³ ✓ ∴ f'(x) = 4x³ binomial coefficients (1, 4, 6, 4, 1) for n = 4. After subtracting x⁴ and factoring h, only the 4x³ term survives the limit — exactly the second binomial coefficient times x³.
WE 6

First principles at a specific point

The function f is defined by f(x) = x³ + 1. Use first principles to find the gradient of y = f(x) at the point where x = 2.

Step 1 — set up with x = 2 directly f'(2) = lim h→0 [f(2+h) − f(2)] / h Step 2 — compute f(2) and expand f(2+h) f(2) = 2³ + 1 = 9 f(2+h) = (2+h)³ + 1 = 8 + 12h + 6h² + h³ + 1 = 9 + 12h + 6h² + h³ Step 3 — subtract; the constant 9 cancels f(2+h) − f(2) = 12h + 6h² + h³ Step 4 — factor h, cancel = h(12 + 6h + h²) ÷ h: f'(2) = lim h→0 (12 + 6h + h²) Step 5 — let h → 0 = 12 + 0 + 0 = 12 f'(2) = 12    (gradient at x = 2 is 12) you can plug the number in right at step 1 — no need to find the general f'(x) first. The “specific point” version often runs faster because the numerator simplifies sooner.

💡 Top tips

⚠ Common mistakes

Up next: Differentiating Inverse Functions. If y = f−1(x), then x = f(y), and the chain rule gives dy/dx = 1 / (dx/dy). This is the cleanest way to differentiate things you can’t easily isolate — including the classic proof that d/dx[ln x] = 1/x via the inverse of ex.

Need help with Calculus?

Get 1-on-1 help from an IB examiner who knows exactly what Paper 1 & 2 are looking for.

Book Free Session →