The heat equation through an irregular fin. The trajectory of a satellite perturbed by the moon’s gravity. The implied volatility of an options contract given its market price. Three equations you can write down in seconds. None of them have closed-form solutions.
This is not unusual. Most equations that describe something real cannot be solved exactly. Numerical methods are what you use instead — systematic procedures that produce answers you can trust, with errors you can bound.
Three families of problems appear everywhere:
Root-finding: find x such that f(x) = 0.
Interpolation: given values at a set of points, estimate the value elsewhere.
Numerical integration: compute \int_a^b f(x)\,dx when f has no antiderivative in closed form.
Each family has the same structure: start with something computable, iterate or accumulate, and keep track of how wrong you might be.
56.1 What this chapter helps you do
Symbols to keep handy
These are the bits of notation you'll see a lot. If a line of symbols feels like a fence, read it out loud once, then keep going.
O(h^p): big-O of h to the p — the error shrinks as h to the power p as h → 0
Definitions to keep handy
These are the words we keep coming back to. If one feels slippery, come back here and steady it before you push on.
numerical method: A systematic approximation procedure that produces an answer you can bound and trust.
iterate: One step of a repeated algorithm; x_n is your current best guess.
error / truncation error: The gap between the true value and the approximation, often predictable in how it shrinks as h gets smaller.
convergence: As you refine the method (more iterations or smaller step size), the approximation approaches the true answer.
stability: Whether small errors stay small as the computation proceeds, or blow up.
Here is the main move this chapter is making, in plain terms. You do not need to be fast. You just need to keep the thread.
Coming in: You can write down the equation. You cannot always solve it in closed form.
Leaving with: Every root-finding, integration, and interpolation problem is a systematic approximation with a controllable error. The method is the answer.
56.2 Root-finding
56.2.1 The problem
You have a function f(x) — read “f of x”, the function whose root you want — and you need to find x^* such that f(x^*) = 0. This is every nonlinear equation in disguise. If you need to solve g(x) = h(x), define f(x) = g(x) - h(x) and find its root.
56.2.2 Bisection method
The simplest idea: if f(a) < 0 and f(b) > 0, and f is continuous, then somewhere in (a, b) the function must cross zero. The midpoint m = (a+b)/2 is either a root, or the root lies in (a, m), or it lies in (m, b). Evaluate f(m) and keep the half that changes sign. Repeat.
Algorithm:
Choose [a, b] with f(a) and f(b) having opposite signs.
Set m = (a + b)/2.
If |f(m)| < \varepsilon (your tolerance), stop: m is your approximation to the root.
If f(a) \cdot f(m) < 0, the root is in [a, m]: set b = m. Otherwise set a = m.
Return to step 2.
After n steps the bracket width is (b - a)/2^n, so the error is bounded by (b - a)/2^{n+1}. This is the bisection error bound. To guarantee an error below \varepsilon, you need at most \lceil \log_2((b-a)/\varepsilon) \rceil iterations.
The method is slow — the error halves each step — but it always converges as long as the function is continuous and the initial bracket is valid. Nothing else is assumed.
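As a concrete sketch (the function name `bisect` and the stopping logic are mine, not the chapter's), the algorithm above can be written as:

```python
def bisect(f, a, b, tol=1e-8, max_iter=100):
    """Bisection: f must be continuous with f(a), f(b) of opposite sign."""
    fa, fb = f(a), f(b)
    if fa * fb > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    for _ in range(max_iter):
        m = (a + b) / 2
        fm = f(m)
        # Stop on either a small residual or a small bracket width.
        if abs(fm) < tol or (b - a) / 2 < tol:
            return m
        if fa * fm < 0:      # sign change in [a, m]: keep the left half
            b, fb = m, fm
        else:                # sign change in [m, b]: keep the right half
            a, fa = m, fm
    return (a + b) / 2

root = bisect(lambda x: x**3 - x - 2, 1, 2)   # ~1.52138
```

Each pass through the loop reproduces one row of the worked table below: evaluate the midpoint, keep the half-interval where the sign change survives.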
Worked example. Find the positive real root of f(x) = x^3 - x - 2, starting in [1, 2].
Note that f(1) = 1 - 1 - 2 = -2 < 0 and f(2) = 8 - 2 - 2 = 4 > 0. A root exists in [1, 2].
| Step | a | b | m | f(m) | New bracket |
|------|-----|------|-------|-----------------------------------|--------------|
| 1 | 1 | 2 | 1.5 | 1.5^3 - 1.5 - 2 = -0.125 | [1.5, 2] |
| 2 | 1.5 | 2 | 1.75 | 1.75^3 - 1.75 - 2 \approx 1.609 | [1.5, 1.75] |
| 3 | 1.5 | 1.75 | 1.625 | 1.625^3 - 1.625 - 2 \approx 0.666 | [1.5, 1.625] |
After three steps the root is bracketed in [1.5, 1.625], width 0.125 = 1/2^3. The exact root is x^* \approx 1.5214.
Use the controls to watch the bracket halve and the guaranteed uncertainty shrink step by step.
(Interactive figure. Choose a function (x^3 - x - 2 on [1, 2], or \cos(x) - x on [0, 1]) and step through the iterations. The shaded band is the current bracket [a, b], with the midpoint m marked; the bracket width halves each step. Bisection never loses the root: it simply keeps the half-interval where the sign change still occurs.)
56.2.3 Newton-Raphson method
Bisection is robust but slow. If f is differentiable, you can do much better. The idea: at your current estimate x_n — read “x sub n”, the nth iterate — draw the tangent line to y = f(x). Follow the tangent to where it hits zero; call that x_{n+1}.
The tangent line at x_n has equation:
y - f(x_n) = f'(x_n)(x - x_n)
where f'(x_n) — read “f prime at x_n” — is the derivative of f at the current point. Setting y = 0 and solving for x:
x_{n+1} = x_n - \frac{f(x_n)}{f'(x_n)}
This is the Newton-Raphson iteration. Apply it repeatedly until |f(x_n)| is below your tolerance.
Why it converges so fast. Let e_n = x_n - x^* be the error at step n. Substituting a Taylor expansion of f about the true root x^* into the Newton iteration and simplifying gives:
e_{n+1} \approx -\frac{f''(x^*)}{2f'(x^*)} e_n^2
The next error is proportional to the square of the current error. If you are accurate to 3 decimal places, one more step gives you roughly 6. This is quadratic convergence — the number of correct digits doubles each step. Bisection adds about 0.3 correct digits per step; Newton-Raphson roughly doubles them.
The method fails if f'(x_n) = 0 (the tangent is flat) or if the starting point is far from the root (the iteration diverges or cycles). Always check convergence.
Worked example. Compute \sqrt{2} using Newton-Raphson. The positive root of f(x) = x^2 - 2 is \sqrt{2}, and f'(x) = 2x, so the iteration is

x_{n+1} = x_n - \frac{x_n^2 - 2}{2x_n} = \frac{1}{2}\left(x_n + \frac{2}{x_n}\right)

Starting from x_0 = 1:

x_1 = 1.5, \quad x_2 \approx 1.416667, \quad x_3 \approx 1.414216

The true value is \sqrt{2} \approx 1.414214. Three steps give six correct digits, and the number of correct digits roughly doubles at each step, exactly as quadratic convergence predicts.
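The same computation in code. This is a sketch with my own function name `newton`; you supply both f and f', and the guard against a flat tangent reflects the failure mode discussed above:

```python
def newton(f, df, x0, tol=1e-12, max_iter=50):
    """Newton-Raphson: iterate x <- x - f(x)/f'(x) until |f(x)| < tol."""
    x = x0
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < tol:
            return x
        d = df(x)
        if d == 0:
            raise ZeroDivisionError("flat tangent: f'(x) = 0")
        x = x - fx / d
    return x

sqrt2 = newton(lambda x: x**2 - 2, lambda x: 2 * x, 1.0)
```

From x_0 = 1 this reaches machine precision in a handful of iterations, mirroring the hand computation.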
Newton-Raphson requires the derivative f'(x). When that is expensive or unavailable, replace it with a finite-difference approximation built from the two most recent iterates:

x_{n+1} = x_n - f(x_n)\,\frac{x_n - x_{n-1}}{f(x_n) - f(x_{n-1})}

This is the secant method. It needs two starting values x_0 and x_1. The convergence order is approximately 1.618 (the golden ratio) — super-linear but slower than quadratic. No derivative evaluation is required.
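The secant update can be sketched the same way (again, the function name is mine). Note that it carries two iterates instead of one:

```python
def secant(f, x0, x1, tol=1e-12, max_iter=50):
    """Secant method: Newton with f' replaced by a finite difference."""
    f0, f1 = f(x0), f(x1)
    for _ in range(max_iter):
        if abs(f1) < tol:
            return x1
        # Finite-difference slope through the two most recent points.
        x2 = x1 - f1 * (x1 - x0) / (f1 - f0)
        x0, f0 = x1, f1
        x1, f1 = x2, f(x2)
    return x1

root = secant(lambda x: x**2 - 2, 1.0, 2.0)   # converges to sqrt(2)
```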
Which root-finder to use
Bisection is your fallback: slow, guaranteed, needs only function values and a sign change. Newton-Raphson is your workhorse when you have a derivative and a decent starting point: fast, elegant, can fail. The secant method sits between them: no derivative, faster than bisection, needs two starting points. In practice, hybrid methods (Brent’s method, Illinois algorithm) combine bisection safety with Newton-like speed.
56.3 Interpolation
56.3.1 The problem
You have n+1 data points (x_0, y_0), (x_1, y_1), \ldots, (x_n, y_n) and you want to estimate f(x) at a new x lying between them. The classical approach: find a polynomial that passes exactly through all the given points, then evaluate it at x.
56.3.2 Lagrange interpolation
For n+1 points there is exactly one polynomial of degree \leq n passing through all of them. Write it as:

P_n(x) = \sum_{k=0}^{n} y_k \, L_k(x), \qquad L_k(x) = \prod_{\substack{j=0 \\ j \neq k}}^{n} \frac{x - x_j}{x_k - x_j}
Read this as: L_k(x) equals 1 at x_k and equals 0 at every other node x_j. So P_n(x_k) = y_k for each k — the polynomial interpolates every point exactly.
Worked example. Interpolate f at x = 1.5 using the points (1, 1), (2, 8), (3, 27) — the cube function sampled at three integers. The basis polynomials evaluate to

L_0(1.5) = \frac{(1.5-2)(1.5-3)}{(1-2)(1-3)} = 0.375, \quad L_1(1.5) = \frac{(1.5-1)(1.5-3)}{(2-1)(2-3)} = 0.75, \quad L_2(1.5) = \frac{(1.5-1)(1.5-2)}{(3-1)(3-2)} = -0.125

so P_2(1.5) = 1(0.375) + 8(0.75) + 27(-0.125) = 3.0.

The exact value is 1.5^3 = 3.375. The polynomial through three points of x^3 is not perfect — it is a degree-2 polynomial approximating a degree-3 function, and the error here is 0.375. Adding the fourth node (4, 64) would give exact interpolation.
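The Lagrange construction translates almost line for line into code. This sketch (the name `lagrange_eval` is mine) evaluates the basis polynomials directly:

```python
def lagrange_eval(xs, ys, x):
    """Evaluate the interpolating polynomial through (xs[k], ys[k]) at x."""
    total = 0.0
    for k, (xk, yk) in enumerate(zip(xs, ys)):
        Lk = 1.0
        for j, xj in enumerate(xs):
            if j != k:
                Lk *= (x - xj) / (xk - xj)   # L_k is 1 at x_k, 0 at other nodes
        total += yk * Lk
    return total

p3 = lagrange_eval([1, 2, 3], [1, 8, 27], 1.5)        # 3.0, exact is 3.375
p4 = lagrange_eval([1, 2, 3, 4], [1, 8, 27, 64], 1.5) # 3.375: four nodes suffice
```

With three nodes the degree-2 interpolant of x^3 gives 3.0; with the fourth node the cubic is reproduced exactly.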
56.3.3 Newton’s divided-difference form
The Lagrange form is conceptually clean but computationally inconvenient: to add a new node you must recompute everything. Newton’s divided-difference form builds the polynomial incrementally:

P_n(x) = f[x_0] + f[x_0, x_1](x - x_0) + f[x_0, x_1, x_2](x - x_0)(x - x_1) + \cdots + f[x_0, \ldots, x_n](x - x_0)\cdots(x - x_{n-1})

where f[x_i] = y_i and each higher divided difference is built recursively: f[x_i, \ldots, x_{i+k}] = \dfrac{f[x_{i+1}, \ldots, x_{i+k}] - f[x_i, \ldots, x_{i+k-1}]}{x_{i+k} - x_i}.
To see how this builds, use two of the cube-function points: x_0 = 1, y_0 = 1 and x_1 = 2, y_1 = 8. The first divided difference is f[x_0, x_1] = (8-1)/(2-1) = 7. Newton’s form gives P_1(x) = 1 + 7(x - 1) — the straight line through (1,1) and (2,8). Adding a third node brings in the next divided difference and bends the curve to pass through that point too.
Adding a new node x_{n+1} requires only one new divided difference and one new term — the previous work is retained.
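A sketch of the divided-difference table and a Horner-style evaluation (function names are mine). The table is built in place, one column at a time, which is exactly what makes adding a node cheap:

```python
def newton_divided(xs, ys):
    """Return coefficients c[k] = f[x_0, ..., x_k] of Newton's form."""
    n = len(xs)
    c = list(ys)
    for k in range(1, n):
        # Update in place from the bottom up so earlier entries survive.
        for i in range(n - 1, k - 1, -1):
            c[i] = (c[i] - c[i - 1]) / (xs[i] - xs[i - k])
    return c

def newton_eval(xs, c, x):
    """Horner-style evaluation of the Newton form at x."""
    result = c[-1]
    for k in range(len(c) - 2, -1, -1):
        result = result * (x - xs[k]) + c[k]
    return result

coeffs = newton_divided([1, 2, 3], [1, 8, 27])   # [1, 7.0, 6.0]
```

For the cube-function nodes this gives P_2(x) = 1 + 7(x-1) + 6(x-1)(x-2), which agrees with the Lagrange result P_2(1.5) = 3.0.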
56.3.4 Runge’s phenomenon and why splines exist
High-degree polynomial interpolation over equally-spaced nodes can oscillate wildly near the endpoints, even when the underlying function is smooth. This is Runge’s phenomenon. The standard fix is spline interpolation: use low-degree polynomials (typically cubics) on each sub-interval, stitched together with continuity conditions. Splines are covered in the next chapter.
56.4 Numerical integration
56.4.1 The problem
Compute \int_a^b f(x)\,dx when f has no closed-form antiderivative, or when f is known only at discrete points (sampled data from a sensor, say).
Partition [a, b] into n equal panels of width h = (b - a)/n. Let x_i = a + ih for i = 0, 1, \ldots, n, and write f_i = f(x_i).
56.4.2 Trapezoid rule
Approximate f on each panel by a straight line joining (x_i, f_i) and (x_{i+1}, f_{i+1}). The area of each trapezoid is \frac{h}{2}(f_i + f_{i+1}). Summing over all panels:

T_n = \frac{h}{2}\left(f_0 + 2f_1 + 2f_2 + \cdots + 2f_{n-1} + f_n\right)
The error is O(h^2) — halving the step size h reduces the error by a factor of four. More precisely:
\text{Error} = -\frac{(b-a)h^2}{12} f''(\xi)
for some \xi \in (a,b), a point whose exact location is unknown. You don’t need to know where \xi is; you only need an upper bound on |f''| over the interval. The error is proportional to the second derivative of f — large curvature means larger error.
Worked example. Estimate \int_0^1 e^x\,dx with n = 4 panels. Here h = 0.25 and the node values are f_0 = 1, f_1 \approx 1.28403, f_2 \approx 1.64872, f_3 \approx 2.11700, f_4 \approx 2.71828, so

T_4 = \frac{0.25}{2}\left(1 + 2(1.28403) + 2(1.64872) + 2(2.11700) + 2.71828\right) \approx 1.72722

The exact value is e - 1 \approx 1.71828, so the error is about 8.9 \times 10^{-3}.
56.4.3 Simpson’s 1/3 rule

Approximate f on each pair of panels by the parabola through three consecutive nodes. For even n this gives:

S_n = \frac{h}{3}\left(f_0 + 4f_1 + 2f_2 + 4f_3 + \cdots + 4f_{n-1} + f_n\right)

The coefficients alternate 4, 2, 4, 2, \ldots, 4 for interior nodes. The error is O(h^4):
\text{Error} = -\frac{(b-a)h^4}{180} f^{(4)}(\xi)
Halving h reduces the error by a factor of sixteen. Simpson’s rule is exact for polynomials of degree up to 3, even though it uses degree-2 approximations — a consequence of symmetry.
Worked example. Apply Simpson’s 1/3 rule to \int_0^1 e^x\,dx with n = 4. Using the same node values as before:

S_4 = \frac{0.25}{3}\left(1 + 4(1.28403) + 2(1.64872) + 4(2.11700) + 2.71828\right) \approx 1.71832

The error is about 3.7 \times 10^{-5}, roughly 240 times smaller than the trapezoid result on the same four panels.

56.4.4 Simpson’s 3/8 rule

A variant fits cubics through groups of three panels. For n a multiple of 3:

\int_a^b f(x)\,dx \approx \frac{3h}{8}\left(f_0 + 3f_1 + 3f_2 + 2f_3 + 3f_4 + 3f_5 + 2f_6 + \cdots + f_n\right)

Error is also O(h^4). The 1/3 rule is more commonly used; the 3/8 rule appears when the panel count is odd, so the standard Simpson pairing fails — apply the 3/8 rule to three of the panels and the 1/3 rule to the rest.
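Both composite rules are a few lines each. This sketch (function names mine) reproduces the worked numbers for \int_0^1 e^x\,dx with n = 4:

```python
import math

def trapezoid(f, a, b, n):
    """Composite trapezoid rule on n equal panels."""
    h = (b - a) / n
    s = 0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, n))
    return h * s

def simpson(f, a, b, n):
    """Composite Simpson's 1/3 rule; n must be even."""
    if n % 2:
        raise ValueError("Simpson's 1/3 rule needs an even number of panels")
    h = (b - a) / n
    s = f(a) + f(b)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * f(a + i * h)   # coefficients 4, 2, 4, ...
    return h * s / 3

exact = math.e - 1
t = trapezoid(math.exp, 0, 1, 4)   # ~1.727222, error ~8.9e-3
s = simpson(math.exp, 0, 1, 4)     # ~1.718319, error ~3.7e-5
```

The Simpson error is smaller by the factor the O(h^4)-versus-O(h^2) orders predict.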
Use the panel slider to compare how trapezoid and Simpson approximations improve as the partition is refined.
(Interactive figure. A slider sets the number of panels n for \int_0^1 (\sin(\pi x) + 0.4x^2)\,dx. The blue panels show the trapezoid construction; Simpson’s rule uses the same partition but fits local parabolas, so its error usually falls much faster.)
56.4.5 Gaussian quadrature
The rules above use evenly-spaced nodes. If you are free to choose both the node locations and the weights, you can do much better.
Gauss-Legendre quadrature on [-1, 1] with n nodes is exact for polynomials of degree up to 2n-1 — twice the order you might expect. The nodes are the roots of the Legendre polynomial P_n(x), and the weights are determined by the condition of exactness.
The Legendre polynomials are a specific family of functions whose roots have an extremal property: placing quadrature nodes there squeezes the most accuracy out of a given number of evaluations. You do not derive these roots by hand — they are tabulated in every numerical methods reference and built into every serious quadrature library. For n = 2 the nodes are x_{1,2} = \pm 1/\sqrt{3} with weights w_{1,2} = 1.

Worked example. Apply the two-point rule to \int_0^1 e^x\,dx. The substitution x = (t+1)/2 maps [-1, 1] onto [0, 1]:

\int_0^1 e^x\,dx = \frac{1}{2}\int_{-1}^{1} e^{(t+1)/2}\,dt \approx \frac{1}{2}\left(e^{0.21132} + e^{0.78868}\right) \approx 1.717896

Error: |1.717896 - 1.718282| \approx 0.000386. Two function evaluations; better accuracy than the 5-point trapezoid rule.
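The two-point rule on an arbitrary interval [a, b] is just a change of variables. A sketch with my own naming:

```python
import math

def gauss2(f, a, b):
    """Two-point Gauss-Legendre on [a, b] (nodes +/- 1/sqrt(3), weights 1 on [-1, 1])."""
    mid, half = (a + b) / 2, (b - a) / 2
    t = 1 / math.sqrt(3)
    # Map the reference nodes into [a, b]; the Jacobian is half = (b - a)/2.
    return half * (f(mid - half * t) + f(mid + half * t))

approx = gauss2(math.exp, 0, 1)   # ~1.717896
```

Two evaluations beat the 5-point trapezoid result by more than an order of magnitude.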
Summary: error orders
| Method | Error | Error reduction when n doubles |
|--------------------------|-------------------------------|--------------------------------|
| Trapezoid | O(h^2) | 4× smaller |
| Simpson’s 1/3 | O(h^4) | 16× smaller |
| Gauss–Legendre (n pts) | exact for degree \leq 2n-1 | — |
When f is smooth and you control the nodes, Gaussian quadrature wins. When f comes from sampled data at fixed points, use Simpson or trapezoid.
56.5 Where this goes
The three families here — root-finding, interpolation, numerical integration — are the building blocks for almost everything else in numerical computation. Numerical linear algebra (the next chapter in this section) extends these ideas to systems: you want \mathbf{x} such that A\mathbf{x} = \mathbf{b}, and for large systems you solve it iteratively using methods (Gauss-Seidel, conjugate gradient) that follow the same pattern as the root-finders here: start from a guess and refine it until the residual is small.
Numerical integration connects back to ordinary differential equations: Euler’s method for ODEs is the forward-rectangle (left-endpoint) rule applied to the integral form of the ODE — at each step you approximate the integrand by its value at the left endpoint, which is first-order accurate. Runge-Kutta methods are higher-order quadrature applied at each step. Knowing the error orders from this chapter tells you immediately why RK4 (fourth-order) is preferred over Euler (first-order): halving the step size reduces RK4’s error by 16×, Euler’s by only 2×.
Where these methods appear
Structural engineering (FEA): each element stiffness matrix is assembled using Gaussian quadrature over the element domain.
Signal processing: interpolation underlies sample-rate conversion (resampling audio or images); Newton-Raphson is used in carrier phase estimation.
Options pricing: Newton-Raphson is the standard method for computing implied volatility from the Black-Scholes formula — the exact inverse doesn’t exist.
Astrophysics: orbital trajectories are integrated numerically step by step using adaptive-step Runge-Kutta, which uses error estimates derived from the quadrature theory here.
Machine learning: quasi-Newton optimisers such as L-BFGS approximate the Newton step for high-dimensional loss functions, while first-order methods like Adam trade convergence order for cheaper iterations. The convergence order matters enormously at scale.
56.6 Exercises
These are puzzles. Each has a clean numerical answer. The interesting work is carrying the method through correctly and checking the error bound.
1. Apply three iterations of bisection to find the positive root of f(x) = x^3 - x - 2 = 0, starting with the bracket [1, 2]. State the midpoint after each iteration and the final bracket width.
2. Apply three iterations of Newton-Raphson to compute \sqrt{2}, starting from x_0 = 1. Use f(x) = x^2 - 2. Record x_n and |f(x_n)| at each step and comment on the convergence rate.
3. Use the Lagrange interpolation formula to estimate f(1.5), given the data points (1, 1), (2, 8), (3, 27). Compare with the exact value 1.5^3 = 3.375 and explain the discrepancy.
4. Apply the trapezoid rule to \int_0^1 e^x\,dx with n = 4 panels. Compute the numerical result and the absolute error against the exact value e - 1 \approx 1.718282.
5. Apply Simpson’s 1/3 rule to the same integral \int_0^1 e^x\,dx with n = 4 panels. Compare your error with Exercise 4 and confirm the improvement is consistent with the O(h^4) order.
6. A diode circuit satisfies I \cdot R + V_T \ln(I / I_0) = V_s, where R = 100\,\Omega, V_T = 0.026\,\text{V}, I_0 = 10^{-12}\,\text{A}, V_s = 5\,\text{V}. Starting from I_0^* = 0.04\,\text{A}, apply one step of Newton-Raphson to find a better estimate. (Linearise the equation about the starting point.)