45  Matrices and linear systems

Organising equations so machines — and humans — can solve them

The structural analysis of the Tacoma Narrows Bridge involved linear systems of equations — the same fundamental form as any system of resistors, any truss, any set of chemical mass-balance equations. Thousands of numbers, hundreds of equations, but the same underlying question every time: does this system have a solution, is it unique, and what is it?

For small systems — two or three equations — you can solve by substitution, the way you learned in school. For larger systems, that approach bogs down in bookkeeping errors. The method in this chapter, Gaussian elimination, is systematic substitution: a sequence of operations that any careful person (or machine) can apply without creativity, and that always terminates with a definitive answer.

Before any of that, we need the language: matrices and the algebra of matrix operations. The notation is not decoration — it compresses a system of twenty equations into a single line, and makes the structure of the solution visible.


45.1 What this chapter helps you do

Symbols to keep handy

These are the bits of notation you'll see a lot. If a line of symbols feels like a fence, read it out loud once, then keep going.

  • \mathrm{rank}(\mathbf{A}): rank of A

  • \mathbf{A}\mathbf{x} = \mathbf{b}: A x equals b

  • [\mathbf{A}\,|\,\mathbf{b}]: augmented matrix

  • \mathbf{I}_n: identity matrix of order n

  • \mathbf{A}^T: A transpose

Definitions to keep handy

These are the words we keep coming back to. If one feels slippery, come back here and steady it before you push on.

  • matrix: A rectangular array of numbers used to store and act on structured information.

  • linear system: A set of linear equations that must all hold at the same time.

  • augmented matrix: A compact way to write the coefficients and right-hand side of a system together: [A|b].

  • Gaussian elimination: A systematic row-reduction procedure that solves the system (or shows no solution).

  • rank: A count of independent information in a matrix; it predicts whether solutions are unique, none, or infinite.

This chapter turns “a pile of equations” into a single object you can reason about and solve systematically. You will learn to:

  • read and manipulate matrices as structured data, not as a wall of numbers
  • rewrite a linear system as \mathbf{A}\mathbf{x}=\mathbf{b}
  • solve systems reliably using Gaussian elimination (row reduction)
  • use rank to predict whether a solution is unique, nonexistent, or infinite
  • reuse work with LU factorisation when the same matrix is solved repeatedly

Watch for this

  • Matrix notation hides bookkeeping, not meaning. Always keep track of what the entries represent (currents, forces, constraints).
  • Row operations change how the system is written, not what it means: they preserve the solution set.
  • Rank is a structural count of independent information. It tells you “how many constraints really exist” before you do much arithmetic.

45.2 Matrices and operations

A matrix is a rectangular array of numbers. The matrix \mathbf{A} with m rows and n columns is called an m \times n matrix:

How to read the core symbols

  • Symbol: \mathbf{A}, a_{ij}

  • Reads as: “bold A”, “a sub i j”

  • Means: a matrix and its entries (row i, column j)

  • Use when: you need to store or apply a structured collection of coefficients

  • Common misread: the first index is the row, the second is the column

  • Symbol: \mathbf{A}^T

  • Reads as: “A transpose”

  • Means: swap rows and columns

  • Use when: dot products, symmetry, least squares, and geometric interpretations

  • Common misread: transpose is not an inverse; it does not undo multiplication

  • Symbol: \mathbf{I}_n

  • Reads as: “identity matrix of order n”

  • Means: the matrix version of 1 (it leaves vectors unchanged)

  • Use when: describing inverses, solving, and factorisations

  • Common misread: \mathbf{I} depends on size; \mathbf{I}_3 and \mathbf{I}_4 are different objects

\mathbf{A} = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix}

The element in row i and column j is a_{ij}. The first subscript is always the row, the second is always the column. A matrix with one column (n = 1) is a column vector; one with one row (m = 1) is a row vector.

45.2.1 Special types

Square matrix. m = n. The elements a_{11}, a_{22}, \ldots, a_{nn} form the main diagonal.

Symmetric matrix. a_{ij} = a_{ji} for all i, j. Equivalently, \mathbf{A} = \mathbf{A}^T. Symmetric matrices arise naturally: the stiffness matrix of a structure is symmetric, as is any correlation matrix in statistics.

Diagonal matrix. All off-diagonal entries are zero: a_{ij} = 0 whenever i \neq j. Multiplication by a diagonal matrix scales each component independently.

Identity matrix \mathbf{I}_n. Diagonal with all diagonal entries equal to 1. It plays the role of 1 in matrix arithmetic: \mathbf{A}\mathbf{I} = \mathbf{I}\mathbf{A} = \mathbf{A}.

Triangular matrices. A matrix is upper triangular if a_{ij} = 0 whenever i > j — all entries below the main diagonal are zero. It is lower triangular if a_{ij} = 0 whenever i < j. Gaussian elimination produces an upper triangular matrix; this is why triangular forms matter.

Transpose. The transpose \mathbf{A}^T swaps rows and columns: (\mathbf{A}^T)_{ij} = a_{ji}. A 3 \times 2 matrix transposes to a 2 \times 3 matrix. Key identities: (\mathbf{A}^T)^T = \mathbf{A}; (\mathbf{A}\mathbf{B})^T = \mathbf{B}^T \mathbf{A}^T (order reverses).
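
If you want to check these identities numerically, a few lines of NumPy will do. This is an illustration, not part of the theory; any matrix library behaves the same way:

  import numpy as np

  A = np.array([[1, 2], [3, 4], [5, 6]])    # 3 x 2
  B = np.array([[1, 0, 2], [0, 1, 1]])      # 2 x 3

  print(A.T.shape)                          # (2, 3): transpose swaps the dimensions
  print(np.allclose((A @ B).T, B.T @ A.T))  # True: (AB)^T = B^T A^T, order reversed
  print(np.allclose(np.eye(3) @ A, A))      # True: the identity leaves A unchanged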

45.2.2 Matrix addition and scalar multiplication

Two matrices of the same size can be added element by element:

(\mathbf{A} + \mathbf{B})_{ij} = a_{ij} + b_{ij}

Scalar multiplication scales every entry: (\lambda \mathbf{A})_{ij} = \lambda a_{ij}.

These operations satisfy the expected algebraic properties — commutativity, associativity, distributivity. Nothing surprising here.
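
In code these operations are exactly as uneventful as promised. A minimal NumPy illustration:

  import numpy as np

  A = np.array([[1, 2], [3, 4]])
  B = np.array([[5, 6], [7, 8]])

  print(A + B)   # element-by-element sum:  [[ 6  8] [10 12]]
  print(3 * A)   # every entry scaled by 3: [[ 3  6] [ 9 12]]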

45.2.3 Matrix multiplication

Multiplying two matrices is less obvious. The product \mathbf{C} = \mathbf{A}\mathbf{B} requires that the number of columns of \mathbf{A} equals the number of rows of \mathbf{B}. If \mathbf{A} is m \times n and \mathbf{B} is n \times p, then \mathbf{C} is m \times p.

The (i, j) entry of \mathbf{C} is the dot product of row i of \mathbf{A} with column j of \mathbf{B}:

c_{ij} = \sum_{k=1}^{n} a_{ik}\,b_{kj}

Worked example. Let

\mathbf{A} = \begin{pmatrix} 2 & 1 \\ 0 & 3 \end{pmatrix}, \qquad \mathbf{B} = \begin{pmatrix} 1 & 4 \\ 2 & 1 \end{pmatrix}

Then c_{11} = 2 \cdot 1 + 1 \cdot 2 = 4; c_{12} = 2 \cdot 4 + 1 \cdot 1 = 9; c_{21} = 0 \cdot 1 + 3 \cdot 2 = 6; c_{22} = 0 \cdot 4 + 3 \cdot 1 = 3.

\mathbf{A}\mathbf{B} = \begin{pmatrix} 4 & 9 \\ 6 & 3 \end{pmatrix}

Important: matrix multiplication is not commutative. \mathbf{A}\mathbf{B} \neq \mathbf{B}\mathbf{A} in general — and sometimes \mathbf{B}\mathbf{A} is not even defined when \mathbf{A}\mathbf{B} is. This is a genuine departure from ordinary arithmetic, and it matters whenever you are rearranging matrix equations.
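
A quick numerical check of the worked example, and of non-commutativity, in NumPy:

  import numpy as np

  A = np.array([[2, 1], [0, 3]])
  B = np.array([[1, 4], [2, 1]])

  print(A @ B)   # [[4 9] [6 3]]  -- matches the worked example
  print(B @ A)   # [[2 13] [4 5]] -- a different matrix: AB != BA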

What multiplication means geometrically. Each column of \mathbf{A}\mathbf{B} is a linear combination of the columns of \mathbf{A}, with coefficients from the corresponding column of \mathbf{B}. This column-space interpretation is the key to understanding why the structure of \mathbf{A} determines the solvability of \mathbf{A}\mathbf{x} = \mathbf{b}.



45.3 Linear systems: Ax = b

A system of m linear equations in n unknowns has the form

How to read \mathbf{A}\mathbf{x}=\mathbf{b}

  • Symbol: \mathbf{A}\mathbf{x}=\mathbf{b}

  • Reads as: “A x equals b”

  • Means: a compact way to write many linear equations at once

  • Use when: coefficients and unknowns are easier to manage as structured objects (especially for large systems)

  • Common misread: \mathbf{A}\mathbf{x} is matrix multiplication (a weighted sum of columns), not componentwise multiplication

  • Symbol: [\mathbf{A}\,|\,\mathbf{b}]

  • Reads as: “A augmented with b”

  • Means: the coefficient matrix with the right-hand side appended as an extra column

  • Use when: doing row operations (Gaussian elimination) without rewriting equations

  • Common misread: row operations act on the whole augmented matrix, including the \mathbf{b} column

\begin{aligned} a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n &= b_1 \\ a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n &= b_2 \\ &\vdots \\ a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n &= b_m \end{aligned}

In matrix notation this is \mathbf{A}\mathbf{x} = \mathbf{b}, where \mathbf{A} is the m \times n coefficient matrix, \mathbf{x} is the n \times 1 vector of unknowns, and \mathbf{b} is the m \times 1 right-hand side vector.

The compactness of this notation is not cosmetic. Once you write a system as \mathbf{A}\mathbf{x} = \mathbf{b}, the structure of the solution set is encoded in \mathbf{A} — its rank, its column space, its null space. You can reason about uniqueness and the number of free variables before touching the specific values in \mathbf{b} (whether a solution exists at all also depends on \mathbf{b}, as Section 45.5 makes precise).

45.3.1 The augmented matrix

The working object for solving a linear system is the augmented matrix [\mathbf{A}\,|\,\mathbf{b}]: the coefficient matrix with \mathbf{b} appended as an extra column, separated by a vertical bar.

[\mathbf{A}\,|\,\mathbf{b}] = \left(\begin{array}{ccc|c} a_{11} & \cdots & a_{1n} & b_1 \\ \vdots & \ddots & \vdots & \vdots \\ a_{m1} & \cdots & a_{mn} & b_m \end{array}\right)

Working on the augmented matrix instead of the equations directly keeps things compact and avoids rewriting variable names at every step.
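
In code, "append \mathbf{b} as an extra column" is a single call. A minimal sketch, using the system solved by hand in the next section:

  import numpy as np

  A = np.array([[2, 4, -2], [1, 3, 1], [-1, 2, 3]], dtype=float)
  b = np.array([2, 10, 8], dtype=float)

  aug = np.column_stack([A, b])   # the augmented matrix [A | b]
  print(aug)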

45.3.2 What a solution means geometrically

Each equation a_{i1}x_1 + a_{i2}x_2 + \cdots + a_{in}x_n = b_i is a hyperplane in \mathbb{R}^n. Solving the system means finding the intersection of all m hyperplanes.

For n = 2 (two unknowns): each equation is a line in the plane. Two lines either intersect at one point (unique solution), are parallel (no solution), or are the same line (infinitely many solutions).

For n = 3: each equation is a plane in three-dimensional space. Three planes can intersect at one point, at a line, at a plane, or not at all. Gaussian elimination finds which case applies and produces the solution in each case.


45.4 Gaussian elimination

Row operations. The three operations that can be applied to rows of an augmented matrix without changing the solution set are:

  1. Swap two rows: R_i \leftrightarrow R_j
  2. Scale a row by a nonzero constant: R_i \leftarrow \lambda R_i (\lambda \neq 0)
  3. Add a multiple of one row to another: R_i \leftarrow R_i + \mu R_j

These are called elementary row operations. They are reversible, so they transform the system into an equivalent one — same solution set, different representation.

The algorithm. Gaussian elimination applies row operations to reduce [\mathbf{A}\,|\,\mathbf{b}] to upper triangular (also called row echelon) form, where all entries below the main diagonal are zero. Back-substitution then extracts the unknowns from bottom to top.
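
The whole algorithm fits in a page of code. The sketch below is a teaching implementation with partial pivoting; production work should call np.linalg.solve, which wraps LAPACK. Running it on the 3×3 system of the next subsection returns [5.5, 0, 4.5], matching the hand computation.

  import numpy as np

  def gaussian_solve(A, b):
      """Solve Ax = b by Gaussian elimination with partial pivoting."""
      A = A.astype(float).copy()
      b = b.astype(float).copy()
      n = len(b)
      # Phase 1: forward elimination to upper triangular form.
      for j in range(n - 1):
          # Partial pivoting: bring the largest entry in column j up to the pivot row.
          p = j + np.argmax(np.abs(A[j:, j]))
          if A[p, j] == 0.0:
              raise ValueError("singular matrix: no nonzero pivot in column %d" % j)
          A[[j, p]], b[[j, p]] = A[[p, j]], b[[p, j]]
          for i in range(j + 1, n):
              m = A[i, j] / A[j, j]          # elimination multiplier
              A[i, j:] -= m * A[j, j:]
              b[i] -= m * b[j]
      # Phase 2: back-substitution, bottom row upward.
      x = np.zeros(n)
      for i in range(n - 1, -1, -1):
          x[i] = (b[i] - A[i, i + 1:] @ x[i + 1:]) / A[i, i]
      return x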

45.4.1 Worked example: 3×3 system

Solve:

\begin{aligned} 2x_1 + 4x_2 - 2x_3 &= 2 \\ x_1 + 3x_2 + x_3 &= 10 \\ -x_1 + 2x_2 + 3x_3 &= 8 \end{aligned}

Step 1. Write the augmented matrix:

\left(\begin{array}{rrr|r} 2 & 4 & -2 & 2 \\ 1 & 3 & 1 & 10 \\ -1 & 2 & 3 & 8 \end{array}\right)

Step 2. Use row 1 as the pivot row to eliminate x_1 from rows 2 and 3.

R_2 \leftarrow R_2 - \tfrac{1}{2}R_1:

\left(\begin{array}{rrr|r} 2 & 4 & -2 & 2 \\ 0 & 1 & 2 & 9 \\ -1 & 2 & 3 & 8 \end{array}\right)

R_3 \leftarrow R_3 + \tfrac{1}{2}R_1:

\left(\begin{array}{rrr|r} 2 & 4 & -2 & 2 \\ 0 & 1 & 2 & 9 \\ 0 & 4 & 2 & 9 \end{array}\right)

Step 3. Use row 2 as the pivot row to eliminate x_2 from row 3.

R_3 \leftarrow R_3 - 4R_2:

\left(\begin{array}{rrr|r} 2 & 4 & -2 & 2 \\ 0 & 1 & 2 & 9 \\ 0 & 0 & -6 & -27 \end{array}\right)

This is upper triangular form. The system now reads:

\begin{aligned} 2x_1 + 4x_2 - 2x_3 &= 2 \\ x_2 + 2x_3 &= 9 \\ -6x_3 &= -27 \end{aligned}

Step 4. Back-substitution.

From row 3: x_3 = \tfrac{-27}{-6} = \tfrac{9}{2}.

From row 2: x_2 = 9 - 2 \cdot \tfrac{9}{2} = 9 - 9 = 0.

From row 1: 2x_1 = 2 - 4(0) + 2(\tfrac{9}{2}) = 2 + 9 = 11, so x_1 = \tfrac{11}{2}.

Solution: x_1 = \tfrac{11}{2}, x_2 = 0, x_3 = \tfrac{9}{2}.

45.4.2 The pivot

At each elimination step, the diagonal element used to eliminate the column below it is called the pivot. The pivot must be nonzero. If it is zero, swap the current row with any row below it that has a nonzero entry in that column (partial pivoting). If no such row exists, the matrix is singular — the system either has no solution or infinitely many.

In practice, numerical software always uses partial pivoting even when the pivot is technically nonzero, to avoid large round-off errors from dividing by very small numbers.
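
A short experiment shows what goes wrong without pivoting: with a pivot of 10^{-20}, the naive multiplier is 10^{20}, and cancellation wipes out x_1. The numbers are illustrative, not from the text.

  import numpy as np

  A = np.array([[1e-20, 1.0], [1.0, 1.0]])   # true solution is close to x1 = x2 = 1
  b = np.array([1.0, 2.0])

  m = A[1, 0] / A[0, 0]                       # naive multiplier: 1e20
  x2 = (b[1] - m * b[0]) / (A[1, 1] - m * A[0, 1])
  x1 = (b[0] - A[0, 1] * x2) / A[0, 0]
  print(x1, x2)                               # x1 comes out 0.0 -- badly wrong

  print(np.linalg.solve(A, b))                # pivots internally: close to [1, 1]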


45.4.3 Gauss-Jordan elimination and RREF

Gauss-Jordan elimination continues beyond upper triangular form: it also eliminates entries above each pivot, reducing the matrix to reduced row echelon form (RREF). In RREF every pivot is 1, and every other entry in the pivot column is 0. The unknowns can be read off directly — no back-substitution needed.

For large systems Gaussian elimination with back-substitution is more efficient than full Gauss-Jordan. For small systems, and especially for finding matrix inverses, RREF is convenient.

Continuing the worked example. Starting from the upper triangular form and scaling:

R_3 \leftarrow -\tfrac{1}{6}R_3 and R_1 \leftarrow \tfrac{1}{2}R_1 (row 2 already has a pivot of 1):

\left(\begin{array}{rrr|r} 1 & 2 & -1 & 1 \\ 0 & 1 & 2 & 9 \\ 0 & 0 & 1 & 9/2 \end{array}\right)

R_2 \leftarrow R_2 - 2R_3:

\left(\begin{array}{rrr|r} 1 & 2 & -1 & 1 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 9/2 \end{array}\right)

R_1 \leftarrow R_1 + R_3, then R_1 \leftarrow R_1 - 2R_2:

\left(\begin{array}{rrr|r} 1 & 0 & 0 & 11/2 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 9/2 \end{array}\right)

The solution x_1 = \tfrac{11}{2}, x_2 = 0, x_3 = \tfrac{9}{2} is visible directly in the right-hand column. Same answer, different path.
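
SymPy will reproduce the whole RREF computation in exact rational arithmetic, which makes it a convenient check on hand work:

  from sympy import Matrix

  aug = Matrix([[2, 4, -2, 2], [1, 3, 1, 10], [-1, 2, 3, 8]])
  R, pivot_cols = aug.rref()
  print(R)            # Matrix([[1, 0, 0, 11/2], [0, 1, 0, 0], [0, 0, 1, 9/2]])
  print(pivot_cols)   # (0, 1, 2): a pivot in every unknown's column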


45.5 Solution types and rank

Not every system \mathbf{A}\mathbf{x} = \mathbf{b} has a unique solution. Three outcomes are possible, and the row-reduced form reveals which one applies.

How to read rank

  • Symbol: \mathrm{rank}(\mathbf{A})
  • Reads as: “rank of A”
  • Means: the number of independent rows/columns (the number of pivots after row reduction)
  • Use when: predicting whether constraints are independent and whether a solution is determined
  • Common misread: rank is not “how big the matrix is”; it is how much independent information it contains

45.5.1 The three cases

Case 1: unique solution. Every column of \mathbf{A} has a pivot, and no row reduces to an impossible equation. Each unknown is pinned down by its own pivot, so the solution is determined completely.

Case 2: no solution (inconsistent). After row reduction, at least one row has the form [0 \; 0 \; \cdots \; 0 \;|\; c] with c \neq 0. This represents the equation 0 = c, which is impossible. Geometrically: the hyperplanes do not share a common point.

Case 3: infinitely many solutions (underdetermined). After row reduction, at least one column has no pivot — the corresponding variable is free, meaning it can take any value. Each free variable introduces one dimension of solutions. Geometrically: the hyperplanes intersect along a line, a plane, or a higher-dimensional subspace.


45.5.2 Rank

The rank of a matrix \mathbf{A}, written \text{rank}(\mathbf{A}), is the number of pivots in its row-reduced form — equivalently, the number of linearly independent rows (or columns).

For an m \times n matrix: \text{rank}(\mathbf{A}) \leq \min(m, n).

Solvability criterion. The system \mathbf{A}\mathbf{x} = \mathbf{b} has at least one solution if and only if \text{rank}(\mathbf{A}) = \text{rank}([\mathbf{A}\,|\,\mathbf{b}]). When this holds:

  • If \text{rank}(\mathbf{A}) = n (all unknowns are pivot variables), the solution is unique.
  • If \text{rank}(\mathbf{A}) < n, there are n - \text{rank}(\mathbf{A}) free variables and infinitely many solutions.
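
The criterion translates directly into code. A sketch with a deliberately rank-deficient matrix (the numbers are illustrative):

  import numpy as np

  A = np.array([[1, 2, -1], [2, 4, 1], [3, 6, 0]], dtype=float)  # column 2 = 2 x column 1
  b = np.array([1.0, 2.0, 3.0])

  r_A   = np.linalg.matrix_rank(A)
  r_aug = np.linalg.matrix_rank(np.column_stack([A, b]))
  print(r_A, r_aug)   # 2 2: consistent, with n - rank = 1 free variable
                      # with b = (1, 2, 4) the ranks become 2 and 3: no solution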

45.5.3 Rank-nullity theorem

For an n-column matrix \mathbf{A}:

\text{rank}(\mathbf{A}) + \text{nullity}(\mathbf{A}) = n

The nullity is the number of free variables — the dimension of the null space (the set of all solutions to \mathbf{A}\mathbf{x} = \mathbf{0}). This theorem says that the number of “determined” directions (rank) plus the number of “undetermined” directions (nullity) always adds up to the total number of unknowns. It is stated here for completeness; it becomes central in the chapter on eigenvalues.
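
SciPy computes a null-space basis directly, so the theorem can be checked numerically. A sketch reusing the rank-2 matrix from the example above:

  import numpy as np
  from scipy.linalg import null_space

  A = np.array([[1, 2, -1], [2, 4, 1], [3, 6, 0]], dtype=float)

  r = np.linalg.matrix_rank(A)   # 2
  N = null_space(A)              # columns form a basis for the solutions of Ax = 0
  print(r + N.shape[1])          # 3 = n: rank + nullity = number of columns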

45.5.4 What rank tells you before you compute

If you have a 3 \times 3 system and you can see by inspection that one equation is a linear combination of the other two (for example, equation 3 = equation 1 + equation 2), then \text{rank}(\mathbf{A}) \leq 2 < 3 regardless of \mathbf{b}. The system is either inconsistent or underdetermined. Checking rank first can save significant computation.



45.6 LU factorisation

Gaussian elimination solves one system \mathbf{A}\mathbf{x} = \mathbf{b} at a time. In engineering, the same matrix \mathbf{A} often appears with many different right-hand sides \mathbf{b}_1, \mathbf{b}_2, \ldots, \mathbf{b}_k — for example, a structural stiffness matrix under different load patterns. Re-running Gaussian elimination from scratch each time wastes the work already done.

LU factorisation stores the elimination steps. It factors \mathbf{A} into:

\mathbf{A} = \mathbf{L}\mathbf{U}

where \mathbf{L} is a lower triangular matrix (with 1s on the diagonal) and \mathbf{U} is the upper triangular matrix produced by elimination. The elimination multipliers — the values m_{ij} used to zero out the entry below each pivot, computed from the current, partially eliminated matrix (for example m_{32} = a_{32}^{(1)}/a_{22}^{(1)}) — become the off-diagonal entries of \mathbf{L}.

45.6.1 Solving with LU

Given \mathbf{A} = \mathbf{L}\mathbf{U}, solving \mathbf{A}\mathbf{x} = \mathbf{b} becomes two triangular solves:

  1. Solve \mathbf{L}\mathbf{y} = \mathbf{b} for \mathbf{y} by forward substitution (top to bottom, since \mathbf{L} is lower triangular).
  2. Solve \mathbf{U}\mathbf{x} = \mathbf{y} for \mathbf{x} by back-substitution (bottom to top, since \mathbf{U} is upper triangular).

Each triangular solve costs O(n^2) operations. The factorisation itself costs O(n^3) — the same as Gaussian elimination — but it only needs to happen once per matrix. Subsequent solves for different \mathbf{b} cost only O(n^2) each.
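
This factor-once, solve-many pattern is exactly what scipy.linalg.lu_factor and lu_solve package up. A sketch using the matrix factorised by hand in the worked example below:

  import numpy as np
  from scipy.linalg import lu_factor, lu_solve

  A = np.array([[2, 4, -2], [1, 3, 1], [-1, 2, 3]], dtype=float)

  lu, piv = lu_factor(A)                                    # O(n^3), done once
  print(lu_solve((lu, piv), np.array([2.0, 10.0, 8.0])))    # O(n^2): [5.5 0.  4.5]
  print(lu_solve((lu, piv), np.array([1.0, 0.0, 0.0])))     # another b, no refactoring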

45.6.2 Worked example: 3×3 LU factorisation

Factorise:

\mathbf{A} = \begin{pmatrix} 2 & 4 & -2 \\ 1 & 3 & 1 \\ -1 & 2 & 3 \end{pmatrix}

Step 1. Column 1. Use a_{11} = 2 as pivot.

Multipliers: m_{21} = \tfrac{1}{2}, m_{31} = \tfrac{-1}{2}.

Apply R_2 \leftarrow R_2 - \tfrac{1}{2}R_1 and R_3 \leftarrow R_3 - (-\tfrac{1}{2})R_1 = R_3 + \tfrac{1}{2}R_1:

\begin{pmatrix} 2 & 4 & -2 \\ 0 & 1 & 2 \\ 0 & 4 & 2 \end{pmatrix}

Step 2. Column 2. Use a_{22}^{(1)} = 1 as pivot.

Multiplier: m_{32} = \tfrac{4}{1} = 4.

Apply R_3 \leftarrow R_3 - 4R_2:

\mathbf{U} = \begin{pmatrix} 2 & 4 & -2 \\ 0 & 1 & 2 \\ 0 & 0 & -6 \end{pmatrix}

Assembling L. The multipliers fill the lower triangle of \mathbf{L}, with 1s on the diagonal:

\mathbf{L} = \begin{pmatrix} 1 & 0 & 0 \\ 1/2 & 1 & 0 \\ -1/2 & 4 & 1 \end{pmatrix}

Verification: \mathbf{L}\mathbf{U} should recover \mathbf{A}. Check the (3,2) entry: (-\tfrac{1}{2})(4) + (4)(1) + (1)(0) = -2 + 4 = 2 ✓.
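
The full check is a one-liner in NumPy:

  import numpy as np

  L = np.array([[1, 0, 0], [0.5, 1, 0], [-0.5, 4, 1]])
  U = np.array([[2, 4, -2], [0, 1, 2], [0, 0, -6]])
  A = np.array([[2, 4, -2], [1, 3, 1], [-1, 2, 3]])

  print(np.allclose(L @ U, A))   # True: the factorisation reproduces A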

45.6.3 When LU fails

LU factorisation in the form above requires that every pivot encountered during elimination is nonzero. When a zero pivot arises — even if the matrix is not singular — rows must be swapped before continuing. This leads to PLU factorisation (\mathbf{P}\mathbf{A} = \mathbf{L}\mathbf{U}, where \mathbf{P} is a permutation matrix recording the row swaps), which is what production numerical libraries always compute. MATLAB’s lu() and SciPy’s scipy.linalg.lu() both return the PLU form.

45.6.4 Numerical perspective

For n > 4 or 5, Gaussian elimination and LU factorisation are almost always done by software. Understanding the algorithm tells you:

  • When it fails: a zero pivot signals a singular or near-singular matrix — the system is ill-conditioned and the solution, if it exists, may be numerically unreliable.
  • Why pivoting matters: without partial pivoting, round-off errors compound in a predictable direction and can make the computed solution wildly wrong even when a true solution exists.
  • What to trust: if NumPy’s linalg.solve() raises a LinAlgError: Singular matrix, the coefficient matrix is rank-deficient. The fix is not a different solver — it is a reconsideration of the physical model.
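
Seeing the failure once is instructive. A minimal example with an exactly singular matrix:

  import numpy as np

  A = np.array([[1.0, 2.0], [2.0, 4.0]])   # rank 1: row 2 = 2 x row 1
  try:
      np.linalg.solve(A, np.array([1.0, 2.0]))
  except np.linalg.LinAlgError as e:
      print(e)                              # Singular matrix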

45.7 Applications

45.7.1 Circuit mesh analysis: Kirchhoff’s voltage law

Consider a three-mesh resistive circuit. Applying Kirchhoff’s voltage law (KVL) around each mesh — the sum of voltage drops around any closed loop is zero — yields one linear equation per mesh in the mesh currents I_1, I_2, I_3.

For a specific circuit with resistances R_1 = 2\,\Omega, R_2 = 4\,\Omega, R_3 = 3\,\Omega, R_4 = 6\,\Omega, R_5 = 1\,\Omega and voltage sources V_1 = 12\,\text{V}, V_2 = 6\,\text{V}, the KVL equations might take the form:

\begin{aligned} (R_1 + R_2)I_1 - R_2 I_2 &= V_1 \\ -R_2 I_1 + (R_2 + R_3 + R_4)I_2 - R_4 I_3 &= 0 \\ -R_4 I_2 + (R_4 + R_5)I_3 &= -V_2 \end{aligned}

Substituting values:

\begin{aligned} 6I_1 - 4I_2 &= 12 \\ -4I_1 + 13I_2 - 6I_3 &= 0 \\ -6I_2 + 7I_3 &= -6 \end{aligned}

In matrix form \mathbf{A}\mathbf{x} = \mathbf{b}:

\begin{pmatrix} 6 & -4 & 0 \\ -4 & 13 & -6 \\ 0 & -6 & 7 \end{pmatrix} \begin{pmatrix} I_1 \\ I_2 \\ I_3 \end{pmatrix} = \begin{pmatrix} 12 \\ 0 \\ -6 \end{pmatrix}

The coefficient matrix is symmetric — a consequence of reciprocity in passive resistive networks. This is always the case when you apply KVL systematically; it is not a coincidence to exploit, it is a theorem to rely on.

Applying Gaussian elimination yields (after row reduction): I_1 = 2.37\,\text{A}, I_2 = 0.550\,\text{A}, I_3 = -0.385\,\text{A} (to three significant figures); the negative sign means I_3 circulates opposite to its assumed reference direction. Once the mesh currents are known, every voltage and power in the circuit is computable — the linear system is the complete solution to the circuit analysis problem.
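
These values are a one-call check in NumPy; the solver uses the same LU-with-pivoting machinery described above:

  import numpy as np

  A = np.array([[6, -4, 0], [-4, 13, -6], [0, -6, 7]], dtype=float)
  b = np.array([12.0, 0.0, -6.0])

  print(np.linalg.solve(A, b))   # approximately [ 2.367  0.550 -0.385]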

Scaling remark. A practical circuit board may have 50 to 500 mesh loops. The matrix is still symmetric, but 50 \times 50 by hand is not feasible. SPICE — the standard circuit simulator used by every electronics engineer — solves such systems using LU factorisation with partial pivoting, exactly the method in this chapter.

45.7.2 Structural force distribution

A statically determinate truss consists of members (bars) and joints (nodes). At each joint, equilibrium requires that the sum of forces in the horizontal direction is zero, and the sum in the vertical direction is zero. Each equilibrium equation is linear in the member forces. For a truss with n joints and m members plus three reaction forces, static determinacy gives m + 3 = 2n, and the full set of equilibrium equations is a square linear system.

For a simple three-member triangular truss with a vertical load P at the apex:

\begin{pmatrix} \cos\theta_1 & \cos\theta_2 & 0 \\ \sin\theta_1 & \sin\theta_2 & 0 \\ 0 & -1 & 1 \end{pmatrix} \begin{pmatrix} F_1 \\ F_2 \\ R \end{pmatrix} = \begin{pmatrix} 0 \\ P \\ 0 \end{pmatrix}

where F_1, F_2 are member forces, R is a reaction force, and \theta_1, \theta_2 are member angles. Solving this gives the force in each member — positive values indicate tension, negative indicate compression. A member under too much compression will buckle; too much tension and it will yield. The linear system does not just find numbers: it tells the structural engineer which members to upsize.
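
To make the system concrete, here is a sketch with assumed geometry: both members at 45° to the horizontal (\theta_1 = 45°, \theta_2 = 135°) and P = 10\,\text{kN}. These numbers are illustrative, not from the text.

  import numpy as np

  t1, t2, P = np.radians(45), np.radians(135), 10.0   # assumed angles and load

  A = np.array([[np.cos(t1), np.cos(t2), 0.0],
                [np.sin(t1), np.sin(t2), 0.0],
                [0.0,        -1.0,       1.0]])
  F = np.linalg.solve(A, np.array([0.0, P, 0.0]))
  print(F)   # F1 = F2 = R, about 7.07 kN each, under this symmetric geometry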


45.8 What you can do now

You can now compress a system into \mathbf{A}\mathbf{x}=\mathbf{b}, solve it reliably by row reduction, and use rank language to predict whether a model is determined, inconsistent, or underdetermined. You also have the practical engineering idea of LU: factor once, then solve many times.

45.9 Exercises

These exercises use real engineering and science contexts. The interesting part is setting up the augmented matrix correctly and identifying which solution case applies.


45.9.1 Exercise 1: Node admittance matrix multiplication

A circuit has three nodes. The node admittance matrix \mathbf{Y} and a voltage vector \mathbf{V} are:

\mathbf{Y} = \begin{pmatrix} 5 & -2 & -1 \\ -2 & 6 & -3 \\ -1 & -3 & 4 \end{pmatrix}, \qquad \mathbf{V} = \begin{pmatrix} 10 \\ 5 \\ 2 \end{pmatrix} \text{ V}

Compute the current injection vector \mathbf{I} = \mathbf{Y}\mathbf{V}.


45.9.2 Exercise 2: Mesh current analysis by Gaussian elimination

Three mesh currents I_1, I_2, I_3 (in amperes) in a resistive network satisfy:

\begin{aligned} 8I_1 - 2I_2 - 0I_3 &= 16 \\ -2I_1 + 7I_2 - 3I_3 &= 0 \\ 0I_1 - 3I_2 + 5I_3 &= -6 \end{aligned}

Find I_1, I_2, I_3 by Gaussian elimination.


45.9.3 Exercise 3: RREF and solution type (underdetermined structural system)

A redundant structure gives three equilibrium equations in four unknown member forces F_1, F_2, F_3, F_4:

\begin{aligned} F_1 + 2F_2 - F_3 + F_4 &= 10 \\ 2F_1 + 4F_2 + F_3 - F_4 &= 8 \\ F_1 + 2F_2 + 3F_3 - 3F_4 &= -16 \end{aligned}

Row-reduce to RREF. Identify the free variable(s) and write the general solution.


45.9.4 Exercise 4: Rank of a sensor measurement matrix

A sensor network takes four measurements of three physical quantities (temperature, pressure, flow rate). The measurement matrix is:

\mathbf{M} = \begin{pmatrix} 1 & 2 & 3 \\ 2 & 4 & 6 \\ 1 & 0 & 2 \\ 3 & 2 & 7 \end{pmatrix}

Find \text{rank}(\mathbf{M}). What does this tell you about the sensor network?


45.9.5 Exercise 5: LU factorisation for repeated solves

A heat-transfer network has conductance matrix:

\mathbf{K} = \begin{pmatrix} 4 & -1 & 0 \\ -1 & 3 & -1 \\ 0 & -1 & 2 \end{pmatrix}

Find the LU factorisation of \mathbf{K}, then use it to solve \mathbf{K}\mathbf{T} = \mathbf{q} for two different heat-flux vectors:

\mathbf{q}_1 = \begin{pmatrix} 8 \\ 4 \\ 2 \end{pmatrix}, \qquad \mathbf{q}_2 = \begin{pmatrix} 0 \\ 6 \\ 4 \end{pmatrix}


45.9.6 Exercise 6: Write and solve the linear system for a 3-mesh circuit

A three-mesh circuit is described below. Applying Kirchhoff’s voltage law around each mesh gives one equation in the mesh currents I_1, I_2, I_3 (in A):

  • Mesh 1: A 10 Ω resistor carries I_1; a 5 Ω resistor is shared with mesh 2 carrying I_1 - I_2. Voltage source: 20 V.
  • Mesh 2: The shared 5 Ω resistor carries I_2 - I_1; a 10 Ω resistor carries I_2; a 4 Ω resistor is shared with mesh 3 carrying I_2 - I_3. No voltage source.
  • Mesh 3: The shared 4 Ω resistor carries I_3 - I_2; a 6 Ω resistor carries I_3. Voltage source: −8 V (opposing reference direction).

Write \mathbf{A}\mathbf{x} = \mathbf{b} explicitly, then solve using Gaussian elimination.