Get AI summaries of any video or article — Sign up free
A quick trick for computing eigenvalues | Chapter 15, Essence of linear algebra thumbnail

A quick trick for computing eigenvalues | Chapter 15, Essence of linear algebra

3Blue1Brown·
4 min read

Based on 3Blue1Brown's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

For a 2×2 matrix, the trace equals the sum of the eigenvalues, so the mean of the eigenvalues equals the mean of the diagonal entries.

Briefing

For 2×2 matrices, eigenvalues can be computed almost instantly by reading two numbers off the matrix—its trace and determinant—then using a “mean–product” version of the quadratic formula. The payoff is speed and, more importantly, a clearer mental model: the trace gives the sum (and thus the mean) of the eigenvalues, while the determinant gives their product. Once those are known, the eigenvalues must sit symmetrically around the mean, so the remaining work reduces to a single square root.

The standard route starts from the eigenvalue condition: if λ is an eigenvalue of a matrix A, then A − λI sends a nonzero vector to zero, forcing det(A − λI)=0. For 2×2 matrices, that determinant becomes a quadratic characteristic polynomial, whose roots are the eigenvalues. Solving it typically means expanding, simplifying, and then applying the quadratic formula—work that’s not terrible, but it’s avoidable.

The shortcut relies on three facts, of which the first two are especially quick to extract from the matrix. First, the trace (sum of the diagonal entries) equals the sum of the eigenvalues. Equivalently, the mean of the two eigenvalues equals the mean of the two diagonal entries. Second, the determinant equals the product of the eigenvalues. For a concrete example, a matrix with diagonal entries 8 and 6 has eigenvalue mean 7, and if its determinant is 40, then the eigenvalues multiply to 40.

The third fact turns those two pieces into the actual eigenvalues. If two numbers have mean m and product p, they must be m ± d for some distance d. Their product becomes (m+d)(m−d)=m²−d², so d²=m²−p. That yields the general formula: the eigenvalues are m ± √(m²−p). This is essentially the difference-of-squares identity repackaged for eigenvalue problems, and it can be re-derived quickly if forgotten.

The method is demonstrated on specific 2×2 matrices where the trace and determinant immediately produce the eigenvalues with minimal algebra. It’s also connected to physics through the Pauli spin matrices, which have eigenvalues ±1. In that quantum-mechanics setting, the eigenvalues correspond to the discrete outcomes of spin measurements along chosen axes.

A key nuance: for those particular Pauli matrices, the traditional characteristic-polynomial method can be just as fast or faster. The real advantage appears when forming linear combinations of them—matrices built from a, b, c coefficients representing a direction in space. In that more general case, the trace remains easy to read (often still 0), and the determinant can still be checked quickly; then the mean–product formula pins down the eigenvalues without the heavier characteristic-polynomial expansion.

Finally, the shortcut is framed as a meaningful, not merely memorized, quadratic-solver: it mirrors how quadratic roots are determined by sum and product. The broader lesson is that trace and determinant aren’t just bookkeeping—they directly encode eigenvalue information, letting students skip the intermediate characteristic polynomial and write the roots straight from the matrix.

Cornell Notes

For any 2×2 matrix, the eigenvalues can be found using only the trace and determinant. The trace equals the sum of the eigenvalues, so the mean of the eigenvalues matches the mean of the diagonal entries. The determinant equals the product of the eigenvalues. If the eigenvalues have mean m and product p, they must be m ± √(m²−p). This turns eigenvalue-finding into a fast, “read-off-then-sqrt” procedure and helps connect trace/determinant directly to eigenvalue structure (symmetry around the mean).

Why does det(A − λI)=0 determine eigenvalues?

An eigenvalue λ means there exists a nonzero vector v such that Av = λv. Rearranging gives (A − λI)v = 0. Since v is nonzero, the matrix A − λI must be singular, which happens exactly when det(A − λI)=0.

How do trace and determinant relate to the eigenvalues of a 2×2 matrix?

For a 2×2 matrix, the trace (sum of diagonal entries) equals the sum of the eigenvalues. The determinant equals the product of the eigenvalues. So if the diagonal entries are a and d, then the eigenvalue mean is (a+d)/2, and the eigenvalue product is det(A).

How does knowing the mean m and product p let you recover the two eigenvalues?

If the eigenvalues are x and y, then m=(x+y)/2 and p=xy. Writing x=m+d and y=m−d makes xy=(m+d)(m−d)=m²−d². Setting this equal to p gives d²=m²−p, so x,y = m ± √(m²−p).

Work through the example with matrix [[3,1],[4,1]]. What are the eigenvalues?

Mean m comes from the diagonal entries 3 and 1: m=(3+1)/2=2. The determinant is 3·1 − 1·4 = −1, so p=−1. Then eigenvalues are m ± √(m²−p)=2 ± √(4−(−1))=2 ± √5.

Why does the shortcut become especially useful for linear combinations of Pauli spin matrices?

For individual Pauli matrices, characteristic polynomials can be simple enough that the traditional method is comparable. But for a general linear combination a(σx)+b(σy)+c(σz) with a²+b²+c²=1, the trace stays easy to read (often 0), and the determinant can still be checked quickly. With mean m and product p in hand, the eigenvalues follow from m ± √(m²−p) without expanding a characteristic polynomial.

Review Questions

  1. Given a 2×2 matrix with trace 10 and determinant 21, what eigenvalues does the mean–product formula predict?
  2. For eigenvalues x and y with mean m and product p, derive the expression for x and y using only m and p.
  3. Explain how the trace and determinant of a 2×2 matrix encode the sum and product of its eigenvalues, and why that makes the characteristic polynomial unnecessary for this shortcut.

Key Points

  1. 1

    For a 2×2 matrix, the trace equals the sum of the eigenvalues, so the mean of the eigenvalues equals the mean of the diagonal entries.

  2. 2

    For a 2×2 matrix, the determinant equals the product of the eigenvalues.

  3. 3

    If two numbers have mean m and product p, they are m ± √(m²−p), because (m+d)(m−d)=m²−d².

  4. 4

    Eigenvalue computation can often skip characteristic-polynomial expansion: read trace and determinant directly, then take one square root.

  5. 5

    The method is especially helpful when eigenvalues are needed repeatedly, such as when analyzing linear combinations of matrices.

  6. 6

    Trace/determinant-based reasoning connects eigenvalues to geometric intuition: trace tracks stretching in aggregate (sum), while determinant tracks area/volume scaling (product).

  7. 7

    For some structured matrices (like individual Pauli matrices), the traditional method may be just as fast, but the shortcut shines in more general combinations.

Highlights

Eigenvalues of a 2×2 matrix can be written immediately as m ± √(m²−p), where m is the mean of the diagonal entries (half the trace) and p is the determinant.
The shortcut is a difference-of-squares identity in disguise: if eigenvalues are m±d, their product becomes m²−d².
Pauli spin matrices have eigenvalues ±1, illustrating how eigenvalue computation links directly to discrete measurement outcomes in quantum mechanics.
The biggest speedup appears when taking linear combinations of Pauli matrices, where characteristic-polynomial work becomes harder to do mentally.

Topics

Mentioned