Abstract Linear Algebra 48 | Proof of Spectral Theorem

TL;DR

Unitary diagonalizability means U*AU = D for diagonal D, and diagonal matrices satisfy D*D = DD*, which forces A*A = AA*.

Briefing Cornell Notes

Briefing

Normal matrices over complex numbers are exactly the matrices that can be diagonalized by a unitary change of basis, and the proof hinges on showing that any “Schur form” of a normal matrix collapses from upper triangular to diagonal.

First comes the easy direction: if a matrix A is unitarily diagonalizable, so that U*AU = D for some diagonal D and unitary U, then A must be normal. The reason is algebraic and local to the diagonal. Since D is diagonal, it commutes with its adjoint: D*D = DD*. Writing A in terms of U and D, one computes A*A and AA* and finds both reduce to U(D*D)U* and U(DD*)U*, which match because D*D = DD*. This establishes that unitary diagonalizability implies normality.

The harder direction starts with a normal matrix A and uses Schur decomposition. Because A is square, there exists a unitary U such that U*AU = R, where R is upper triangular. The key move is to show that normality transfers to R: if A*A = AA*, then the same equality holds for R, since the unitary factors cancel in the corresponding products. So the problem becomes: what do normal upper triangular matrices look like?

A 2×2 calculation gives the pattern. Let R be upper triangular with entries a, b, c, so R = [[a, b],[0, c]]. Computing the (1,1) entry of RR* and R*R yields expressions involving |a|^2 and |b|^2. Equality of the two products forces |b|^2 = 0, hence b = 0. That eliminates the only possible off-diagonal entry in a 2×2 upper triangular matrix, proving that a normal 2×2 upper triangular matrix must be diagonal.

The general case follows by induction on the size n. Assume every normal n×n upper triangular matrix is diagonal. For an (n+1)×(n+1) normal upper triangular matrix R, the first column below the (1,1) entry is already zero by triangularity, so R can be written in block form: R = [[a, W*],[0, R̃]], where W is an n-dimensional column vector and R̃ is an n×n upper triangular block. Normality forces the (1,1) entries of RR* and R*R to match. Those entries become |a|^2 + ||W||^2 on one side and |a|^2 on the other, so ||W||^2 must be 0 and therefore W = 0. With W gone, R̃ remains normal and upper triangular, so the induction hypothesis makes R̃ diagonal. Putting the blocks together shows R itself is diagonal.

Since R is diagonal in the Schur form, U*AU is diagonal, meaning A is unitarily diagonalizable. The spectral theorem is thus proven for normal matrices: normality is equivalent to unitary diagonalizability. A major consequence highlighted at the end is that self-adjoint matrices are a special case of normal matrices, so they too are unitarily diagonalizable—setting up the most important diagonalization used later in the course.

Cornell Notes

For a complex square matrix A, normality (A*A = AA*) is equivalent to being unitarily diagonalizable. The easy direction uses the fact that if U*AU = D with D diagonal, then D*D = DD*, which forces A*A = AA*. The key work is the reverse direction: start with A normal, apply Schur decomposition to get U*AU = R where R is upper triangular, and then prove that normality forces R to be diagonal. A direct 2×2 computation shows a normal upper triangular matrix cannot have a nonzero superdiagonal entry. The general proof uses induction on n with a block decomposition R = [[a, W*],[0, R̃]]; matching the (1,1) entries of RR* and R*R forces W = 0, and induction makes R̃ diagonal. Therefore U*AU is diagonal, so A is unitarily diagonalizable.

Why does unitary diagonalizability automatically imply normality?

If U*AU = D with D diagonal and U unitary, then D commutes with its adjoint: D*D = DD*. Writing A = UDU*, one computes A*A and AA*; both simplify to U(D*D)U* and U(DD*)U* respectively. Since D*D = DD*, the two products match, giving A*A = AA*.

How does Schur decomposition reduce the problem for a normal matrix A?

For any complex square matrix A, Schur decomposition provides a unitary U such that U*AU = R, where R is upper triangular. When A is normal, the equality A*A = AA* transfers through the unitary similarity, forcing R to be normal as well. So the task becomes: classify normal upper triangular matrices.

What does the 2×2 case prove about normal upper triangular matrices?

Let R = [[a, b],[0, c]] be upper triangular. Computing the (1,1) entry of RR* gives |a|^2 + |b|^2, while the (1,1) entry of R*R gives |a|^2. Normality requires these to be equal, so |b|^2 = 0 and thus b = 0. With the only possible off-diagonal entry forced to vanish, R must be diagonal.

In the induction step, how does normality force the off-diagonal block W to vanish?

Write the (n+1)×(n+1) upper triangular normal matrix in blocks: R = [[a, W*],[0, R̃]], where W is an n-dimensional column vector and R̃ is n×n upper triangular. The (1,1) entry of RR* becomes |a|^2 + W*W = |a|^2 + ||W||^2, while the (1,1) entry of R*R becomes |a|^2. Equality from normality implies ||W||^2 = 0, so W = 0.

Why does the induction hypothesis finish the proof once W = 0?

With W = 0, the matrix R becomes block diagonal with blocks a and R̃. The block R̃ inherits normality (since it sits inside the normal matrix R and the unitary-free block structure preserves the relevant equality), and it remains upper triangular. The induction hypothesis then forces R̃ to be diagonal, so the whole matrix R is diagonal.

Review Questions

What algebraic property of diagonal matrices makes the implication “unitarily diagonalizable ⇒ normal” immediate?
How does the proof use the (1,1) entry of RR* and R*R to eliminate off-diagonal entries in both the 2×2 and general n×n cases?
Why does normality of A imply normality of the Schur form R = U*AU?

Key Points

1
Unitary diagonalizability means U*AU = D for diagonal D, and diagonal matrices satisfy D*D = DD*, which forces A*A = AA*.
2
Schur decomposition turns any complex square matrix A into U*AU = R with R upper triangular using a unitary U.
3
If A is normal, then the Schur form R must also be normal because the unitary similarity preserves the equality A*A = AA*.
4
A normal 2×2 upper triangular matrix must be diagonal: normality forces the single superdiagonal entry to be zero.
5
For an (n+1)×(n+1) normal upper triangular matrix written as [[a, W*],[0, R̃]], matching the (1,1) entries of RR* and R*R forces W = 0.
6
After W vanishes, the induction hypothesis applies to the remaining upper triangular block R̃, making it diagonal and hence making R diagonal.
7
Since the Schur form of a normal matrix is diagonal, the original matrix is unitarily diagonalizable, completing the spectral theorem for normal matrices.

Highlights

Normality plus Schur decomposition turns the classification problem into understanding normal upper triangular matrices.

In the 2×2 case, comparing the (1,1) entries of RR* and R*R forces the superdiagonal entry to vanish.

The induction step uses a block form R = [[a, W*],[0, R̃]]; normality forces ||W||^2 = 0, so W = 0.

Once the off-diagonal block disappears, the remaining upper triangular block is normal and diagonal by induction.

The spectral theorem follows: normal matrices are exactly those unitarily diagonalizable matrices.

Topics

Spectral Theorem
Normal Matrices
Schur Decomposition
Unitary Diagonalization
Induction Proof