Abstract Linear Algebra 48 | Proof of Spectral Theorem
Based on The Bright Side of Mathematics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Unitary diagonalizability means U*AU = D for diagonal D, and diagonal matrices satisfy D*D = DD*, which forces A*A = AA*.
Briefing
Normal matrices over complex numbers are exactly the matrices that can be diagonalized by a unitary change of basis, and the proof hinges on showing that any “Schur form” of a normal matrix collapses from upper triangular to diagonal.
First comes the easy direction: if a matrix A is unitarily diagonalizable, so that U*AU = D for some diagonal D and unitary U, then A must be normal. The reason is algebraic and local to the diagonal. Since D is diagonal, it commutes with its adjoint: D*D = DD*. Writing A in terms of U and D, one computes A*A and AA* and finds both reduce to U(D*D)U* and U(DD*)U*, which match because D*D = DD*. This establishes that unitary diagonalizability implies normality.
The harder direction starts with a normal matrix A and uses Schur decomposition. Because A is square, there exists a unitary U such that U*AU = R, where R is upper triangular. The key move is to show that normality transfers to R: if A*A = AA*, then the same equality holds for R, since the unitary factors cancel in the corresponding products. So the problem becomes: what do normal upper triangular matrices look like?
A 2×2 calculation gives the pattern. Let R be upper triangular with entries a, b, c, so R = [[a, b],[0, c]]. Computing the (1,1) entry of RR* and R*R yields expressions involving |a|^2 and |b|^2. Equality of the two products forces |b|^2 = 0, hence b = 0. That eliminates the only possible off-diagonal entry in a 2×2 upper triangular matrix, proving that a normal 2×2 upper triangular matrix must be diagonal.
The general case follows by induction on the size n. Assume every normal n×n upper triangular matrix is diagonal. For an (n+1)×(n+1) normal upper triangular matrix R, the first column below the (1,1) entry is already zero by triangularity, so R can be written in block form: R = [[a, W*],[0, R̃]], where W is an n-dimensional column vector and R̃ is an n×n upper triangular block. Normality forces the (1,1) entries of RR* and R*R to match. Those entries become |a|^2 + ||W||^2 on one side and |a|^2 on the other, so ||W||^2 must be 0 and therefore W = 0. With W gone, R̃ remains normal and upper triangular, so the induction hypothesis makes R̃ diagonal. Putting the blocks together shows R itself is diagonal.
Since R is diagonal in the Schur form, U*AU is diagonal, meaning A is unitarily diagonalizable. The spectral theorem is thus proven for normal matrices: normality is equivalent to unitary diagonalizability. A major consequence highlighted at the end is that self-adjoint matrices are a special case of normal matrices, so they too are unitarily diagonalizable—setting up the most important diagonalization used later in the course.
Cornell Notes
For a complex square matrix A, normality (A*A = AA*) is equivalent to being unitarily diagonalizable. The easy direction uses the fact that if U*AU = D with D diagonal, then D*D = DD*, which forces A*A = AA*. The key work is the reverse direction: start with A normal, apply Schur decomposition to get U*AU = R where R is upper triangular, and then prove that normality forces R to be diagonal. A direct 2×2 computation shows a normal upper triangular matrix cannot have a nonzero superdiagonal entry. The general proof uses induction on n with a block decomposition R = [[a, W*],[0, R̃]]; matching the (1,1) entries of RR* and R*R forces W = 0, and induction makes R̃ diagonal. Therefore U*AU is diagonal, so A is unitarily diagonalizable.
Why does unitary diagonalizability automatically imply normality?
How does Schur decomposition reduce the problem for a normal matrix A?
What does the 2×2 case prove about normal upper triangular matrices?
In the induction step, how does normality force the off-diagonal block W to vanish?
Why does the induction hypothesis finish the proof once W = 0?
Review Questions
- What algebraic property of diagonal matrices makes the implication “unitarily diagonalizable ⇒ normal” immediate?
- How does the proof use the (1,1) entry of RR* and R*R to eliminate off-diagonal entries in both the 2×2 and general n×n cases?
- Why does normality of A imply normality of the Schur form R = U*AU?
Key Points
- 1
Unitary diagonalizability means U*AU = D for diagonal D, and diagonal matrices satisfy D*D = DD*, which forces A*A = AA*.
- 2
Schur decomposition turns any complex square matrix A into U*AU = R with R upper triangular using a unitary U.
- 3
If A is normal, then the Schur form R must also be normal because the unitary similarity preserves the equality A*A = AA*.
- 4
A normal 2×2 upper triangular matrix must be diagonal: normality forces the single superdiagonal entry to be zero.
- 5
For an (n+1)×(n+1) normal upper triangular matrix written as [[a, W*],[0, R̃]], matching the (1,1) entries of RR* and R*R forces W = 0.
- 6
After W vanishes, the induction hypothesis applies to the remaining upper triangular block R̃, making it diagonal and hence making R diagonal.
- 7
Since the Schur form of a normal matrix is diagonal, the original matrix is unitarily diagonalizable, completing the spectral theorem for normal matrices.