Hilbert Spaces 5 | Proof of Jordan-von Neumann Theorem [dark version]

TL;DR

Jordan–von Neumann’s theorem characterizes inner-product norms purely by the parallelogram law: a norm comes from an inner product iff the parallelogram identity holds for all vectors.

Briefing Cornell Notes

Briefing

A normed space becomes a genuine Hilbert space exactly when its norm obeys the parallelogram law. That criterion—Jordan–von Neumann’s theorem—turns a geometric identity about lengths into the algebraic structure of an inner product. The proof strategy is to start with the norm and *define* a candidate inner product from it, then verify that this candidate satisfies the inner-product axioms.

The parallelogram law says that for every pair of vectors X and Y, the quantity ‖X+Y‖² + ‖X−Y‖² is always the same as 2‖X‖² + 2‖Y‖². If this identity holds universally, then one can reconstruct an inner product whose induced norm matches the original norm: taking ⟨X,X⟩ and square-rooting must return ‖X‖. In the real case, the polarization identity provides the only plausible formula for such an inner product, so the remaining task is not guessing—it’s proving that the formula actually works.

The construction uses the norm to define ⟨W,Z⟩ via a polarization-type expression (in the real setting, no complex conjugation is needed). Three inner-product properties must be checked: positive definiteness, symmetry, and linearity in the second argument. Positive definiteness is straightforward: plugging the same vector into both slots makes the formula collapse to ‖W‖² (up to the correct factor), so ⟨W,W⟩=0 forces W to be the zero vector. Symmetry is equally direct in the real case because swapping the two arguments does not change the norm-based expression.

Linearity is where the parallelogram law earns its keep. To prove linearity in the second argument, the proof repeatedly rewrites differences of norm-squares into sums so the parallelogram law can be applied. For example, to handle expressions involving ‖W+Z‖²−‖W‖², the argument artificially adds and subtracts the missing term ‖W‖² so each piece can be matched to the parallelogram pattern. Solving small “systems” of vector substitutions (effectively choosing X and Y so that X±Y becomes the needed combinations) lets the parallelogram law be used twice, after which cancellation removes the extra terms.

This yields a key scaling rule: factors of 1/2 can be pulled out of the second argument, i.e., ⟨W, (1/2)Z⟩ relates cleanly to ⟨W,Z⟩. By iterating the same reasoning, the proof extends this to factors (1/2)ⁿ for natural n. Next comes additivity in the second argument: ⟨W, Z+Ẑ⟩ becomes ⟨W,Z⟩+⟨W,Ẑ⟩ through another carefully engineered decomposition that again turns the needed expression into a form where the parallelogram law applies, with cancellation leaving exactly the desired sum.

From additivity, homogeneity for integers follows (pulling out 2, then any natural number K). The remaining real scalars are handled by approximation using rationals of the form K/2ⁿ, and then extending to all real numbers via continuity of the norm. With linearity established, the constructed inner product is valid, and the Jordan–von Neumann theorem is complete: a normed space is an inner product space if and only if the parallelogram law holds. The complex case follows the same blueprint with additional terms in the polarization identity.

In short: the parallelogram law is not just a curiosity—it’s the exact algebraic fingerprint that lets length data be upgraded into an inner product, which is why Hilbert spaces can be characterized purely by norm geometry.

Cornell Notes

Jordan–von Neumann’s theorem gives a precise test for when a norm comes from an inner product: a normed space is an inner product space exactly when its norm satisfies the parallelogram law. The proof defines a candidate inner product from the norm using the polarization identity (real case, so no complex conjugation). It then verifies the inner-product axioms: ⟨X,X⟩ reproduces ‖X‖² (positive definiteness) and the formula is symmetric. The hard part is linearity in the second argument, proved by repeatedly rewriting norm-squared differences into norm-squared sums so the parallelogram law can be applied, leading first to scaling by 1/2 and then to additivity. Integer homogeneity comes from additivity, and all real scalars follow by approximating with K/2ⁿ and using continuity.

What does the parallelogram law guarantee, and why is it the “gate” to an inner product?

It guarantees that the norm behaves like one induced by an inner product. Concretely, for all vectors X and Y, the identity ‖X+Y‖² + ‖X−Y‖² = 2‖X‖² + 2‖Y‖² holds. That universal geometric constraint is exactly what makes it possible to reconstruct an inner product from the norm so that ⟨X,X⟩ = ‖X‖² and the full inner-product structure (including linearity) can be derived.

How does the proof get positive definiteness without heavy algebra?

By evaluating the constructed inner product at ⟨X,X⟩. The polarization-type formula collapses so that the result becomes exactly ‖X‖² (with the correct factor), meaning ⟨X,X⟩ = 0 implies ‖X‖ = 0. Since norms vanish only at the zero vector, this forces X = 0, establishing positive definiteness.

Why is linearity in the second argument the difficult step?

Because the parallelogram law applies to expressions involving sums like ‖A+B‖² and ‖A−B‖², but linearity calculations naturally produce differences such as ‖W+Z‖² − ‖W‖². The proof fixes this by adding and subtracting the “missing” norm-squared terms so each part can be matched to the parallelogram-law pattern, enabling two applications of the identity and then cancellation of extra terms.

What is the significance of proving the scaling rule for factors of 1/2?

It’s the first concrete homogeneity result. After rewriting expressions to use the parallelogram law twice and canceling terms, the argument shows that the inner product behaves correctly when the second argument is multiplied by 1/2. Iterating the same method yields homogeneity for (1/2)ⁿ, which then supports approximation of general real scalars.

How does the proof extend homogeneity from rationals of the form K/2ⁿ to all real numbers?

It first proves homogeneity for integers K using additivity (e.g., pulling out 2, then any natural number by induction). Then it combines this with the (1/2)ⁿ scaling to get homogeneity for rationals K/2ⁿ. Finally, it uses density/approximation of real numbers by such rationals and the continuity of the norm to pass the property to every real scalar λ.

Review Questions

State the parallelogram law and explain how it relates to the existence of an inner product.
In the real case, why does symmetry of the constructed inner product become relatively easy?
Outline the main technique used to prove linearity in the second argument using the parallelogram law.

Key Points

1
Jordan–von Neumann’s theorem characterizes inner-product norms purely by the parallelogram law: a norm comes from an inner product iff the parallelogram identity holds for all vectors.
2
Given a norm satisfying the parallelogram law, a polarization-type formula reconstructs an inner product whose induced norm matches the original norm via ⟨X,X⟩ = ‖X‖².
3
Positive definiteness follows quickly by substituting the same vector into both arguments, forcing ⟨X,X⟩=0 only when X is the zero vector.
4
Symmetry is straightforward in the real case because the norm-based expression is unchanged when swapping the two arguments.
5
Linearity in the second argument is proved by converting norm-squared differences into sums so the parallelogram law can be applied, then using cancellation to simplify.
6
The proof first establishes homogeneity for factors (1/2)ⁿ, then additivity, then integer homogeneity, and finally extends to all real scalars by approximation and continuity.
7
The complex case follows the same overall plan but requires additional terms from the complex polarization identity.

Highlights

The parallelogram law is the exact condition needed to upgrade a norm into an inner product structure.

The proof’s core move is rewriting expressions so the parallelogram law can be applied—even when linearity computations initially produce differences of norm squares.

Once scaling by 1/2 is secured, iteration and additivity build homogeneity for rationals K/2ⁿ, and continuity finishes the extension to all real scalars.

Topics

Jordan–von Neumann Theorem
Parallelogram Law
Polarization Identity
Inner Product Axioms
Hilbert Spaces