Get AI summaries of any video or article — Sign up free
Multivariable Calculus 15 | Multi-Index Notation [dark version] thumbnail

Multivariable Calculus 15 | Multi-Index Notation [dark version]

4 min read

Based on The Bright Side of Mathematics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

A multi-index α = (α_1, …, α_n) is an n-tuple of nonnegative integers that encodes how many times to differentiate with respect to each variable x_i.

Briefing

Multi-index notation streamlines multivariable partial derivatives by encoding both the differentiation order and which variables are involved into a single object. Instead of writing repeated partial derivatives with explicit exponents and variable order, an n-dimensional multi-index α (an n-tuple of nonnegative integers) packages the counts of derivatives with respect to each coordinate. The payoff is compact formulas that scale cleanly to higher-order derivatives—especially when building Taylor polynomials in several variables.

The starting point is the familiar partial-derivative notation: a derivative like ∂^{(something)}/∂x_1^{(…)}∂x_2^{(…)}… indicates how many times each variable is differentiated, and evaluation is shown with parentheses. When the function f is sufficiently smooth, Schwartz’s theorem guarantees that mixed partial derivatives commute, meaning the order of differentiation does not change the result. That symmetry motivates a notation that doesn’t track the order of operations—only how many times each variable is differentiated.

A multi-index α = (α_1, …, α_n) is defined as an n-dimensional vector whose components are natural numbers, with 0 allowed. Two basic derived quantities drive the notation. First, the “length” or absolute value of α is |α| = α_1 + … + α_n, which directly gives the total derivative order. Second, α is used to form monomials: for a vector x = (x_1, …, x_n), x^α means x_1^{α_1} x_2^{α_2} … x_n^{α_n}. This turns multi-index notation into a compact way to write multivariable polynomials.

Factorials also generalize componentwise: α! is defined as α_1! α_2! … α_n!. With this, generalized binomial coefficients can be defined for multi-indices of the same dimension, using (α choose β) = α! / ( (α−β)! β! ), where α−β is computed component by component. Although binomial coefficients are not needed immediately later, the framework is laid out for future combinatorial identities.

The central operator notation is D^α f, where D^α denotes the mixed partial derivative determined by α. Concretely, D^α f corresponds to taking ∂^{|α|}f / (∂x_1^{α_1} ∂x_2^{α_2} … ∂x_n^{α_n}). The total order is |α|, so the multi-index automatically encodes the derivative’s degree.

Examples with n = 3 make the mechanics clear. If α = (1,0,0), then |α| = 1, α! = 1, and D^α f reduces to ∂f/∂x_1. If α = (1,2,1), then |α| = 4 and α! = 1!·2!·1! = 2, and D^α f becomes the fourth-order mixed derivative with one derivative in x_1, two in x_2, and one in x_3. This compact bookkeeping is positioned as the key step toward writing multivariable Taylor polynomials in a form that mirrors the one-dimensional case.

Cornell Notes

Multi-index notation replaces long mixed-partial derivative expressions with a single n-tuple α = (α_1, …, α_n) of nonnegative integers. The total derivative order is |α| = α_1 + … + α_n, and α also determines which variables are differentiated through D^α f = ∂^{|α|}f / (∂x_1^{α_1} … ∂x_n^{α_n}). The same α is used to write monomials x^α = x_1^{α_1} … x_n^{α_n} and to define multi-index factorials α! = α_1! … α_n!. With Schwartz’s theorem ensuring mixed partials commute for sufficiently smooth functions, the notation can ignore differentiation order and focus only on counts. This sets up cleaner formulas for multivariable Taylor polynomials.

What exactly is a multi-index α, and why does it matter for partial derivatives?

A multi-index α is an n-dimensional tuple α = (α_1, …, α_n) where each component is a natural number and 0 is allowed. It matters because each α_i tells how many times to differentiate with respect to x_i. That lets one object encode both the variables involved and the total differentiation counts, avoiding lengthy repeated-derivative notation.

How does |α| determine the order of the derivative D^α f?

The absolute value (length) of α is defined as |α| = α_1 + … + α_n. In the operator D^α f, this quantity becomes the total order of the mixed partial derivative: D^α f = ∂^{|α|}f / (∂x_1^{α_1} … ∂x_n^{α_n}). For example, with α = (1,2,1), |α| = 4, so D^α f is a fourth-order derivative.

How are monomials x^α and multi-index factorials α! defined, and how do they connect to Taylor expansions?

For x = (x_1, …, x_n), the monomial x^α is x^α = x_1^{α_1} x_2^{α_2} … x_n^{α_n}. The multi-index factorial is α! = α_1! α_2! … α_n! (with 0! = 1). These definitions mirror how coefficients in multivariable Taylor polynomials are organized: powers of (x−a) are written as (x−a)^α and denominators use α!.

What role does Schwartz’s theorem play in motivating multi-index notation?

Schwartz’s theorem says mixed partial derivatives commute when f has sufficiently many continuous higher-order partial derivatives. That means the order of differentiation doesn’t change the result, so notation doesn’t need to track the sequence of ∂/∂x_1 and ∂/∂x_2 operations. Multi-index notation captures only the counts (α_1, α_2, …), not the order.

How do generalized binomial coefficients for multi-indices work?

For multi-indices α and β of the same dimension, the generalized binomial coefficient is (α choose β) = α! / ((α−β)! β!). The subtraction α−β is done componentwise, so (α−β)_i = α_i − β_i. The formula uses the same multi-index factorial definition in numerator and denominator.

Can you compute D^α f for a specific α in the n = 3 examples?

Yes. If α = (1,0,0), then |α| = 1 and D^α f = ∂f/∂x_1 (since differentiation with respect to x_2 and x_3 happens 0 times). If α = (1,2,1), then |α| = 4 and D^α f corresponds to one derivative in x_1, two in x_2, and one in x_3: ∂^4 f / (∂x_1 ∂x_2^2 ∂x_3).

Review Questions

  1. Given α = (0,3,2) in three variables, what are |α| and the explicit form of D^α f?
  2. How do the definitions of x^α and α! differ, and why is 0! = 1 important?
  3. Explain how Schwartz’s theorem justifies ignoring the order of mixed partial derivatives when using D^α notation.

Key Points

  1. 1

    A multi-index α = (α_1, …, α_n) is an n-tuple of nonnegative integers that encodes how many times to differentiate with respect to each variable x_i.

  2. 2

    The total derivative order in D^α f is |α| = α_1 + … + α_n.

  3. 3

    Monomials are compactly written as x^α = x_1^{α_1} … x_n^{α_n}, using the same multi-index α.

  4. 4

    Multi-index factorials are defined componentwise as α! = α_1! … α_n!, with 0! = 1.

  5. 5

    For sufficiently smooth functions, Schwartz’s theorem ensures mixed partial derivatives commute, so notation can ignore differentiation order.

  6. 6

    The operator D^α f is defined as ∂^{|α|}f / (∂x_1^{α_1} … ∂x_n^{α_n}).

  7. 7

    Generalized binomial coefficients for multi-indices use (α choose β) = α! / ((α−β)! β!) with componentwise subtraction.

Highlights

Multi-index notation packs both “which variables” and “how many times” into one object α, eliminating long mixed-derivative strings.
Schwartz’s theorem is the reason differentiation order can be dropped, making D^α depend only on counts α_i.
The same α drives three parallel constructions: derivative operators D^α f, monomials x^α, and factorials α!.
An example like α = (1,2,1) immediately yields a fourth-order mixed derivative with one x_1 derivative, two x_2 derivatives, and one x_3 derivative.

Topics