Distributions 10 | Distributional Derivative
Based on The Bright Side of Mathematics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Distributional derivatives are defined via duality: derivatives are transferred from the distribution onto test functions using integration-by-parts logic.
Briefing
Distributional derivatives turn differentiation into an operation that always exists for generalized functions, even when classical derivatives fail at jumps or singularities. Starting from the duality pairing between a distribution T and a smooth test function Φ, differentiation is defined by shifting derivatives off T and onto Φ via integration by parts—producing a new distribution that remains well-defined for every T in D′.
For smooth functions, this definition matches the classical derivative. If f is continuously differentiable (f ∈ C¹) and T_f denotes the regular distribution associated with f, then applying the distributional derivative to a test function gives an ordinary integral of f′(x)Φ(x). Integration by parts moves the derivative from f′ onto Φ, introduces a minus sign, and boundary terms vanish because test functions have compact support (so Φ is zero outside a large interval). The result is exactly the distribution associated with f′. In higher dimensions, the same mechanism extends to partial derivatives: for a multi-index α, the distributional partial derivative ∂^α T is defined so that ⟨∂^α T, Φ⟩ equals (−1)^|α| ⟨T, ∂^α Φ⟩, where |α| counts how many partial derivatives are applied.
The payoff is that distributional derivatives exist without requiring pointwise differentiability. Classical differentiation demands that derivatives exist as functions, but distributional differentiation only requires that T act continuously and linearly on test functions. That means one can differentiate objects like the Heaviside step function, which has a jump at 0 and therefore lacks a classical derivative there. Using the definition with T = Θ (the heavy-side/Heaviside distribution), the pairing ⟨∂Θ, Φ⟩ reduces to an integral involving Φ′. Because Θ(x)=0 for x<0 and Θ(x)=1 for x>0, the integral collapses to the positive side and the fundamental theorem of calculus yields −Φ(0). This matches the defining action of the Dirac delta distribution δ, so the distributional derivative of the Heaviside function is the Dirac delta: ∂Θ = δ.
A second example shows how singularities behave under differentiation. Differentiating the Dirac delta distribution itself produces another distribution that is not regular: ⟨∂δ, Φ⟩ = −Φ′(0). In other words, the derivative of δ is again well-defined in D′, but it no longer corresponds to an ordinary function. The trade-off is clear: distributional derivatives always exist, yet the resulting object may become “less regular” than the original.
Overall, the core insight is operational: differentiation is redefined through duality and integration by parts so that derivatives can be computed for any distribution, and classical derivatives are recovered automatically when the underlying object is smooth enough.
Cornell Notes
Distributional derivatives redefine differentiation using duality: a distribution T is differentiated by transferring derivatives from T onto a test function Φ, with a sign determined by the order of differentiation. For a multi-index α, the rule is ⟨∂^α T, Φ⟩ = (−1)^|α| ⟨T, ∂^α Φ⟩, which guarantees the result is again a distribution. When T comes from a smooth function f, this construction reproduces the classical partial derivatives of f. The method also works for non-smooth and singular objects: the Heaviside step function’s distributional derivative equals the Dirac delta, and differentiating δ yields a new (non-regular) distribution acting as −Φ′(0).
How does the definition of a distributional derivative avoid needing classical derivatives of T?
Why do boundary terms disappear when deriving the sign rule (−1)^|α|?
What is the distributional derivative of the Heaviside step function Θ?
How does differentiating the Dirac delta δ change the type of object?
When do distributional derivatives coincide with classical derivatives?
Review Questions
- State the general formula for the distributional partial derivative ∂^α T in terms of the pairing with a test function Φ.
- Why does the distributional derivative of the Heaviside step function produce the Dirac delta? Outline the key steps in the pairing calculation.
- What does ⟨∂δ, Φ⟩ equal, and why does this show the derivative of δ is not a regular distribution?
Key Points
- 1
Distributional derivatives are defined via duality: derivatives are transferred from the distribution onto test functions using integration-by-parts logic.
- 2
For a multi-index α, the sign in the definition is (−1)^|α| because each derivative transfer introduces a minus sign.
- 3
Compact support of test functions eliminates boundary terms, making the definition consistent and well-defined.
- 4
For regular distributions associated with smooth functions, distributional derivatives match classical derivatives.
- 5
The Heaviside step function Θ has no classical derivative at the jump, but its distributional derivative exists and equals the Dirac delta δ.
- 6
Differentiating δ yields a new, non-regular distribution acting as −Φ′(0), illustrating that regularity can decrease under distributional differentiation.