Get AI summaries of any video or article — Sign up free
Probability Theory 2 | Probability Measures [dark version] thumbnail

Probability Theory 2 | Probability Measures [dark version]

4 min read

Based on The Bright Side of Mathematics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

A probability measure P assigns probabilities to events (subsets of Ω) in a way that always lands in [0,1].

Briefing

Probability measures formalize randomness by assigning probabilities to “events” in a way that behaves like area: the total probability of all outcomes is 1, impossible events have probability 0, and probabilities add up consistently for disjoint events. The core setup starts with a sample space Ω, the set of all possible outcomes of a random experiment, and then chooses a collection of subsets of Ω—called a σ-algebra A—so that the probability function can be defined without contradictions.

A probability measure P is a map from events (elements of A) to numbers in the interval [0,1]. Two boundary conditions anchor the definition: P(Ω)=1 and P(∅)=0. The additivity rule then handles how probabilities combine. If A and B are disjoint (their intersection is empty), then P(A∪B)=P(A)+P(B). The definition extends beyond two sets: for countably many pairwise disjoint events {A_i}, the probability of their union equals the sum of their probabilities. This “countable additivity” is what makes the framework work not just for finite partitions, but also for limits of increasingly fine event collections.

To make these rules meaningful, the domain of P must be restricted to a σ-algebra A. A σ-algebra is a family of subsets of Ω that is closed under three operations: it contains the empty set and the whole space (implicitly via the usual σ-algebra requirements), it is closed under complements (if A is an event, then Ω\A is also an event), and it is closed under countable unions (if A_1, A_2, … are events, then their union is also an event). In probability language, the elements of a σ-algebra are exactly the events whose probabilities are allowed to be defined.

The transcript illustrates the machinery with a single die. The sample space Ω is {1,2,3,4,5,6}. For a finite sample space, a natural choice is to take the σ-algebra A as the power set of Ω, meaning every subset is an event. With the usual “fair die” model, each outcome has probability 1/6, so P({2})=1/6. The event “even number” corresponds to {2,4,6}, giving P(even)=3/6=1/2. This example shows how probability measures turn everyday counting into a consistent rule system.

A closing exercise ties the definition together: using P(Ω)=1 and the additivity/complement structure, one can prove that for any event A, the probability of its complement satisfies P(A^c)=1−P(A). That identity is presented as a direct consequence of the probability measure axioms and becomes a recurring tool in later examples.

Cornell Notes

A probability measure assigns probabilities to events in a way that mirrors consistent “area accounting.” Start with a sample space Ω of all outcomes, then choose a σ-algebra A (a collection of subsets closed under complements and countable unions) so probabilities are defined for the events of interest. A probability measure P maps events in A to values in [0,1], with P(Ω)=1 and P(∅)=0, and it satisfies countable additivity: probabilities of pairwise disjoint events add up to the probability of their union. For a fair die, Ω={1,…,6} and taking A as the power set lets every subset be an event; for example, P({2})=1/6 and P(even)=1/2. A key consequence is P(A^c)=1−P(A).

Why can’t probabilities be assigned to arbitrary subsets of Ω without extra structure?

Probabilities must obey rules like complement and countable additivity. To guarantee those rules stay consistent, the allowed subsets are restricted to a σ-algebra A. A σ-algebra is closed under complements (if A is allowed, then Ω\A is allowed) and under countable unions (if A1, A2, … are allowed, then ⋃i Ai is allowed). This closure ensures P can be defined on all events needed for the probability axioms.

What exactly are the axioms of a probability measure P?

A probability measure P is a function from events A∈A to numbers in [0,1]. It satisfies: (1) normalization: P(Ω)=1 and P(∅)=0; and (2) σ-additivity (countable additivity): for pairwise disjoint events A1, A2, …, P(⋃i Ai)=Σi P(Ai). The disjointness condition means Ai∩Aj=∅ for i≠j.

How does disjointness connect to adding probabilities?

If two events A and B do not overlap (A∩B=∅), then the outcomes in A and the outcomes in B are separate. The probability of “A or B” becomes the sum of their probabilities: P(A∪B)=P(A)+P(B). The same idea extends to countably many pairwise disjoint events via σ-additivity.

In the die example, why is the σ-algebra taken as the power set?

For a finite sample space Ω={1,2,3,4,5,6}, it’s convenient to allow every subset as an event. The power set 𝒫(Ω) contains all subsets, so any event like “throw an even number” or “throw a 2” is included. The transcript notes this approach works well for finite Ω; for infinite sample spaces, a different (typically smaller) σ-algebra may be needed.

How do you compute P(even) for a fair die using the probability measure idea?

The event “even number” is the subset {2,4,6}. With a fair die, each singleton outcome has probability 1/6. Since {2}, {4}, and {6} are disjoint, additivity gives P(even)=P({2})+P({4})+P({6})=1/6+1/6+1/6=3/6=1/2.

What is the complement rule and why does it follow from the axioms?

For any event A, the complement A^c=Ω\A represents “not A.” The rule is P(A^c)=1−P(A). It follows because Ω is the disjoint union of A and A^c, so countable additivity (applied to this two-set disjoint union) gives P(Ω)=P(A)+P(A^c). Since P(Ω)=1, rearranging yields P(A^c)=1−P(A).

Review Questions

  1. What properties must a collection of subsets A satisfy to qualify as a σ-algebra on Ω?
  2. State the countable additivity (σ-additivity) condition for a probability measure and explain the role of pairwise disjointness.
  3. Using only the probability measure axioms, derive the identity P(A^c)=1−P(A).

Key Points

  1. 1

    A probability measure P assigns probabilities to events (subsets of Ω) in a way that always lands in [0,1].

  2. 2

    The total probability is fixed by normalization: P(Ω)=1 and P(∅)=0.

  3. 3

    Countable additivity is the central rule: disjoint events’ probabilities sum to the probability of their union.

  4. 4

    A σ-algebra A is the required domain for events; it must be closed under complements and countable unions.

  5. 5

    For a fair die, Ω={1,…,6} and choosing A as the power set lets every subset be an event.

  6. 6

    With a fair die, P({2})=1/6 and P(even)=3/6=1/2 by additivity over disjoint outcomes.

  7. 7

    The complement identity P(A^c)=1−P(A) follows from the disjoint union Ω=A∪A^c and P(Ω)=1.

Highlights

Probability measures turn randomness into a function P: events → [0,1] with P(Ω)=1 and P(∅)=0.
σ-additivity extends “add probabilities for disjoint events” from finite unions to countable unions.
A σ-algebra is the gatekeeper for which subsets are legitimate events: closed under complements and countable unions.
For a fair die, the event “even” corresponds to {2,4,6}, giving probability 3/6=1/2.
The complement rule P(A^c)=1−P(A) drops out directly from P(Ω)=P(A)+P(A^c).

Topics

  • Probability Measures
  • Sample Space
  • Sigma Algebra
  • Sigma Additivity
  • Complement Rule

Mentioned

  • P