Probability Theory 2 | Probability Measures [dark version]
Based on The Bright Side of Mathematics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
A probability measure P assigns probabilities to events (subsets of Ω) in a way that always lands in [0,1].
Briefing
Probability measures formalize randomness by assigning probabilities to “events” in a way that behaves like area: the total probability of all outcomes is 1, impossible events have probability 0, and probabilities add up consistently for disjoint events. The core setup starts with a sample space Ω, the set of all possible outcomes of a random experiment, and then chooses a collection of subsets of Ω—called a σ-algebra A—so that the probability function can be defined without contradictions.
A probability measure P is a map from events (elements of A) to numbers in the interval [0,1]. Two boundary conditions anchor the definition: P(Ω)=1 and P(∅)=0. The additivity rule then handles how probabilities combine. If A and B are disjoint (their intersection is empty), then P(A∪B)=P(A)+P(B). The definition extends beyond two sets: for countably many pairwise disjoint events {A_i}, the probability of their union equals the sum of their probabilities. This “countable additivity” is what makes the framework work not just for finite partitions, but also for limits of increasingly fine event collections.
To make these rules meaningful, the domain of P must be restricted to a σ-algebra A. A σ-algebra is a family of subsets of Ω that is closed under three operations: it contains the empty set and the whole space (implicitly via the usual σ-algebra requirements), it is closed under complements (if A is an event, then Ω\A is also an event), and it is closed under countable unions (if A_1, A_2, … are events, then their union is also an event). In probability language, the elements of a σ-algebra are exactly the events whose probabilities are allowed to be defined.
The transcript illustrates the machinery with a single die. The sample space Ω is {1,2,3,4,5,6}. For a finite sample space, a natural choice is to take the σ-algebra A as the power set of Ω, meaning every subset is an event. With the usual “fair die” model, each outcome has probability 1/6, so P({2})=1/6. The event “even number” corresponds to {2,4,6}, giving P(even)=3/6=1/2. This example shows how probability measures turn everyday counting into a consistent rule system.
A closing exercise ties the definition together: using P(Ω)=1 and the additivity/complement structure, one can prove that for any event A, the probability of its complement satisfies P(A^c)=1−P(A). That identity is presented as a direct consequence of the probability measure axioms and becomes a recurring tool in later examples.
Cornell Notes
A probability measure assigns probabilities to events in a way that mirrors consistent “area accounting.” Start with a sample space Ω of all outcomes, then choose a σ-algebra A (a collection of subsets closed under complements and countable unions) so probabilities are defined for the events of interest. A probability measure P maps events in A to values in [0,1], with P(Ω)=1 and P(∅)=0, and it satisfies countable additivity: probabilities of pairwise disjoint events add up to the probability of their union. For a fair die, Ω={1,…,6} and taking A as the power set lets every subset be an event; for example, P({2})=1/6 and P(even)=1/2. A key consequence is P(A^c)=1−P(A).
Why can’t probabilities be assigned to arbitrary subsets of Ω without extra structure?
What exactly are the axioms of a probability measure P?
How does disjointness connect to adding probabilities?
In the die example, why is the σ-algebra taken as the power set?
How do you compute P(even) for a fair die using the probability measure idea?
What is the complement rule and why does it follow from the axioms?
Review Questions
- What properties must a collection of subsets A satisfy to qualify as a σ-algebra on Ω?
- State the countable additivity (σ-additivity) condition for a probability measure and explain the role of pairwise disjointness.
- Using only the probability measure axioms, derive the identity P(A^c)=1−P(A).
Key Points
- 1
A probability measure P assigns probabilities to events (subsets of Ω) in a way that always lands in [0,1].
- 2
The total probability is fixed by normalization: P(Ω)=1 and P(∅)=0.
- 3
Countable additivity is the central rule: disjoint events’ probabilities sum to the probability of their union.
- 4
A σ-algebra A is the required domain for events; it must be closed under complements and countable unions.
- 5
For a fair die, Ω={1,…,6} and choosing A as the power set lets every subset be an event.
- 6
With a fair die, P({2})=1/6 and P(even)=3/6=1/2 by additivity over disjoint outcomes.
- 7
The complement identity P(A^c)=1−P(A) follows from the disjoint union Ω=A∪A^c and P(Ω)=1.