Session 44 - Confidence Intervals | DSMP 2023
Based on CampusX's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Briefing
Confidence intervals are presented as the practical fix for a simple problem: a single sample statistic (like the sample mean) can’t reliably pin down an unknown population parameter. Instead of claiming a precise number, confidence intervals give a range that is tied to a confidence level—most commonly 95%—so the unknown population mean is expected to fall inside that range in repeated sampling. The session builds this idea from first principles, starting with population vs. sample, then moving through point estimation, and finally deriving how the interval is calculated.
The walkthrough begins with core statistical foundations. Population is the full set of interest (e.g., all students), while a sample is a random subset used because studying everyone is impossible. Parameters describe population quantities (often denoted with Greek letters), while statistics describe sample quantities. Since parameters are typically unknown, inference uses sample data to estimate them. This naturally leads to inferential statistics, where the goal is to make predictions or decisions about population characteristics using limited samples.
From there, a “point estimate” is introduced as a single-number summary of the population parameter, most often the sample mean. The instructor uses a cricket analogy: if someone is paid only when they predict the exact final score, they will almost never be paid, because the true outcome varies from match to match. Confidence intervals replace that unrealistic exactness with a probabilistic range.
The session then defines confidence intervals as ranges constructed around the point estimate using a margin of error. The margin of error depends on (1) the confidence level (through a critical value), (2) the population standard deviation (or an estimate of it), and (3) the sample size. The confidence level is clarified to avoid a common misunderstanding: it does not mean the population mean has a 95% probability of lying in the interval for a single computed interval. Instead, it means that if the sampling-and-interval-building process were repeated many times, about 95% of the constructed intervals would contain the true population parameter.
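The construction above can be sketched in a few lines of Python using the standard library's `NormalDist` for the critical value. The sample mean, population standard deviation, and sample size here are illustrative placeholders, not numbers from the session:

```python
import math
from statistics import NormalDist

# Illustrative inputs (not from the session): sample mean, known
# population standard deviation, sample size, and confidence level.
x_bar, sigma, n, conf = 65.0, 10.0, 100, 0.95

# Two-sided critical value z* for the chosen confidence level.
z_star = NormalDist().inv_cdf(1 - (1 - conf) / 2)

# Margin of error = z* * sigma / sqrt(n); interval = estimate ± margin.
moe = z_star * sigma / math.sqrt(n)
lower, upper = x_bar - moe, x_bar + moe
print(f"{conf:.0%} CI: ({lower:.2f}, {upper:.2f})")
```

Note that each of the three margin-of-error ingredients listed above appears explicitly: the confidence level enters through `z_star`, variability through `sigma`, and sample size through `math.sqrt(n)`.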
Two calculation procedures are taught. The z-procedure applies when the population standard deviation is known and the sampling assumptions hold; it uses the standard normal critical value (z) and typically relies on the Central Limit Theorem when sample size is large (the instructor emphasizes n ≥ 30). The t-procedure is introduced for the more realistic case where the population standard deviation is unknown. It replaces the z critical value with a t critical value from the t-distribution, which accounts for extra uncertainty via “degrees of freedom” (n − 1). As degrees of freedom increase, the t distribution approaches the normal distribution, making the t-procedure converge toward the z-procedure.
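The convergence of the t-procedure toward the z-procedure can be seen directly by comparing critical values. A small sketch, assuming `scipy` is available:

```python
from scipy import stats

# Two-sided 95% critical values: z* is fixed, while t* depends on
# degrees of freedom (n - 1) and shrinks toward z* as n grows.
z_star = stats.norm.ppf(0.975)
for n in (5, 10, 30, 100, 1000):
    t_star = stats.t.ppf(0.975, df=n - 1)
    print(f"n={n:5d}  t*={t_star:.3f}  z*={z_star:.3f}")
```

For very small samples the t critical value is noticeably larger than 1.96, which is exactly the "extra uncertainty" penalty for estimating the standard deviation from the sample; by n = 1000 the two are nearly indistinguishable.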
Finally, the session emphasizes how interval width changes with inputs: higher confidence levels widen intervals (larger margin of error), larger population variability widens intervals, and larger sample sizes narrow them—especially when moving from small to moderate sample sizes. A practical example with a YouTube-style dataset (CampusX subscribers) and a discussion of a prior Titanic-related implementation mistake reinforce the idea that the correct standard deviation must be used in the right place (sampling distribution vs. sample-level variability), otherwise the confidence interval coverage can be wrong. The class ends by previewing next steps: hypothesis testing, then regression and machine learning topics, with confidence intervals serving as a foundational tool for inference.
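The width behaviour described above follows directly from the margin-of-error formula; a minimal sketch with an assumed (illustrative) population standard deviation:

```python
import math
from statistics import NormalDist

sigma = 10.0  # assumed population standard deviation (illustrative)

def margin_of_error(conf, n):
    """Half-width of a z-based interval: z* * sigma / sqrt(n)."""
    z_star = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    return z_star * sigma / math.sqrt(n)

# Higher confidence widens the interval at a fixed sample size...
moe_90 = margin_of_error(0.90, 50)
moe_99 = margin_of_error(0.99, 50)

# ...while a larger sample narrows it, with diminishing returns:
# quadrupling n only halves the margin of error.
moe_n30 = margin_of_error(0.95, 30)
moe_n120 = margin_of_error(0.95, 120)
```

The square root in the denominator is what produces the diminishing returns: going from n = 30 to n = 120 (4x the data) halves the margin, and halving it again requires n = 480.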
Cornell Notes
Confidence intervals turn an uncertain point estimate (like a sample mean) into a range for an unknown population parameter. The range is built as: point estimate ± margin of error, where the margin of error depends on the confidence level, variability, and sample size. The confidence level (e.g., 95%) is interpreted through repeated sampling: if the same sampling process were repeated many times, about that fraction of constructed intervals would contain the true population parameter. Two procedures are used: the z-procedure when the population standard deviation is known (and assumptions support normality/CLT), and the t-procedure when it’s unknown, using the t-distribution with degrees of freedom (n − 1). Interval width grows with higher confidence and higher variability, and shrinks with larger sample sizes.
- What’s the difference between a population parameter and a sample statistic, and why does it matter for confidence intervals?
- Why does the confidence level not mean “the parameter has a 95% probability of being inside this one interval”?
- How do the z-procedure and t-procedure differ in practice?
- What determines the width of a confidence interval?
- What role do degrees of freedom (n − 1) play in the t-procedure?
- What kind of mistake can break confidence interval correctness in an applied example?
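The repeated-sampling interpretation can be checked empirically: draw many samples from a population with a known mean, build an interval from each, and count how often the true mean is captured. A minimal simulation with illustrative parameters:

```python
import math
import random
from statistics import NormalDist, mean

random.seed(0)
MU, SIGMA, N, CONF = 50.0, 8.0, 40, 0.95  # illustrative "true" population
z_star = NormalDist().inv_cdf(1 - (1 - CONF) / 2)

trials, hits = 2000, 0
for _ in range(trials):
    sample = [random.gauss(MU, SIGMA) for _ in range(N)]
    x_bar = mean(sample)
    moe = z_star * SIGMA / math.sqrt(N)
    if x_bar - moe <= MU <= x_bar + moe:
        hits += 1

coverage = hits / trials  # should land near 0.95
print(f"Coverage over {trials} intervals: {coverage:.3f}")
```

Each individual interval either contains MU or it doesn't; the 95% describes the long-run hit rate of the procedure, which is exactly what the counter measures.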
Review Questions
- When you compute a 95% confidence interval, what repeated-sampling statement does that 95% actually correspond to?
- Under what conditions should the z-procedure be used instead of the t-procedure?
- How does increasing sample size affect the margin of error, and why does the benefit taper off after moderate n?
Key Points
1. Confidence intervals replace single-number claims with a range around a point estimate using point estimate ± margin of error.
2. Confidence level (e.g., 95%) is a property of the interval-building procedure over repeated sampling, not a probability that the fixed true parameter lies inside one computed interval.
3. Population parameters are unknown in general; sample statistics are used to infer them via inferential statistics.
4. Use the z-procedure when the population standard deviation is known and normal/CLT assumptions are appropriate; use the t-procedure when it’s unknown.
5. The t-procedure widens intervals for small samples because the t critical value depends on degrees of freedom (n − 1).
6. Interval width increases with higher confidence level and higher variability, and decreases with larger sample size (with diminishing returns).
7. Applied implementations can fail if the standard deviation used in the formula doesn’t match the procedure’s assumptions (e.g., mixing sampling-distribution vs. sample-level variability).