Confidence Intervals for the Slope of a Regression Model — AP Statistics
1. Core Concepts & Conditions for Inference ★★☆☆☆ ⏱ 4 min
When we fit a least squares regression line to sample bivariate data, we calculate a sample slope $b$, which is a point estimate of the true unknown population slope $\beta$, the parameter describing the linear relationship between $x$ and $y$ for the entire population. A confidence interval gives a range of plausible values for $\beta$, accounting for sampling variability.
- **Linear**: The true relationship between $x$ and $y$ is linear; check for no curvature in residual or scatter plots.
- **Independent**: Observations are independent of each other; confirmed by random sampling/experiment, 10% condition applies if sampling without replacement.
- **Normal**: Residuals are normally distributed with mean 0 at all values of $x$; check with a roughly linear normal probability plot of residuals.
- **Equal Variance (Homoscedasticity)**: The standard deviation of residuals is constant across all values of $x$; check for no fanning or uneven spread in residual plots.
Exam tip: On the AP exam, you must name and check every condition in the context of the problem to earn full credit. Only listing the L.I.N.E. acronym without context-specific checking will earn zero points.
2. Calculating the Confidence Interval for the Slope ★★★☆☆ ⏱ 5 min
A confidence interval for the true population slope $\beta$ follows the same general structure as any confidence interval: point estimate ± critical value × standard error of the estimate. For regression slopes, we use the t-distribution for the critical value because we never know the population standard deviation of the residuals, so we estimate it from sample data.
b \pm t^* \times SE_b
- $b$ = sample slope from the least squares regression fit
- $t^*$ = critical t-value for the desired confidence level, with degrees of freedom $df = n - 2$ (we subtract 2 because we estimate two population parameters: intercept $\alpha$ and slope $\beta$)
- $SE_b$ = standard error of the sample slope, which measures sampling variability of $b$; almost always given directly in regression output on AP exam questions
If you do need to calculate $SE_b$ by hand, the formula is $SE_b = \frac{s_e}{s_x \sqrt{n-1}}$, where $s_e$ is the standard deviation of residuals and $s_x$ is the standard deviation of the explanatory variable $x$.
Exam tip: When reading regression output, always mark the slope row for your explanatory variable immediately after reading the question. Many students accidentally use the standard error of the intercept instead of the slope, costing easy points on the exam.
3. Interpreting the Interval & Inference Connection ★★★☆☆ ⏱ 3 min
Interpretation of a confidence interval for the slope is one of the most frequently tested skills on the AP exam, and requires two key components for full credit: a correct statement of confidence, and context-specific description of what the slope means.
A key inference connection: If the confidence interval does not contain 0, that means 0 is not a plausible value for the true slope $\beta$. At the significance level $\alpha = 1 - C$ (e.g., $\alpha = 0.05$ for 95% confidence), this means we have statistically significant evidence of a linear relationship between $x$ and $y$. If the interval does contain 0, 0 is a plausible value for $\beta$, so we do not have significant evidence of a linear relationship.
4. Exam-Style Concept Check ★★★★☆ ⏱ 2 min
Common Pitfalls
Why: Students confuse regression inference with one-sample t-intervals for means, where $df = n-1$, or default to z out of habit from proportion inference.
Why: Outputs list standard error for both terms, and students often scan the wrong row when working quickly on the exam.
Why: Students confuse the known sample slope with the unknown population slope we are trying to estimate.
Why: Students remember the slope describes change in y, but forget that regression models the *mean* y at each value of x.
Why: Students confuse "no evidence of a relationship" with "evidence of no relationship".
Why: Students memorize the acronym and forget that AP requires context-specific checking to earn credit.