Statistics · CED Unit 6: Inference for Categorical Data: Proportions · 14 min read · Updated 2026-05-11
Introducing Confidence Intervals for Proportions — AP Statistics
AP Statistics · CED Unit 6: Inference for Categorical Data: Proportions · 14 min read
1. Core Concepts and Notation★★☆☆☆⏱ 3 min
A confidence interval for a population proportion is an inferential method that produces a range of plausible values for an unknown fixed population proportion $p$, based on data collected from a random sample. Unlike a point estimate (a single guess for $p$), a confidence interval explicitly quantifies sampling variability, accounting for natural variation between different samples from the same population.
This is the first core inference topic for proportions in AP Statistics, and it makes up 12-15% of the total AP exam weight, appearing in both multiple-choice and free-response sections.
2. Confidence Interval Structure and Correct Interpretation★★☆☆☆⏱ 3 min
All confidence intervals follow the same core structure: a point estimate plus or minus a margin of error. The point estimate is your best single guess for the unknown population parameter, and the margin of error accounts for random sampling variability. For a population proportion, the general form is:
Where $z^*$ is the critical value from the standard normal distribution corresponding to your chosen confidence level. You must memorize the three most common $z^*$ values for the AP exam: 1.645 for 90% confidence, 1.96 for 95% confidence, and 2.576 for 99% confidence.
The confidence level describes the long-run behavior of the interval method, not a single calculated interval. A C% confidence level means that if you repeated the sampling process many times, C% of the resulting intervals would capture the true population proportion. The AP-expected phrasing for interpreting a single interval is: *We are C% confident that the true proportion of [context] is between lower bound and upper bound.*
Exam tip: Always explicitly distinguish between interpreting a confidence level (long-run method behavior) and interpreting a single confidence interval (plausible values for the population proportion).
3. Conditions for Inference★★★☆☆⏱ 3 min
Before you can reliably construct a confidence interval for a proportion, you must verify three conditions to ensure that the sampling distribution of $\hat{p}$ is approximately normal and your standard error calculation is valid. Skipping condition checks is one of the most common reasons for lost points on AP FRQs.
**Random**: The sample must be randomly selected from the population of interest, ensuring $\hat{p}$ is an unbiased estimator of $p$.
**Independent**: Individual observations must be independent. When sampling without replacement from a finite population, use the 10% condition: the sample size $n$ must be no more than 10% of the total population size $N$.
**Large Counts**: The sampling distribution of $\hat{p}$ is approximately normal only if we have at least 10 successes and 10 failures in the sample: $n\hat{p} \geq 10$ and $n(1-\hat{p}) \geq 10$.
4. Calculating a One-Proportion Z-Interval★★★☆☆⏱ 5 min
Once all conditions are confirmed to be met, you can calculate the confidence interval using the standard formula, then interpret the interval in context to earn full credit on FRQs.
The term $z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}$ is the margin of error ($ME$), which measures how far we expect $\hat{p}$ to be from the true $p$ at our chosen confidence level.
Common Pitfalls
Why: Students confuse the long-run behavior of the interval method with probability for a single fixed interval. The true $p$ is fixed, not random, so it is either in the interval or not.
Why: Students often dismiss the 10% condition as unimportant and skip it to save time on FRQs.
Why: Students remember $p(1-p)$ is maximized at 0.5 from sampling distribution topics and incorrectly use it for condition checks.
Why: Students rely on the empirical rule and use rounded values instead of the precise critical values AP expects.
Why: Students confuse the range of the sampling distribution of sample proportions with a confidence interval for the fixed population parameter.