Statistics · CED Unit 6: Inference for Categorical Data: Proportions · 14 min read · Updated 2026-05-11

Introducing Confidence Intervals for Proportions — AP Statistics

1. Core Concepts and Notation ★★☆☆☆ ⏱ 3 min

A confidence interval for a population proportion is an inferential method that produces a range of plausible values for an unknown fixed population proportion $p$, based on data collected from a random sample. Unlike a point estimate (a single guess for $p$), a confidence interval explicitly quantifies sampling variability, accounting for natural variation between different samples from the same population.

This is the first core inference topic for proportions in AP Statistics. It belongs to Unit 6, which makes up 12-15% of the total AP exam weight, and it appears in both the multiple-choice and free-response sections.

2. Confidence Interval Structure and Correct Interpretation ★★☆☆☆ ⏱ 3 min

All confidence intervals follow the same core structure: a point estimate plus or minus a margin of error. The point estimate is your best single guess for the unknown population parameter, and the margin of error accounts for random sampling variability. For a population proportion, the general form is:

$\hat{p} \pm z^* \times \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}$

Where $z^*$ is the critical value from the standard normal distribution corresponding to your chosen confidence level. You must memorize the three most common $z^*$ values for the AP exam: 1.645 for 90% confidence, 1.96 for 95% confidence, and 2.576 for 99% confidence.
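The three memorized values can be recovered from the standard normal distribution itself. The sketch below is illustrative only: the function name `critical_value` is an assumption of this example, and it relies on Python's standard-library `statistics.NormalDist` for the inverse normal calculation.

```python
from statistics import NormalDist

def critical_value(confidence_level: float) -> float:
    """Return z* so that the central `confidence_level` area of the
    standard normal curve lies between -z* and +z*."""
    if not 0 < confidence_level < 1:
        raise ValueError("confidence_level must be strictly between 0 and 1")
    # Area left over in ONE tail, e.g. 0.025 for 95% confidence.
    tail_area = (1 - confidence_level) / 2
    return NormalDist().inv_cdf(1 - tail_area)

for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} confidence: z* = {critical_value(level):.3f}")
# 90% confidence: z* = 1.645
# 95% confidence: z* = 1.960
# 99% confidence: z* = 2.576
```

Splitting the leftover area evenly between the two tails is what makes the interval symmetric around $\hat{p}$.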

The confidence level describes the long-run behavior of the interval method, not a single calculated interval. A C% confidence level means that if you repeated the sampling process many times and built an interval from each sample, about C% of the resulting intervals would capture the true population proportion. The AP-expected phrasing for interpreting a single interval is: *We are C% confident that the true proportion of [context] is between [lower bound] and [upper bound].*
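A small simulation can make the long-run idea concrete. The sketch below is illustrative only: in practice $p$ is unknown, so the value `true_p=0.56` is a pretend population proportion chosen for this example, and the function name `coverage_rate` and the fixed seed are likewise assumptions.

```python
import random
from math import sqrt

def coverage_rate(true_p, n, z_star, num_samples, seed=0):
    """Draw many random samples of size n and return the share of
    z-intervals that capture the (pretend-known) true proportion."""
    rng = random.Random(seed)  # fixed seed so reruns give the same result
    hits = 0
    for _ in range(num_samples):
        # Each individual is a "success" with probability true_p.
        successes = sum(rng.random() < true_p for _ in range(n))
        p_hat = successes / n
        margin = z_star * sqrt(p_hat * (1 - p_hat) / n)
        if p_hat - margin <= true_p <= p_hat + margin:
            hits += 1
    return hits / num_samples

rate = coverage_rate(true_p=0.56, n=250, z_star=1.96, num_samples=2000)
print(f"Share of intervals that captured p: {rate:.3f}")
```

The printed share should land close to 0.95, though not exactly on it. Any single interval either captured $p$ or missed it; the 95% describes the method across all the repetitions.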

📐 Worked Example

A political scientist asks 250 randomly selected registered voters whether they support a new local park bond, and finds 140 support it. (a) Identify the point estimate for the true proportion of all registered voters who support the bond. (b) Interpret what a 95% confidence level means in this context.

Confirm the goal: we need a point estimate and context-specific interpretation of the 95% confidence level.
Calculate the point estimate as the sample proportion:
$\hat{p} = \frac{\text{Number of supporters}}{\text{Sample size}} = \frac{140}{250} = 0.56$
0.56 is our point estimate for the true population proportion $p$.
Recall that the confidence level describes the long-run success rate of the method, not any single interval.
State the interpretation in context: If we took many random samples of 250 registered voters from this population, about 95% of the resulting confidence intervals would capture the true proportion of all registered voters who support the park bond.

Exam tip: Always explicitly distinguish between interpreting a confidence level (long-run method behavior) and interpreting a single confidence interval (plausible values for the population proportion).

3. Conditions for Inference ★★★☆☆ ⏱ 3 min

Before you can reliably construct a confidence interval for a proportion, you must verify three conditions to ensure that the sampling distribution of $\hat{p}$ is approximately normal and your standard error calculation is valid. Skipping condition checks is one of the most common reasons for lost points on AP FRQs.

**Random**: The sample must be randomly selected from the population of interest, ensuring $\hat{p}$ is an unbiased estimator of $p$.
**Independent**: Individual observations must be independent. When sampling without replacement from a finite population, use the 10% condition: the sample size $n$ must be no more than 10% of the total population size $N$.
**Large Counts**: The sampling distribution of $\hat{p}$ is approximately normal only if we have at least 10 successes and 10 failures in the sample: $n\hat{p} \geq 10$ and $n(1-\hat{p}) \geq 10$.
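The three checks above can be sketched as a small helper. This is a minimal illustration, not a standard routine: the function name `check_conditions`, its parameters, and the sample numbers in the usage lines are all hypothetical. Whether the sample was random cannot be computed from the data, so the caller has to supply it from the study description.

```python
def check_conditions(successes, n, population_size=None, random_sample=True):
    """Return a dict reporting which inference conditions are met.

    population_size=None is taken to mean the 10% condition is not a
    concern (an assumption of this sketch, e.g. a very large population).
    """
    failures = n - successes
    return {
        # Comes from how the data were collected, not from arithmetic.
        "random": random_sample,
        # n <= 10% of N, written as 10n <= N to avoid decimal rounding.
        "ten_percent": population_size is None or 10 * n <= population_size,
        # n * p_hat is just the success count; n * (1 - p_hat) the failures.
        "large_counts": successes >= 10 and failures >= 10,
    }

# Hypothetical survey: 120 "yes" answers out of 300, population of 10,000.
result = check_conditions(successes=120, n=300, population_size=10_000)
print(result)
# {'random': True, 'ten_percent': True, 'large_counts': True}
```

Because $n\hat{p}$ equals the number of successes and $n(1-\hat{p})$ equals the number of failures, the Large Counts check needs no multiplication when the raw counts are available.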

📐 Worked Example

A quality control inspector tests 40 randomly selected lightbulbs from a shipment of 5000, and finds 6 are defective. Check the conditions for constructing a confidence interval for the true proportion of defective bulbs in the shipment.

Check the Random condition: The problem explicitly states the 40 bulbs were randomly selected, so this condition is satisfied.
Check the 10% condition for independence: Total population size is 5000. 10% of 5000 is 500, and our sample size of 40 is less than 500. The 10% condition is satisfied, so we can assume independence.
Check the Large Counts condition: First calculate the sample proportion $\hat{p} = 6/40 = 0.15$, then compute the counts:
$n\hat{p} = 40(0.15) = 6; \quad n(1-\hat{p}) = 40(0.85) = 34$
Final conclusion: The Large Counts condition fails because $n\hat{p} = 6 < 10$. We cannot safely construct a one-proportion z-interval for this data.

4. Calculating a One-Proportion Z-Interval ★★★☆☆ ⏱ 5 min

Once all conditions are confirmed to be met, you can calculate the confidence interval using the standard formula, then interpret the interval in context to earn full credit on FRQs.

$\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}$

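The term $z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}$ is the margin of error ($ME$): the largest distance we expect between $\hat{p}$ and the true $p$ at our chosen confidence level. The square-root factor is the standard error of $\hat{p}$, so the margin of error is the critical value times the standard error.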
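To see how the margin of error responds to sample size, here is a brief sketch. The function name `margin_of_error` and the inputs (a sample proportion of 0.5 at 95% confidence) are hypothetical choices for illustration.

```python
from math import sqrt

def margin_of_error(p_hat, n, z_star):
    """Critical value times the standard error of the sample proportion."""
    return z_star * sqrt(p_hat * (1 - p_hat) / n)

# Quadruple the sample size each time and watch the margin of error.
for n in (100, 400, 1600):
    print(f"n = {n:>4}: ME = {margin_of_error(0.5, n, 1.96):.4f}")
# n =  100: ME = 0.0980
# n =  400: ME = 0.0490
# n = 1600: ME = 0.0245
```

Because $n$ sits under a square root, quadrupling the sample size only halves the margin of error.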

📐 Worked Example

A biologist wants to estimate the proportion of oak trees in a national forest that are infected with a certain fungus. They take a random sample of 180 oak trees, and find 63 are infected. Calculate and interpret a 90% confidence interval for the true proportion of infected oak trees.

Verify all required conditions: A random sample is given. It is reasonable to assume the national forest contains more than $10 \times 180 = 1800$ oak trees, so the 10% condition is met. The sample has 63 successes and $180 - 63 = 117$ failures, so $n\hat{p} = 63 \geq 10$ and $n(1-\hat{p}) = 117 \geq 10$, and Large Counts is met. All conditions are satisfied.
Calculate the sample proportion:
$\hat{p} = \frac{63}{180} = 0.35$
Find the critical $z^*$ value for 90% confidence: $z^* = 1.645$.
Calculate the margin of error:
$ME = 1.645 \times \sqrt{\frac{(0.35)(0.65)}{180}} \approx 0.0585$
Construct the interval bounds:
$0.35 \pm 0.0585 = (0.2915, 0.4085) \approx (0.29, 0.41)$
Interpret the interval in context, per AP expectations: We are 90% confident that the true proportion of all oak trees in this national forest that are infected with the fungus is between 0.29 and 0.41.
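The calculation steps can be collected into one function and checked against the oak tree numbers. This is a sketch under stated assumptions: the function name `one_proportion_z_interval` is hypothetical, it uses the standard-library `statistics.NormalDist` for $z^*$, and it assumes the Random, 10%, and Large Counts conditions have already been verified separately.

```python
from math import sqrt
from statistics import NormalDist

def one_proportion_z_interval(successes, n, confidence_level):
    """Return (lower, upper) for a one-proportion z-interval.

    Does not check the inference conditions; verify those first.
    """
    p_hat = successes / n
    # Critical value leaving (1 - C)/2 of the area in each tail.
    z_star = NormalDist().inv_cdf(1 - (1 - confidence_level) / 2)
    margin = z_star * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin

lower, upper = one_proportion_z_interval(successes=63, n=180, confidence_level=0.90)
print(f"({lower:.4f}, {upper:.4f})")
# (0.2915, 0.4085)
```

The function uses the unrounded critical value rather than 1.645, which leaves the bounds unchanged at four decimal places here. The sentence interpreting the interval in context still has to be written by hand.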

Common Pitfalls

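Claiming there is a C% probability that the true $p$ lies inside one calculated interval. Why: Students confuse the long-run behavior of the interval method with probability for a single fixed interval. The true $p$ is fixed, not random, so it is either in the interval or not.

Leaving out the 10% condition when sampling without replacement. Why: Students often dismiss the 10% condition as unimportant and skip it to save time on FRQs.

Checking Large Counts with 0.5 in place of $\hat{p}$. Why: Students remember $p(1-p)$ is maximized at 0.5 from sampling distribution topics and incorrectly use it for condition checks.

Using $z^* = 2$ for 95% confidence instead of 1.96. Why: Students rely on the empirical rule and use rounded values instead of the precise critical values AP expects.

Saying that C% of all sample proportions fall inside the calculated interval. Why: Students confuse the range of the sampling distribution of sample proportions with a confidence interval for the fixed population parameter.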

Quick Reference Cheatsheet
