Statistics · Inference for Categorical Data: Proportions · 14 min read · Updated 2026-05-11

Confidence Intervals for the Difference in Two Proportions — AP Statistics

AP Statistics · Inference for Categorical Data: Proportions · 14 min read

1. Core Concepts and Conditions for Inference ★★☆☆☆ ⏱ 4 min

A confidence interval for $p_1 - p_2$ gives a range of plausible values for the true difference between the proportion of successes in two separate independent populations. This method is used to compare proportions from two distinct groups, such as pass rates between two prep courses or defect rates between two manufacturing lines. If 0 is not in the interval, we have evidence of a true difference at the corresponding significance level.

**Random**: Both samples must be independently drawn random samples from their populations, or from a randomized experiment.
**Independent Groups**: The two samples must be independent of each other, with no pairing or matching of observations.
**10% Condition**: When sampling without replacement, each sample size must be less than 10% of its population to ensure independence within samples.
**Large Counts**: Each sample must have at least 10 observed successes and 10 observed failures to ensure the sampling distribution is approximately normal: $n_1\hat{p}_1 \geq 10$, $n_1(1-\hat{p}_1) \geq 10$, $n_2\hat{p}_2 \geq 10$, $n_2(1-\hat{p}_2) \geq 10$.

📐 Worked Example

A high school counselor wants to compare the proportion of 12th grade students who have taken an SAT prep course before graduation, between public vs private high schools in a large state. She randomly samples 150 public school 12th graders and 80 private school 12th graders. 63 public school students and 41 private school students report taking a prep course. Verify all conditions for a 95% confidence interval for $p_{public} - p_{private}$.

1. Check Random condition: The problem explicitly states both samples are random, so this condition is met.
2. Check Independent Groups condition: Samples are drawn separately from two distinct populations with no pairing, so groups are independent, condition met.
3. Check 10% Condition: The population of public and private 12th graders in the state is far larger than $10 \times 150 = 1500$ and $10 \times 80 = 800$ respectively, so this condition is met.
4. Check Large Counts condition: Public school: 63 successes, 87 failures. Private school: 41 successes, 39 failures. All values are $\geq 10$, so this condition is met.
Conclusion: All conditions for inference are satisfied.

Exam tip: On AP FRQs, you must explicitly name and verify every condition, not just say 'conditions are met'. You will lose an entire point if you do not show counts for the Large Counts condition.

2. Constructing the Confidence Interval ★★★☆☆ ⏱ 4 min

Once conditions are verified, we always use the unpooled standard error for confidence intervals for the difference in two proportions (per AP CED requirements). Pooling is only used for hypothesis tests for two proportions, when we assume the null hypothesis $p_1 = p_2$ is true; for confidence intervals we make no such assumption, so we use individual sample proportions.

Key notation: $p_1$ = true proportion of successes for population 1, $p_2$ = true proportion for population 2, $\hat{p}_1 = x_1/n_1$, $\hat{p}_2 = x_2/n_2$, where $x_1, x_2$ are the number of observed successes and $n_1, n_2$ are sample sizes.

(phat{p}_1 - phat{p}_2) \pm z^* \sqrt{\frac{\u0070hat{p}_1(1-\u0070hat{p}_1)}{n_1} + \frac{\u0070hat{p}_2(1-\u0070hat{p}_2)}{n_2}}

Where $\hat{p}_1 - \hat{p}_2$ is the point estimate of the true difference, $z^*$ is the critical z-value for your confidence level (common values: 90% = 1.645, 95% = 1.96, 99% = 2.576), and the term under the square root is the variance of the difference: for independent variables, the variance of a difference equals the sum of variances, so we add the two variance terms.

📐 Worked Example

Using the prep course data from the previous example: 63 out of 150 public school students took an SAT prep course, 41 out of 80 private school students took a prep course. Construct a 95% confidence interval for $p_{public} - p_{private}$.

Conditions are already verified, so we proceed. Label populations clearly: population 1 = public schools, population 2 = private schools.
Calculate sample proportions and point estimate:
$\u0070hat{p}_{public} = \frac{63}{150} = 0.42, \quad \u0070hat{p}_{private} = \frac{41}{80} = 0.5125$
Point estimate: $0.42 - 0.5125 = -0.0925$
Calculate standard error:
$SE = \sqrt{\frac{0.42(0.58)}{150} + \frac{0.5125(0.4875)}{80}} = \sqrt{0.001624 + 0.003127} \approx 0.0689$
Critical $z^*$ for 95% confidence is 1.96. Calculate margin of error:
$ME = 1.96 \times 0.0689 \approx 0.135$
Calculate final confidence interval:
$-0.0925 \pm 0.135 = (-0.2275, 0.0425)$

Exam tip: Always explicitly label which population is 1 and which is 2 at the start. This prevents sign errors that lead to wrong inference conclusions.

3. Interpretation and Inference Conclusions ★★★☆☆ ⏱ 3 min

Interpretation is one of the most frequently tested skills on the AP exam for this topic. A correct interpretation requires context and correct phrasing: the true difference is a fixed value, so it is either in the interval or not; confidence refers to the long-run performance of the method, not the probability that the true value is in the interval.

For inference, use this simple rule: If 0 is not inside the confidence interval, we have convincing evidence at the $(100-C)$% significance level that the two population proportions differ. If 0 is inside the interval, we do not have convincing evidence of a difference. We can never conclude that the proportions are equal, because the interval contains many non-zero plausible values.

📐 Worked Example

Using the interval $(-0.228, 0.043)$ calculated for $p_{public} - p_{private}$: (a) Interpret the interval in context. (b) What conclusion can we draw about whether the proportion of students who take SAT prep differs between public and private schools? (c) Interpret what '95% confidence' means here.

(a) Interval interpretation: We are 95% confident that the true difference (public minus private) in the proportion of 12th grade students who take an SAT prep course is between -0.228 and 0.043. This means the true proportion for public schools could be as much as 22.8 percentage points lower, or 4.3 percentage points higher, than for private schools.
(b) Inference conclusion: Because 0 is inside the interval, we do not have convincing evidence at the 95% confidence level that the proportion of students who take SAT prep differs between public and private 12th grade schools in this state.
(c) Confidence level interpretation: 95% confidence means that if we repeatedly took random samples of 150 public and 80 private 12th graders, and constructed a 95% confidence interval for the difference in proportions each time, about 95% of the intervals would capture the true difference in population proportions.

Exam tip: AP graders require full context for interpretation points. Generic interpretations without naming the populations and parameter will not earn full credit.

4. AP-Style Concept Check ★★★☆☆ ⏱ 3 min

Common Pitfalls

Why: Confusion between confidence interval rules and hypothesis test rules, where pooling is sometimes used.

Why: Carrying over the pooled value from the standard error mistake to the condition check.

Why: Mixing up the probability of the method working with the probability of the fixed parameter being in the interval.

Why: Matching the subtraction in the point estimate to the variance calculation.

Why: Confusing 'no evidence of a difference' with 'evidence of no difference'.

Why: Automatically using two-sample method whenever two proportions are compared, even for dependent samples.

Quick Reference Cheatsheet

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →

Confidence Intervals for the Difference in Two Proportions — AP Statistics

1. Core Concepts and Conditions for Inference ★★☆☆☆ ⏱ 4 min

2. Constructing the Confidence Interval ★★★☆☆ ⏱ 4 min

3. Interpretation and Inference Conclusions ★★★☆☆ ⏱ 3 min

4. AP-Style Concept Check ★★★☆☆ ⏱ 3 min

Common Pitfalls

Quick Reference Cheatsheet

More study guides