Statistics · CED Unit 8: Chi-Squared Tests · 16 min read · Updated 2026-05-11

Chi-Squared Tests — AP Statistics


1. Overview of Chi-Squared Tests ★☆☆☆☆ ⏱ 3 min

Chi-squared ($\chi^2$) tests are non-parametric hypothesis tests for categorical (count) data, used to compare observed sample counts to expected counts under a null hypothesis. The test uses the right-skewed chi-squared probability distribution, defined by its degrees of freedom. This topic makes up 2-5% of your total AP Statistics exam score.

2. Goodness-of-Fit Test ★★☆☆☆ ⏱ 4 min

A chi-squared goodness-of-fit test assesses whether the distribution of a single categorical variable from one sample matches a pre-specified hypothesized distribution.

$H_0$: The distribution of the variable matches the hypothesized distribution
$H_a$: The distribution of the variable does not match the hypothesized distribution

Test statistic formula:

$$\chi^2 = \sum \frac{(O - E)^2}{E}$$

Where $O$ = observed count, $E$ = expected count ($E = n \times p_i$ for sample size $n$ and hypothesized proportion $p_i$), and degrees of freedom are $df = k - 1$ for $k$ categories.
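The formula can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the function names `expected_counts` and `chi_square_statistic` and the three-category data are hypothetical, chosen only to show how $E = n \times p_i$ and the sum fit together.

```python
def expected_counts(n, proportions):
    """Expected count for each category: E_i = n * p_i."""
    return [n * p for p in proportions]


def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 observations across 3 categories,
# with hypothesized proportions 0.5, 0.3, and 0.2.
observed = [33, 15, 12]
expected = expected_counts(60, [0.5, 0.3, 0.2])  # 30, 18, 12
statistic = chi_square_statistic(observed, expected)
df = len(observed) - 1  # k - 1 for k categories
```

Note that the statistic is built from counts, not proportions: the hypothesized proportions are only used to produce the expected counts.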

📐 Worked Example

You roll a 6-sided die 120 times, recording observed counts for each face: 1: 25, 2: 17, 3: 19, 4: 23, 5: 16, 6: 20. Test if the die is fair at the $\alpha=0.05$ significance level.

State hypotheses:
$H_0$: Each face has probability $1/6$ of landing up
$H_a$: At least one face has probability $\neq 1/6$
Calculate expected counts for each face:
$E = 120 \times 1/6 = 20 \quad (\text{all } E = 20)$
Calculate the chi-squared test statistic:
$\chi^2 = \frac{(25-20)^2}{20} + \frac{(17-20)^2}{20} + \frac{(19-20)^2}{20} + \frac{(23-20)^2}{20} + \frac{(16-20)^2}{20} + \frac{(20-20)^2}{20} = 3$
Find degrees of freedom and p-value:
$df = 6 - 1 = 5, \quad p \approx 0.699$
Since $p > 0.05$, we fail to reject $H_0$. There is no statistically significant evidence the die is unfair.
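On the exam the p-value comes from a calculator, but the die example can also be checked with the Python standard library. The sketch below is one possible approach, assuming integer degrees of freedom: the helper `chi_square_sf` is a hypothetical name, and it computes the right-tail area using a standard recurrence for the regularized upper incomplete gamma function.

```python
import math


def chi_square_sf(x, df):
    """Right-tail probability P(chi-squared with df degrees of freedom >= x).

    Uses the recurrence Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / gamma(a + 1)
    with a = df / 2 and z = x / 2, starting from Q(1/2, z) = erfc(sqrt(z))
    for odd df or Q(1, z) = exp(-z) for even df.
    """
    z = x / 2
    if df % 2 == 1:
        a, q = 0.5, math.erfc(math.sqrt(z))
    else:
        a, q = 1.0, math.exp(-z)
    while a < df / 2:
        q += z ** a * math.exp(-z) / math.gamma(a + 1)
        a += 1
    return q


# Observed counts for faces 1 through 6 from the die example.
observed = [25, 17, 19, 23, 16, 20]
expected = [120 / 6] * 6  # 20 per face under H0

statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
p_value = chi_square_sf(statistic, df=len(observed) - 1)
reject_null = p_value < 0.05  # False here, so we fail to reject H0
```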

Exam tip: Always label expected counts clearly in free response answers; examiners regularly dock marks for unlabeled calculations.

3. Tests of Independence and Homogeneity ★★★☆☆ ⏱ 5 min

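These two tests use identical calculations but differ in study design and hypothesis framing, so it is critical to distinguish them for full AP exam marks. A test of independence uses one random sample and asks whether two categorical variables are associated within a single population. A test of homogeneity uses independent random samples from two or more populations (or treatment groups in a randomized experiment) and asks whether the distribution of one categorical variable is the same across them.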

Both tests use two-way contingency tables. Expected count for each cell, and degrees of freedom, are calculated as:

$$E = \frac{\text{row total} \times \text{column total}}{\text{grand total}}, \quad df = (r - 1)(c - 1)$$
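As a rough sketch of how these two formulas apply to a whole table, the Python below computes every expected count and the degrees of freedom from a list of rows. The function names and the 2 x 2 table are hypothetical, used only for illustration.

```python
def expected_table(observed):
    """Expected counts under H0: row total * column total / grand total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def two_way_df(observed):
    """Degrees of freedom: (rows - 1) * (columns - 1)."""
    return (len(observed) - 1) * (len(observed[0]) - 1)


# Hypothetical 2 x 2 table of observed counts.
table = [[20, 30],
         [30, 20]]
expected = expected_table(table)  # every cell is 50 * 50 / 100 = 25.0
df = two_way_df(table)            # (2 - 1) * (2 - 1) = 1
```

Row and column totals are taken from the observed table only; the total row and total column themselves are never counted toward $r$ or $c$.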

📐 Worked Example

200 students are surveyed about their gender (male/female) and ice cream preference (chocolate/vanilla/strawberry). Observed counts: Male: 50 chocolate, 30 vanilla, 20 strawberry; Female: 30 chocolate, 40 vanilla, 30 strawberry. Test for an association between the two variables at $\alpha=0.05$.

State hypotheses:
$H_0$: Gender and ice cream preference are independent in the population
$H_a$: Gender and ice cream preference are associated in the population
Calculate expected counts:
$E_{\text{Male, Chocolate}} = 40, \quad E_{\text{Male, Vanilla}} = 35, \quad E_{\text{Male, Strawberry}} = 25$
$E_{\text{Female, Chocolate}} = 40, \quad E_{\text{Female, Vanilla}} = 35, \quad E_{\text{Female, Strawberry}} = 25$
Calculate test statistic:
$\chi^2 = \frac{(50-40)^2}{40} + \frac{(30-35)^2}{35} + \frac{(20-25)^2}{25} + \frac{(30-40)^2}{40} + \frac{(40-35)^2}{35} + \frac{(30-25)^2}{25} \approx 8.429$
Find degrees of freedom and p-value:
$df = (2-1)(3-1) = 2, \quad p \approx 0.0148$
Since $p < 0.05$, we reject $H_0$. There is sufficient evidence of an association between gender and ice cream preference.
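The numbers in this worked example can be reproduced with a short standard-library script. This is a sketch for checking the arithmetic, not a required method. It relies on one shortcut that holds only when $df = 2$: the right-tail area of the chi-squared distribution is exactly $e^{-\chi^2/2}$.

```python
import math

# Observed counts from the survey: rows are male, female;
# columns are chocolate, vanilla, strawberry.
observed = [[50, 30, 20],
            [30, 40, 30]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# Expected count for each cell: row total * column total / grand total.
expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

# Add up (O - E)^2 / E over all six cells.
statistic = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(observed, expected)
    for o, e in zip(obs_row, exp_row)
)
df = (len(observed) - 1) * (len(observed[0]) - 1)

# For df = 2 only, the right-tail area has the closed form exp(-x / 2).
p_value = math.exp(-statistic / 2)
reject_null = p_value < 0.05  # True here, so we reject H0
```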

Exam tip: Always name the correct test type in your free response answer to earn full points for your conclusion.

4. Validity Conditions for Inference ★★☆☆☆ ⏱ 3 min

All three chi-squared tests require the same three conditions to be met for valid inference. You must explicitly check all three on every free response question to earn full credit.

**Random**: Data comes from a random sample or randomized experiment. For homogeneity, each sample must be independently randomly selected.
**Independent**: Individual observations are independent. For sampling without replacement, the population must be at least 10 times the sample size (the 10% condition).
**Large Counts**: All expected counts are at least 5. If any expected count is <5, combine adjacent logically related categories to meet this condition.
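Two of these conditions are purely numerical and can be expressed as simple checks. The sketch below uses hypothetical function names and made-up counts. The Random condition depends on how the data were collected, so it cannot be verified from the numbers alone.

```python
def large_counts_ok(expected, minimum=5):
    """True when every expected count is at least `minimum`."""
    return all(e >= minimum for e in expected)


def ten_percent_ok(sample_size, population_size):
    """True when the population is at least 10 times the sample size."""
    return population_size >= 10 * sample_size


# Hypothetical expected counts for a four-category variable.
expected = [12.0, 9.5, 4.5, 4.0]
print(large_counts_ok(expected))   # False: two counts fall below 5

# Combining the last two categories (assumed to be logically related) gives 8.5.
combined = expected[:2] + [expected[2] + expected[3]]
print(large_counts_ok(combined))   # True

print(ten_percent_ok(200, 1500))   # False: 1500 < 10 * 200
```

The check runs on expected counts, not observed counts, and combining categories reduces $k$, so the degrees of freedom must be recalculated afterwards.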

5. Interpreting Test Statistics and P-Values ★★★☆☆ ⏱ 4 min

Correct interpretation in context is required for full marks on AP Statistics free response. Generic interpretations will not earn credit.

The $\chi^2$ test statistic is always non-negative, since every term is a squared difference divided by a positive expected count. Larger $\chi^2$ values indicate larger gaps between observed and expected counts, meaning stronger evidence against the null hypothesis. This is why the p-value is always the area in the right tail.

For the die example, a correct interpretation is: *The p-value of 0.699 means that if the die is fair, there is a 69.9% chance of observing a $\chi^2$ test statistic of 3 or larger purely by random chance. Since this is greater than our 0.05 significance level, we do not have sufficient evidence to conclude the die is unfair.*
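One way to make this interpretation concrete is to simulate it: roll a fair die 120 times, repeat the experiment many times, and record how often the statistic is 3 or larger. The sketch below does this with the standard library. The seed and the number of simulations are arbitrary choices, and the simulated proportion will only approximate the chi-squared p-value.

```python
import random

random.seed(1)  # arbitrary seed so the run is repeatable

ROLLS = 120
EXPECTED = ROLLS // 6     # 20 per face for a fair die
OBSERVED_SUM_SQ = 60      # sum of (O - E)^2 in the example: 60 / 20 = 3
SIMULATIONS = 5000        # arbitrary number of simulated experiments

at_least_as_extreme = 0
for sim in range(SIMULATIONS):
    counts = [0] * 6
    for roll in range(ROLLS):
        counts[random.randrange(6)] += 1
    # Compare integer sums of squares to avoid floating-point ties at exactly 3.
    sum_sq = sum((c - EXPECTED) ** 2 for c in counts)
    if sum_sq >= OBSERVED_SUM_SQ:
        at_least_as_extreme += 1

simulated_p = at_least_as_extreme / SIMULATIONS
print(f"Proportion of fair-die experiments with chi-squared >= 3: {simulated_p:.3f}")
```

The proportion should land near 0.7, which shows why a statistic of 3 is unremarkable for a fair die.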

Exam tip: Always tie your interpretation back to the specific context of the problem. Generic interpretations without reference to the variables being studied will not earn full marks.

Common Pitfalls

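Pitfall: Using hypothesized proportions in place of expected counts in the $\chi^2$ formula.
Why: Confusion between hypothesized proportions and expected counts when setting up calculations.

Pitfall: Using $df = k - 1$ for a two-way table, or $df = (r - 1)(c - 1)$ for a goodness-of-fit test.
Why: Mixing up the degrees of freedom formulas for goodness-of-fit versus two-way chi-squared tests.

Pitfall: Concluding that one variable causes the other after a significant test of independence.
Why: Confusing association and causation, a common AP Statistics theme.

Pitfall: Writing a one-sided alternative hypothesis for a chi-squared test.
Why: Carrying over one-sided alternative habits from z-tests or t-tests for means and proportions.

Pitfall: Skipping the condition checks in a free response answer.
Why: Assuming conditions are met and forgetting to document checks, which are explicitly graded on the AP exam.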

