Statistics · Inference for Categorical Data: Chi-Square · 14 min read · Updated 2026-05-11

Chi-Square Test for Homogeneity — AP Statistics

AP Statistics · Inference for Categorical Data: Chi-Square · 14 min read

1. What Is the Chi-Square Test for Homogeneity? ★★☆☆☆ ⏱ 3 min

The chi-square test for homogeneity is a hypothesis-based inference procedure used to test whether the distribution of a single categorical response variable is homogeneous (identical) across two or more independent populations or treatment groups.

It is often confused with the other two chi-square tests on the AP exam, but has a distinct study design:

Goodness-of-fit: Compares one sample's distribution to a single hypothesized distribution
Test for independence: Tests for association between two categorical variables from a single population sample
Homogeneity: Starts with separate, independent random samples from each population of interest

2. Hypotheses and Conditions for Inference ★★☆☆☆ ⏱ 3 min

The first step in any test is correctly stating hypotheses and verifying inference conditions. For $k$ populations and $m$ response categories:

Three conditions must be satisfied per the AP CED:

**Random**: Data comes from independent random samples or randomized experiment groups
**Independence**: Individual observations are independent; 10% condition when sampling without replacement
**Large Counts**: Every cell's expected count is at least 5

📐 Worked Example

A counselor tests if the distribution of post-graduation plans (4-year college, 2-year college, work, gap year) is the same for general education, honors, and IB seniors. She takes independent random samples of 80, 100, and 60 seniors respectively. The smallest expected count is 7.1. State hypotheses and check conditions.

1. Define parameters: Let $p_{ij}$ be the proportion of program $j$ planning for outcome $i$.
2. State hypotheses: $H_0$: The distribution of post-graduation plans is homogeneous across the three programs. $H_a$: The distribution differs across at least one program.
3. Check Random: Independent random samples were taken, so condition satisfied.
4. Check Independence: Total sample size 240 < 10% of all seniors at a typical large high school, so 10% condition met.
5. Check Large Counts: Smallest expected count is 7.1 ≥ 5, so condition satisfied.
All conditions for inference are met.

3. Test Statistic and Degrees of Freedom ★★★☆☆ ⏱ 4 min

For an $r \times c$ contingency table (r = number of response categories, c = number of populations/groups), the expected count for each cell is calculated as:

E_{ij} = \frac{(\text{Row Total}) \times (\text{Column Total})}{\text{Grand Total}}

If the null hypothesis is true, the overall proportion of observations in each row applies equally to all columns, so this gives the count we expect if $H_0$ is true. The chi-square test statistic is:

\chi^2 = \sum_{\text{all cells}} \frac{(O - E)^2}{E}

Where $O$ is the observed count and $E$ is the expected count for each cell. Larger values of $\chi^2$ indicate larger differences between observed and expected counts, so stronger evidence against $H_0$. Degrees of freedom are calculated as:

df = (r - 1)(c - 1)

4. Drawing a Conclusion in Context ★★★☆☆ ⏱ 2 min

The final step is comparing your p-value to the pre-specified significance level $\alpha$ (almost always $\alpha = 0.05$ on the AP exam, unless stated otherwise) and drawing a conclusion that answers the original research question in context.

For chi-square tests, the p-value is the probability of observing a test statistic as large or larger than your calculated value if $H_0$ is true. All p-values correspond to the area to the right of the test statistic under the chi-square distribution. The decision rule is:

If $p$-value $< \alpha$: Reject $H_0$
If $p$-value $\geq \alpha$: Fail to reject $H_0$

5. AP-Style Concept Check ★★★★☆ ⏱ 2 min

Common Pitfalls

Why: Both tests use identical calculation methods, so students often mix up the underlying research question

Why: Students memorize goodness-of-fit first and confuse the two formulas

Why: Students get lazy with wording and forget failing to reject does not prove the null is true

Why: Students see the "all counts ≥ 5" rule and default to checking the given observed counts

Why: Students incorrectly assume the alternative requires all groups to differ

Why: Students rush calculations and misremember the formula order

Quick Reference Cheatsheet

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →