| Study Guides
Statistics · Inference for Categorical Data: Chi-Square · 14 min read · Updated 2026-05-11

Chi-Square Test for Homogeneity — AP Statistics

AP Statistics · Inference for Categorical Data: Chi-Square · 14 min read

1. What Is the Chi-Square Test for Homogeneity? ★★☆☆☆ ⏱ 3 min

The chi-square test for homogeneity is a hypothesis-based inference procedure used to test whether the distribution of a single categorical response variable is homogeneous (identical) across two or more independent populations or treatment groups.

It is often confused with the other two chi-square tests on the AP exam, but has a distinct study design:

  • Goodness-of-fit: Compares one sample's distribution to a single hypothesized distribution
  • Test for independence: Tests for association between two categorical variables from a single population sample
  • Homogeneity: Starts with separate, independent random samples from each population of interest

2. Hypotheses and Conditions for Inference ★★☆☆☆ ⏱ 3 min

The first step in any test is correctly stating hypotheses and verifying inference conditions. For $k$ populations and $m$ response categories:

Three conditions must be satisfied per the AP CED:

  1. **Random**: Data comes from independent random samples or randomized experiment groups
  2. **Independence**: Individual observations are independent; 10% condition when sampling without replacement
  3. **Large Counts**: Every cell's expected count is at least 5

3. Test Statistic and Degrees of Freedom ★★★☆☆ ⏱ 4 min

For an $r \times c$ contingency table (r = number of response categories, c = number of populations/groups), the expected count for each cell is calculated as:

E_{ij} = \frac{(\text{Row Total}) \times (\text{Column Total})}{\text{Grand Total}}

If the null hypothesis is true, the overall proportion of observations in each row applies equally to all columns, so this gives the count we expect if $H_0$ is true. The chi-square test statistic is:

\chi^2 = \sum_{\text{all cells}} \frac{(O - E)^2}{E}

Where $O$ is the observed count and $E$ is the expected count for each cell. Larger values of $\chi^2$ indicate larger differences between observed and expected counts, so stronger evidence against $H_0$. Degrees of freedom are calculated as:

df = (r - 1)(c - 1)

4. Drawing a Conclusion in Context ★★★☆☆ ⏱ 2 min

The final step is comparing your p-value to the pre-specified significance level $\alpha$ (almost always $\alpha = 0.05$ on the AP exam, unless stated otherwise) and drawing a conclusion that answers the original research question in context.

For chi-square tests, the p-value is the probability of observing a test statistic as large or larger than your calculated value if $H_0$ is true. All p-values correspond to the area to the right of the test statistic under the chi-square distribution. The decision rule is:

  • If $p$-value $< \alpha$: Reject $H_0$
  • If $p$-value $\geq \alpha$: Fail to reject $H_0$

5. AP-Style Concept Check ★★★★☆ ⏱ 2 min

Common Pitfalls

Why: Both tests use identical calculation methods, so students often mix up the underlying research question

Why: Students memorize goodness-of-fit first and confuse the two formulas

Why: Students get lazy with wording and forget failing to reject does not prove the null is true

Why: Students see the "all counts ≥ 5" rule and default to checking the given observed counts

Why: Students incorrectly assume the alternative requires all groups to differ

Why: Students rush calculations and misremember the formula order

Quick Reference Cheatsheet

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →