Statistics · Inference for Categorical Data: Chi-Square · 14 min read · Updated 2026-05-11
Chi-Square Test for Homogeneity — AP Statistics
AP Statistics · Inference for Categorical Data: Chi-Square · 14 min read
1. What Is the Chi-Square Test for Homogeneity?★★☆☆☆⏱ 3 min
The chi-square test for homogeneity is a hypothesis-based inference procedure used to test whether the distribution of a single categorical response variable is homogeneous (identical) across two or more independent populations or treatment groups.
It is often confused with the other two chi-square tests on the AP exam, but has a distinct study design:
Goodness-of-fit: Compares one sample's distribution to a single hypothesized distribution
Test for independence: Tests for association between two categorical variables from a single population sample
Homogeneity: Starts with separate, independent random samples from each population of interest
2. Hypotheses and Conditions for Inference★★☆☆☆⏱ 3 min
The first step in any test is correctly stating hypotheses and verifying inference conditions. For $k$ populations and $m$ response categories:
Three conditions must be satisfied per the AP CED:
**Random**: Data comes from independent random samples or randomized experiment groups
**Independence**: Individual observations are independent; 10% condition when sampling without replacement
**Large Counts**: Every cell's expected count is at least 5
3. Test Statistic and Degrees of Freedom★★★☆☆⏱ 4 min
For an $r \times c$ contingency table (r = number of response categories, c = number of populations/groups), the expected count for each cell is calculated as:
If the null hypothesis is true, the overall proportion of observations in each row applies equally to all columns, so this gives the count we expect if $H_0$ is true. The chi-square test statistic is:
Where $O$ is the observed count and $E$ is the expected count for each cell. Larger values of $\chi^2$ indicate larger differences between observed and expected counts, so stronger evidence against $H_0$. Degrees of freedom are calculated as:
df = (r - 1)(c - 1)
4. Drawing a Conclusion in Context★★★☆☆⏱ 2 min
The final step is comparing your p-value to the pre-specified significance level $\alpha$ (almost always $\alpha = 0.05$ on the AP exam, unless stated otherwise) and drawing a conclusion that answers the original research question in context.
For chi-square tests, the p-value is the probability of observing a test statistic as large or larger than your calculated value if $H_0$ is true. All p-values correspond to the area to the right of the test statistic under the chi-square distribution. The decision rule is:
If $p$-value $< \alpha$: Reject $H_0$
If $p$-value $\geq \alpha$: Fail to reject $H_0$
5. AP-Style Concept Check★★★★☆⏱ 2 min
Common Pitfalls
Why: Both tests use identical calculation methods, so students often mix up the underlying research question
Why: Students memorize goodness-of-fit first and confuse the two formulas
Why: Students get lazy with wording and forget failing to reject does not prove the null is true
Why: Students see the "all counts ≥ 5" rule and default to checking the given observed counts
Why: Students incorrectly assume the alternative requires all groups to differ
Why: Students rush calculations and misremember the formula order