Statistics · Unit 1: Exploring One-Variable Data · 14 min read · Updated 2026-05-11

Representing a Categorical Variable with Tables — AP Statistics

AP Statistics · Unit 1: Exploring One-Variable Data · 14 min read

1. Core Concepts: Tables for Categorical Data ★☆☆☆☆ ⏱ 3 min

A categorical variable places each individual observation into a distinct group or category (rather than a numerical measurement). The distribution of a variable describes what values it takes and how often those values occur. A table is the most common starting point for summarizing the distribution of a one-variable categorical dataset.

This topic is the foundation for all future work with categorical data, including visual representations, two-way tables, and inference for proportions, so mastering table construction and interpretation is critical for exam success. It accounts for 15-23% of the total AP exam score, and appears in both multiple-choice and free-response questions.

Standard notation used across this topic: $n$ = total number of observations, $f_i$ = raw frequency of observations in the $i$th category.

2. Frequency Tables ★☆☆☆☆ ⏱ 4 min

The core rule for all valid frequency tables is that the sum of all frequencies equals the total sample size $n$, written formally as:

\sum_{i=1}^k f_i = n

where $k$ is the number of distinct categories. This rule lets you check for counting errors before moving on to further analysis. Frequency tables give exact raw counts, which are useful for context-specific questions requiring actual numbers of observations.

3. Relative Frequency Tables ★★☆☆☆ ⏱ 4 min

The formula for relative frequency of the $i$th category is:

p_i = \frac{f_i}{n}

To get a percentage, multiply by 100: $\text{Percent} = p_i \times 100\%$. The sum of all relative frequencies should equal 1 (or 100% for percentages), with small deviations allowed from rounding individual values.

4. Cumulative Frequency and Mode Identification ★★☆☆☆ ⏱ 3 min

For ordered categorical (ordinal) variables (e.g., grades, satisfaction levels, income brackets), we can construct cumulative frequency tables to simplify answering questions about the number of observations at or below (or at or above) a given category. Cumulative frequency for the $i$th category is the sum of frequencies for all categories up to and including the $i$th category.

Common Pitfalls

Why: Students often mix up numerator and denominator when rushing, especially for percentage questions.

Why: Students assume cumulative frequency always goes from lowest to highest, regardless of context.

Why: Students confuse 'similar frequency' with 'equal highest frequency'.

Why: Students do not account for rounding error when reporting values to two decimal places.

Why: Students are used to quantitative data and automatically try to calculate a numerical center.

Quick Reference Cheatsheet

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →