Statistics · CED Unit 3: Collecting Data · 14 min read · Updated 2026-05-11

Inference and Experiments — AP Statistics

AP Statistics · CED Unit 3: Collecting Data · 14 min read

1. What Is Inference for Experiments? ★★☆☆☆ ⏱ 3 min

Inference is the process of drawing general conclusions about treatment effects or population characteristics that go beyond raw observed data. For experiments, inference has two commonly tested goals: generalizing results to a broader population, and establishing that a treatment causes a change in the measured response variable. This topic makes up 10–15% of the total AP Statistics exam weight, appearing in both multiple-choice and free-response sections.

Unlike inference from observational studies, inference from experiments relies on random assignment of treatments to subjects, rather than only random sampling from a population. This key difference changes what types of inference are valid: random assignment allows causal inference, while random sampling allows generalization to a broader population. The AP exam heavily tests your ability to identify which inferences are appropriate based on study design, not just calculation.

2. Causal vs Associative Inference ★★☆☆☆ ⏱ 4 min

The core rule tested on the exam is: random assignment balances all confounding variables (measured and unmeasured) across treatment groups on average, so any remaining difference between groups can be attributed to the treatment. Causal inference and generalization are independent: a study can have one, both, or neither, depending on design choices.

📐 Worked Example

A researcher tests whether daily 10-minute meditation reduces self-reported anxiety in high school seniors. She recruits 60 volunteers from a local high school, and randomly assigns 30 to meditate daily for 8 weeks, and 30 to a control group that does 10 minutes of daily quiet reading. At the end of the study, the meditation group has a statistically significantly lower average anxiety score. Can the researcher make a causal inference that meditation reduces anxiety? Can she generalize this result to all high school seniors in the U.S.? Justify your answer.

First, confirm random assignment: The researcher randomly assigned treatments (meditation vs quiet reading) to volunteers. This balances all confounding variables (like baseline anxiety, study habits, stress from college applications) across the two groups on average.
Because random assignment was used, the difference in average anxiety can be attributed to the meditation treatment, so a causal inference is valid for this study.
Next, check for random sampling: The sample consists of 60 volunteers from a single local high school, not a random sample of all U.S. high school seniors.
The sample is not representative of the broader population, so generalization to all U.S. high school seniors is not valid.

Exam tip: When asked about inference scope, always address both causation and generalization explicitly, even if the question only asks one. AP exam graders expect you to demonstrate you know the difference between the two requirements, so stating both will help you earn full credit.

3. Core Principles of Experimental Design ★★★☆☆ ⏱ 4 min

For inference from an experiment to be valid, the experiment must follow four core design principles, each addressing a different threat to valid inference:

**Control**: Compare the treatment group to a control group that receives no treatment, a placebo, or the current standard treatment. This controls for confounding effects like the placebo effect, isolating the treatment effect.
**Replication**: Apply each treatment to multiple independent experimental units. Replication reduces the impact of random individual variation, leading to more precise inference and easier detection of true treatment effects.
**Randomization**: Randomly assign treatments to experimental units. This balances measured and unmeasured confounding variables across groups, enabling causal inference.
**Blocking**: Group experimental units similar on a known confounding variable into blocks, then randomly assign treatments within each block. Blocking removes variability from the known confounding variable, making it easier to detect a true treatment effect.

📐 Worked Example

An agriculture researcher wants to test whether a new fertilizer increases corn yield compared to a standard fertilizer. She knows that corn yield is affected by how much sunlight a plot receives, and her 20 test plots are split between a forest edge (lower sunlight) and an open field (higher sunlight). Design an appropriate experiment to test the new fertilizer, naming the principles you use.

**Blocking**: First, split the 20 plots into two blocks: 10 plots in the low-sunlight forest edge block, and 10 plots in the high-sunlight open field block. This accounts for the known effect of sunlight on yield, so we do not confuse sunlight variability with fertilizer variability.
**Randomization**: Within each block, randomly assign 5 plots to get the new fertilizer and 5 plots to get the standard fertilizer. Randomization balances unmeasured confounding variables (like soil nutrient variation) across the two fertilizer groups.
**Replication**: We have 5 plots per fertilizer in each block, for 10 total plots per fertilizer across all blocks. This gives us enough replication to reduce the impact of random plot-to-plot variation.
**Control**: We use the standard fertilizer as a control, so we can compare the yield of the new fertilizer to the existing standard to measure the treatment effect.

Exam tip: If the study groups units by a pre-existing variable before randomizing treatments, that is blocking, not confounding. Confounding is for uncontrolled variables; blocking is an intentional technique to improve inference.

4. Confounding and Threats to Valid Inference ★★★☆☆ ⏱ 3 min

Confounding is the primary reason causal inference is not valid for observational studies, but it can also occur in poorly designed experiments. Common sources include selection bias (when subjects choose their own treatment), lack of blinding, and unmeasured lurking variables. AP questions frequently ask you to identify a possible confounding variable and explain how it threatens causal inference.

📐 Worked Example

A restaurant chain wants to test whether adding a new appetizer to the menu increases total monthly revenue. They add the new appetizer to 10 randomly selected locations, and leave the menu unchanged at 10 other locations. After 3 months, the locations with the new appetizer have 12% higher average revenue than the locations without. A manager claims the new appetizer caused the revenue increase. Identify a possible confounding variable and explain why it threatens the causal inference.

One possible confounding variable is location size: the chain may have assigned the new appetizer to larger, higher-revenue locations by chance.
This variable is confounded with the new appetizer because location size is associated with the treatment (new appetizer is more likely to be added to larger locations) and associated with the response (larger locations naturally have higher total revenue).
We cannot separate the effect of the new appetizer from the effect of location size, so the manager's causal claim is not justified by this study.
Other valid examples include regional marketing campaigns that ran at the same time the new appetizer was added, or different average foot traffic between the two groups of locations.

Exam tip: When asked to identify a confounding variable on an FRQ, you must explicitly explain how it is associated with both the treatment and the response to earn full credit. Naming the variable alone is not enough.

Common Pitfalls

Why: Students confuse random sampling and random assignment, mixing up which type of inference each supports.

Why: Students confuse controlled design choices with uncontrolled sources of bias.

Why: General science texts sometimes mention repeating experiments to confirm results, so students misapply the definition in AP Statistics experimental design.

Why: Students assume that one implies the other, but the two are independent based on different study design choices.

Why: Students think just naming the variable is enough for full credit on FRQ.

Why: Students learn blocking reduces variability, so they assume blocking is always better.

Quick Reference Cheatsheet

← Back to topic

Stuck on a specific question?
Snap a photo or paste your problem — Ollie (our AI tutor) walks through it step-by-step with diagrams.
Try Ollie free →

Inference and Experiments — AP Statistics

1. What Is Inference for Experiments? ★★☆☆☆ ⏱ 3 min

2. Causal vs Associative Inference ★★☆☆☆ ⏱ 4 min

3. Core Principles of Experimental Design ★★★☆☆ ⏱ 4 min

4. Confounding and Threats to Valid Inference ★★★☆☆ ⏱ 3 min

Common Pitfalls

Quick Reference Cheatsheet

More study guides