Statistics · CED Unit 4: Probability, Random Variables, and Probability Distributions · 14 min read · Updated 2026-05-11
The Geometric Distribution — AP Statistics
AP Statistics · CED Unit 4: Probability, Random Variables, and Probability Distributions · 14 min read
1. What Is The Geometric Distribution?★★☆☆☆⏱ 3 min
The geometric distribution is a discrete probability distribution that models the number of independent trials required to get the first success in a series of repeated Bernoulli (two-outcome) trials. It is often called the "waiting-time distribution" because we measure how many trials we wait for the first success.
Unlike the binomial distribution, which fixes the number of trials and counts the number of successes, the geometric distribution reverses this framing: it fixes the probability of success per trial, and lets the number of trials be the random variable of interest. AP Statistics exclusively uses the "shifted" convention, where we count trials starting at 1, matching the official CED definition.
2. Conditions for a Geometric Setting★★☆☆☆⏱ 4 min
Before you can use the geometric distribution to calculate probabilities or expected values, you must confirm that your scenario meets all four required conditions, abbreviated **BITS**:
**B**: Two possible outcomes per trial: each trial results in either a "success" (the outcome we are waiting for) or a "failure" (the other outcome).
**I**: Independent trials: the outcome of one trial does not change the probability of success for any other trial.
**T**: Wait for the first success: the number of trials is not fixed in advance; the value we measure is the number of trials needed to get the first success.
**S**: Constant success probability: the probability of success $p$ is the same for every trial.
Exam tip: When asked to identify the appropriate distribution for a scenario, always answer the question 'are we counting trials until a success, or counting successes in fixed trials?' first — this eliminates 50% of wrong answers immediately.
3. Geometric Probability Calculations (PMF and CDF)★★★☆☆⏱ 4 min
Once you confirm a scenario meets the geometric conditions, you can calculate probabilities using two core formulas: the probability mass function (PMF) for the probability of first success on an exact trial, and the cumulative distribution function (CDF) for the probability of first success by a certain trial.
To get the probability that the first success occurs *exactly* on the $k$-th trial, you must have $k-1$ consecutive failures first, followed by a success on the $k$-th trial. Because trials are independent, we multiply the probabilities:
P(X = k) = (1-p)^{k-1}p
for $k = 1, 2, 3, ...$
For cumulative probability, the probability that the first success occurs *on or before* the $k$-th trial is equal to 1 minus the probability that the first $k$ trials are all failures, which gives a convenient shortcut:
P(X \leq k) = 1 - (1-p)^k
We can rearrange this to get the probability that the first success occurs *after* the $k$-th trial:
P(X > k) = (1-p)^k
This shortcut saves significant time on the exam, as you do not need to sum multiple individual probabilities.
Exam tip: If you are asked for $P(X < k)$, always adjust the cutoff to get the correct exponent: $P(X < k) = P(X \leq k-1) = 1 - (1-p)^{k-1}$ to avoid off-by-one errors that are common on MCQs.
4. Mean and Standard Deviation of a Geometric Random Variable★★★☆☆⏱ 3 min
The geometric distribution has simple, intuitive formulas for the mean (expected value) and standard deviation. The expected value, which is the long-run average number of trials needed to get the first success, is:
E(X) = \mu_X = \frac{1}{p}
This makes intuitive sense: if the probability of success is 1/10, you expect to wait 10 trials on average for the first success. Lower probability of success means a higher expected number of trials, which matches the formula.
The variance of $X$ is $\text{Var}(X) = \frac{1-p}{p^2}$, so the standard deviation (a measure of the spread of the distribution) is:
\sigma_X = \frac{\sqrt{1-p}}{p}
All geometric distributions are right-skewed: the highest probability is always at $k=1$, and probabilities get smaller as $k$ increases. On FRQs, you are almost always required to interpret the expected value in context, which requires connecting it to the long-run average over many repetitions.
Exam tip: Always include the phrases 'on average' and 'over many repetitions' when interpreting expected value on FRQs to earn full credit for the interpretation.
5. AP Style Practice Examples★★★★☆⏱ 4 min
Common Pitfalls
Why: Confusion between different geometric distribution conventions used in different textbooks; AP exclusively uses the shifted (trial-counting) convention.
Why: Both use Bernoulli trials, so students forget to check what is being counted and whether the number of trials is fixed.
Why: Off-by-one error from misinterpreting the inequality cutoff.
Why: Confusion between probability of success per trial and expected number of trials until first success.
Why: Students forget that independence is violated when sampling without replacement from small populations, just like in binomial settings.