Course Standards | Intro to Probability and Statistics

Module 1

Descriptive Statistics

Vocabulary, graphs, and numerical summaries — the language every statistical argument is built on.

DS-1

I can classify any variable, identify study components (population, sample, parameter, statistic) with proper notation, name the sampling method used, and explain how its design could bias results.

Classify any variable by type and sub-type; identify the population, sample, parameter, and statistic in a research scenario using proper notation (\(\mu, \bar{x}, \sigma, s, p, \hat{p}\)); identify the sampling method used in a study; and evaluate whether the design is likely to produce a representative sample, explaining the source and likely direction of any bias.

Mastered

Review lesson ds-1 →

To show mastery, I can…

Classify a variable by type and sub-type, including tricky cases like Likert scales and postal codes.
Identify the population, sample, parameter, and statistic in a research scenario.
Name the sampling method used in a described study.
Explain how the design could over- or under-represent part of the population, and state the likely direction of that bias.

Common mistake to avoid

Confusing parameters (\(\mu, \sigma, p\)) with statistics (\(\bar{x}, s, \hat{p}\)); and assuming that naming the sampling method tells you whether the sample is biased without analyzing the actual design.

DS-2

I can pick the right graph for a variable, build and read frequency tables, and spot a misleading graph.

Select the appropriate graph type for a given variable and research question; construct and read frequency distributions correctly; identify misrepresentation in a graph and explain what it obscures.

Mastered

Review lesson ds-2 →

To show mastery, I can…

Choose the right graph type for a given variable (a bar chart for categorical data, a histogram for quantitative).
Build a frequency distribution from raw data — computing relative (f/n) frequencies.
Read absolute and relative frequencies from a distribution.
Spot the specific technique that makes a graph misleading and explain what it distorts.

Common mistake to avoid

Confusing histograms and bar charts by judging from how they look instead of the variable type — a bar chart is for categorical data, a histogram for quantitative. Also: trusting a graph at face value instead of reading beyond the literal data.

DS-3

I can compute the standard measures of center and spread, then choose and justify the right ones for a distribution.

Calculate the measures of center and spread — mean, median, mode, range, variance, standard deviation (\(n-1\) denominator for sample data), the five-number summary, IQR, and the coefficient of variation (\(CV = \frac{s}{\bar{x}} \times 100\%\)); then select and justify the appropriate measures for a given distribution.

Mastered

Review across: ds-3 ds-4

To show mastery, I can…

Compute the mean, median, and mode from unsorted raw data (sorting first for the median).
Compute the range, the sample variance (using \(n-1\)), and the standard deviation.
Find the quartiles, five-number summary, and IQR from sorted data.
Compute and interpret \(CV = \frac{s}{\bar{x}} \times 100\%\) to compare spread across different units.
Choose and justify the right measures of center and spread for a distribution (skewed, symmetric, or with outliers).

Common mistake to avoid

Defaulting to the mean and standard deviation no matter the distribution's shape — when a skew or an outlier makes the median and IQR the honest choice; and on the computation side, finding the median without sorting first or dividing the sample variance by \(n\) instead of \(n-1\).

Module 2

Probability Foundations

The language and rules of probability — sample spaces, event operations, conditional probability, and combinatorics.

PR-1

I can read a real-world situation and calculate probabilities of simple and compound events, including when two events overlap or are mutually exclusive.

Set up and calculate probabilities of simple, complementary, and compound events directly from a described real-world situation — not only from a probability expression you are handed; and apply the addition rule correctly based on whether the events are mutually exclusive.

Mastered

Review lesson pr-1 →

To show mastery, I can…

Set up the events and their probabilities from a described real-world situation, not just from a given expression.
Use the complement rule \(P(A^c) = 1 - P(A)\) to find an event probability from its opposite.
Identify whether events are mutually exclusive before applying the addition rule.
Apply the addition rule in the right form — \(P(A \cup B) = P(A) + P(B) - P(A \cap B)\), dropping the overlap only when the events are mutually exclusive.

Common mistake to avoid

Using the simplified addition rule \(P(A \cup B) = P(A) + P(B)\) for events that actually overlap; and jumping straight to a formula without first translating the situation into the right events, so the wrong probabilities go into the calculation.

PR-2

I can read a real-world situation, calculate conditional and joint probabilities, and determine whether two events are mutually exclusive, independent, both, or neither.

Compute conditional probabilities \(P(A|B) = \frac{P(A \cap B)}{P(B)}\) with proper conditioning direction; apply the general multiplication rule \(P(A \cap B) = P(A)\,P(B|A)\) to find joint probabilities, with the independent special case \(P(A \cap B) = P(A)\,P(B)\); formally test for independence using both the conditional and multiplication definitions; and classify a pair of events as mutually exclusive, independent, both, or neither, justifying the relationship in writing.

Mastered

Review lesson pr-2 →

To show mastery, I can…

Read a real-world situation and set up the conditional probability it asks for — keeping the direction right (\(P(A|B)\), not \(P(B|A)\)).
Compute that conditional probability with \(P(A|B) = \frac{P(A \cap B)}{P(B)}\).
Compute a joint probability with the multiplication rule — \(P(A)\,P(B|A)\) in general, \(P(A)\,P(B)\) only when the events are independent.
Decide whether two events are mutually exclusive, independent, both, or neither, and justify it in writing.

Common mistake to avoid

Flipping the direction of conditioning (confusing \(P(A|B)\) with \(P(B|A)\)); multiplying \(P(A) \cdot P(B)\) for the joint probability without first checking that the events are actually independent; and treating mutually exclusive and independent as synonyms — they are actually opposites, since mutually exclusive events are always dependent.

PR-3

I can read a real-world situation, count its possible outcomes — deciding when order matters — and use the count to find a probability.

Apply the Fundamental Counting Principle, arrangements (\(_nP_r\), ordered selection of \(r\) — including the all-elements permutation \(n!\)), and combinations (\(_nC_r\) or \(\binom{n}{r}\)) to count outcomes; determine whether order matters in context and select the correct formula; use counting results to compute probabilities over equally-likely sample spaces.

Mastered

Review lesson pr-3 →

To show mastery, I can…

Read a real-world situation and decide whether order matters before choosing a tool.
Apply the correct tool — Fundamental Counting Principle, permutation, or combination.
Use the count to compute a probability over an equally-likely sample space.

Common mistake to avoid

Reaching for a permutation when order doesn't actually matter (or a combination when it does). The setup looks identical — the error only shows up in the final answer.

Module 3

Probability Distributions

Random variables, probability models, and named distributions — the bridge from probability rules to statistical inference.

PR-4

I can build a valid probability distribution for a random variable, compute and interpret its mean, variance, and standard deviation, and find cumulative probabilities.

Identify a random variable as discrete or continuous; construct a valid probability mass function (PMF) and verify that all probabilities sum to 1; calculate \(E(X)\), \(Var(X)\), and \(\sigma_X\); compute cumulative probabilities (\(P(X \le k)\), \(P(X > k)\), \(P(a < X \le b)\)); interpret \(E(X)\) as a long-run average — not the most likely outcome — and explain why \(E(X^2) \neq [E(X)]^2\) in general.

Mastered

Review lesson pr-4 →

To show mastery, I can…

Identify a random variable as discrete or continuous, then build a valid PMF from a described scenario and verify \(\sum P(X=x) = 1\).
Compute \(E(X)\), and compute \(E(X^2)\) and \([E(X)]^2\) separately — not as the same thing.
Use them to find \(Var(X)\) and \(\sigma_X\).
Compute cumulative probabilities — \(P(X \le k)\), \(P(X > k)\), and \(P(a < X \le b)\) — from the PMF.
Interpret \(E(X)\) as a long-run average in a real context (insurance, games, decisions), and give an example where it is not a possible value of \(X\).

Common mistake to avoid

Computing \(E(X^2)\) as if it equals \([E(X)]^2\) (which collapses the variance to zero), and reading \(E(X)\) as the "most likely" outcome rather than the long-run average — which often isn't even a possible value of \(X\) (an expected 2.3 children, say).

PR-5

I can check the BINS conditions and compute binomial probabilities (including "at least"/"at most").

Verify all four BINS conditions before applying the binomial model, and recognize when it does not apply; calculate exact and cumulative binomial probabilities; translate "at most \(k\)," "at least \(k\)," "more than \(k\)," and "fewer than \(k\)" correctly into probability expressions.

Mastered

Review lesson pr-5 →

To show mastery, I can…

Check and justify all four BINS conditions, and recognize when the binomial model does not apply (which condition fails).
Compute an exact binomial probability \(P(X = k)\).
Translate cumulative language ("at least," "at most," "more than," "fewer than") into the correct probability expression.

Common mistake to avoid

Using the binomial formula when the trials aren't independent or p changes between trials — and confusing "exactly \(k\)" with cumulative phrasing like "at least \(k\)."

PR-6

I can use the normal distribution model to find probabilities from values (standardizing to z first) and find values from probabilities (inverse normal).

Standardize a general normal variable \(X \sim N(\mu, \sigma^2)\) to \(Z\) and compute probabilities \(P(Z < a)\), \(P(Z > a)\), and \(P(a < Z < b)\) using the \(z\)-table; and perform inverse normal lookups to find the value \(x\) corresponding to a given percentile or area, unstandardizing with \(x = \mu + z^* \sigma\).

Mastered

Review lesson pr-6 →

To show mastery, I can…

Standardize \(X \sim N(\mu, \sigma^2)\) to \(Z\) before using the table.
Find a left-tail, right-tail, or middle area correctly using the \(z\)-table.
Recognize an inverse problem and work backward through the table.
Unstandardize correctly with \(x = \mu + z^* \sigma\).

Common mistake to avoid

Reading the \(z\)-table as right-tail when it is left-tail; adding areas instead of subtracting for middle regions; and finding the \(z^*\)-value but forgetting to convert it back to the original scale.

SYN-1 0–3

I can read a probability problem with no method named, recognize which approach it needs, and compute the answer.

Given a probability scenario with no method or section specified, recognize which approach applies — direct counting / the Fundamental Counting Principle, the addition or multiplication rules, conditional probability, or a named distribution (binomial or normal); justify the choice from the structure of the problem; and compute the requested probability.

Mastered

Review across: pr-1 pr-2 pr-3 pr-4 pr-5 pr-6

How this is scored (0–3)

3 · Exceeds Everything the problem calls for is here and then some: the method is picked and defended, the calculation lands correctly, and the write-up reads cleanly enough that another student could learn the approach straight from it.
2 · Proficient The approach fits the scenario and reaches a correct answer in context, with reasoning a reader can follow. A little tightening or a fuller line of explanation would polish it, but nothing important is missing or wrong. This is the target.
1 · Beginning Some of the idea is there, but it falls short — either the approach does not match the problem, or it does and then stumbles in the steps or the reasoning. Another pass and a clearer account of the thinking are needed.
0 · No evidence There is too little to go on — nothing that counts as a real attempt, an approach disconnected from the problem, or so much going wrong that there is no foothold to grade.

To show mastery, I can…

Recognize, unprompted, whether a problem is a counting problem, a probability-rules problem, a conditional-probability problem, or a named-distribution problem.
Rule out the approaches that don't fit and say why (e.g. not binomial because p changes between draws).
Carry the chosen method through to a correct probability.

Common mistake to avoid

Latching onto the first technique that fits a surface feature — multiplying because there are two stages, or reaching for the binomial because trials repeat — without checking that the structure actually matches (Independent? Does order matter? Are the BINS conditions met?).

Module 4

Statistical Inference

The Central Limit Theorem bridges probability to inference — the engine behind every confidence interval and hypothesis test.

INF-1

I can describe the sampling distribution of \(\bar{x}\) and calculate probabilities for sample means, and explain how the Central Limit Theorem applies.

Using the standard error \(\sigma/\sqrt{n}\), calculate probabilities for sample means; explain what the Central Limit Theorem states about the shape of the sampling distribution of \(\bar{x}\) and when it applies; and distinguish the distribution of sample means from individual observations.

Mastered

Review lesson inf-1 →

To show mastery, I can…

Use \(\sigma/\sqrt{n}\) (not \(\sigma\)) to compute probabilities for sample means.
Explain that the CLT applies to the distribution of sample means \(\bar{x}\), not individual observations.

Common mistake to avoid

Using the population standard deviation \(\sigma\) instead of the standard error \(\sigma/\sqrt{n}\) when calculating probabilities for sample means; and thinking the CLT implies that the raw data becomes normal as \(n\) grows.

INF-2

I can construct and interpret a confidence interval for a population mean (\(z\) vs. \(t\)) or proportion (\(z\)), check the right conditions, interpret the confidence level, and solve for required sample sizes.

Construct a confidence interval as point estimate \(\pm\) margin of error. For a mean: use \(z\) when \(\sigma\) is known or \(n \ge 30\), and \(t\) when \(\sigma\) is unknown and \(n < 30\) (\(df = n-1\)). For a proportion: always use \(z\), with point estimate \(\hat{p}\) and conditions \(n\hat{p} \ge 10\) and \(n(1-\hat{p}) \ge 10\). Check conditions; interpret the interval in context without attaching probability to the fixed parameter; explain what the confidence level means and how width responds to \(n\), variability, and confidence; and determine the sample size needed for a target margin of error (using a prior \(\hat{p}\) or \(p^* = 0.5\) worst case for a proportion).

Mastered

Review across: inf-2 inf-3 inf-4

To show mastery, I can…

For a mean: choose correctly between \(z\) and \(t\) and locate the critical value (\(t^*\) uses \(df = n-1\)).
For a proportion: always use \(z\), and check both success/failure conditions \(n\hat{p} \ge 10\) and \(n(1-\hat{p}) \ge 10\).
Construct the interval as estimate \(\pm E\) (\(\bar{x} \pm E\) or \(\hat{p} \pm E\)), building the margin of error from the right standard error.
Solve for the sample size needed for a target margin of error (using a prior \(\hat{p}\) or \(p^* = 0.5\) for a proportion).
Interpret the interval without attaching probability to the fixed parameter, and explain the long-run meaning of the confidence level and how width changes with \(n\), variability, and confidence.

Common mistake to avoid

Three classic slips: using \(z\) instead of \(t\) for a mean when \(\sigma\) is unknown and \(n\) is small; using \(p_0\) (a null value) instead of \(\hat{p}\) in a proportion interval's margin of error; and saying "there is a 95% probability the parameter lies in this interval" — the parameter is fixed, so the confidence describes the method's long-run reliability, not this one interval.

INF-3

I can run a five-step hypothesis test for a population mean (\(z\) vs. \(t\)) or proportion (\(p_0\) in the standard error), define a \(p\)-value, interpret a non-significant result, and explain Type I and Type II errors.

Execute all five steps of a hypothesis test for a population mean or proportion. For a mean: use \(z\) when \(\sigma\) is known or \(n \ge 30\), and \(t\) when \(\sigma\) is unknown and \(n < 30\). For a proportion: run a one-proportion \(z\)-test, checking \(np_0 \ge 10\) and \(n(1-p_0) \ge 10\) and building the standard error from the null value \(p_0\) (not \(\hat{p}\)). Check conditions; compute the test statistic and find or bound the \(p\)-value; state the conclusion in context; define \(p\)-value correctly as a conditional probability; interpret "fail to reject" correctly; and identify Type I and Type II errors in context.

Mastered

Review across: inf-5 inf-6

To show mastery, I can…

State \(H_0\) and \(H_a\) with the direction chosen in advance, and recognize whether the scenario calls for a mean test or a one-proportion \(z\)-test.
Select the correct test statistic — \(z\) vs. \(t\) for a mean (\(df = n-1\)), or the one-proportion \(z\)-test — and check the relevant conditions (including \(np_0 \ge 10\) and \(n(1-p_0) \ge 10\) for a proportion, with the SE built from \(p_0\)).
Compute the test statistic, find/bound the \(p\)-value, and make the reject / fail-to-reject decision.
State the final conclusion in context (not just "reject \(H_0\)").
Define \(p\)-value correctly and explain what "fail to reject" does and does not mean.
Identify Type I and Type II errors and explain their trade-offs in context.

Common mistake to avoid

Defining the \(p\)-value as "the probability the null is true"; concluding "\(H_0\) is true" from a failure to reject; choosing a one-tailed direction after inspecting the data; and — for a proportion — using \(\hat{p}\) instead of \(p_0\) in the standard error (the test asks "how extreme is this if \(H_0\) is true?", so the SE is built from \(p_0\)).

SYN-2 0–3

I can read an unstructured scenario, decide which inference procedure it calls for, carry it out, and answer the original question.

Given a scenario with no procedure named, decide whether it calls for a confidence interval or a hypothesis test, and whether the parameter is a mean or a proportion; select \(z\) vs. \(t\) and check the relevant conditions; carry out the chosen procedure; and state the answer in the context of the original question.

Mastered

Review across: inf-1 inf-2 inf-3 inf-4 inf-5 inf-6

How this is scored (0–3)

3 · Exceeds Everything the problem calls for is here and then some: the method is picked and defended, the calculation lands correctly, and the write-up reads cleanly enough that another student could learn the approach straight from it.
2 · Proficient The approach fits the scenario and reaches a correct answer in context, with reasoning a reader can follow. A little tightening or a fuller line of explanation would polish it, but nothing important is missing or wrong. This is the target.
1 · Beginning Some of the idea is there, but it falls short — either the approach does not match the problem, or it does and then stumbles in the steps or the reasoning. Another pass and a clearer account of the thinking are needed.
0 · No evidence There is too little to go on — nothing that counts as a real attempt, an approach disconnected from the problem, or so much going wrong that there is no foothold to grade.

To show mastery, I can…

Decide, from the wording alone, whether the question asks you to estimate a parameter (confidence interval) or test a claim about it (hypothesis test).
Identify whether the parameter is a mean or a proportion, and pick the matching formula.
Choose \(z\) vs. \(t\) correctly (\(z\) when \(\sigma\) is known or \(n \ge 30\) for a mean; \(t\) when \(\sigma\) is unknown and \(n\) is small) and justify the choice.
Check the conditions the chosen procedure requires before computing anything.
State the conclusion in the scenario's own terms — not as a bare 'reject \(H_0\)' or a naked interval.

Common mistake to avoid

Pattern-matching on a surface keyword instead of the structure of the question — running a hypothesis test because the scenario mentions a number to compare against, when the question actually asks you to estimate (a confidence interval). Or defaulting to \(z\) because it is more familiar, without checking whether \(\sigma\) is known.

Module 5

Regression & Association

Analyzing relationships between two variables — correlation and regression for quantitative data, chi-square for qualitative data.

REG-1

I can describe scatter plots, calculate and interpret correlation (\(r\)) and the coefficient of determination (\(r^2\)), check Pearson conditions, and explain why correlation is not causation.

Calculate and interpret Pearson \(r\) and \(r^2\); describe scatter plots by direction, form, strength, and outliers; check conditions for Pearson \(r\); and explain why correlation does not imply causation.

Mastered

Review lesson reg-1 →

To show mastery, I can…

Describe a scatter plot by direction, form, strength, and outliers.
Calculate and interpret \(r\) and \(r^2\), reporting \(r^2\) as the proportion of variance explained.
Check the conditions for Pearson \(r\).
Explain why correlation does not imply causation.

Common mistake to avoid

Interpreting \(r = 0\) as no relationship at all (it only means no linear relationship); confusing \(r\) with \(r^2\) (e.g. reporting \(r^2\) as the proportion of variance explained); and assuming correlation implies causation.

REG-2

I can find the least-squares regression line, interpret its slope and intercept, use it to make predictions (distinguishing interpolation from extrapolation), and identify outliers and high-leverage points.

Calculate the least-squares regression line using \(b = r(s_y/s_x)\) and \(a = \bar{y} - b\bar{x}\) (and verify it passes through \((\bar{x}, \bar{y})\)); interpret slope and intercept in context; use the line for prediction, distinguishing interpolation from extrapolation and explaining the risk of extrapolation; and identify regression outliers and influential/high-leverage points and describe their effects on the line.

Mastered

Review across: reg-2 reg-3

To show mastery, I can…

Compute the slope \(b\) and intercept \(a\) of the least-squares line, and verify the line passes through the point of means \((\bar{x}, \bar{y})\).
Interpret the slope using "predicted" and "on average," and flag when the intercept interpretation is not meaningful (\(x = 0\) outside the observed range).
Use the regression equation to predict \(\hat{y}\) for a given \(x\), and classify the prediction as interpolation or extrapolation (explaining the risk extrapolation introduces).
Identify a regression outlier (a point lying far from the line) and an influential/high-leverage point (an extreme-\(x\) point that pulls the slope), and explain its effect on the line.

Common mistake to avoid

Reading the slope as a guarantee ("\(y\) increases by \(b\)") instead of a prediction ("predicted \(\hat{y}\) increases by \(b\) on average"); interpreting the intercept when \(x = 0\) is outside the data range; and extrapolating far beyond the observed \(x\)-values.

REG-3

I can run a chi-square test of independence and interpret it as association (not causation).

Calculate expected frequencies using \(E = (\text{row total} \times \text{column total})/\text{grand total}\); compute \(\chi^2 = \sum \frac{(O-E)^2}{E}\); execute the five-step chi-square test with \(df = (r-1)(c-1)\); verify that all expected frequencies are \(\ge 5\); interpret the result as evidence of association, not causation.

Mastered

Review lesson reg-4 →

To show mastery, I can…

Compute all expected frequencies using \(E = (\text{row total} \times \text{column total})/\text{grand total}\).
Compute \(\chi^2 = \sum \frac{(O-E)^2}{E}\) with \(df = (r-1)(c-1)\).
Check the \(E \ge 5\) condition explicitly.
State the conclusion using "association," not "causation," in the research context.

Common mistake to avoid

Swapping observed and expected frequencies in the formula, running the test when an expected cell count is below 5, or concluding that one variable causes the other.

SYN-3 0–3

I can look at two variables and a research question and decide whether it calls for a chi-square test or a correlation/regression analysis — then carry it out.

Given two variables and a research question, determine the analysis from the variable types: a chi-square test of independence for two qualitative variables, or correlation and least-squares regression for two quantitative variables; justify the choice; carry out the appropriate analysis; and state the conclusion as association (never causation) in context.

Mastered

Review across: ds-1 reg-1 reg-2 reg-3 reg-4

How this is scored (0–3)

3 · Exceeds Everything the problem calls for is here and then some: the method is picked and defended, the calculation lands correctly, and the write-up reads cleanly enough that another student could learn the approach straight from it.
2 · Proficient The approach fits the scenario and reaches a correct answer in context, with reasoning a reader can follow. A little tightening or a fuller line of explanation would polish it, but nothing important is missing or wrong. This is the target.
1 · Beginning Some of the idea is there, but it falls short — either the approach does not match the problem, or it does and then stumbles in the steps or the reasoning. Another pass and a clearer account of the thinking are needed.
0 · No evidence There is too little to go on — nothing that counts as a real attempt, an approach disconnected from the problem, or so much going wrong that there is no foothold to grade.

To show mastery, I can…

Classify both variables (qualitative vs quantitative) before choosing any method — the Module 1 skill driving the Module 5 choice.
Map two qualitative variables to a chi-square test of independence, and two quantitative variables to correlation/regression.
Carry out the chosen analysis correctly — expected counts and \(\chi^2\), or \(r/r^2\) and the least-squares line.
State the result as association in context, and resist a causal claim.

Common mistake to avoid

Jumping to regression because the data arrived as numbers, without checking whether those numbers are genuinely quantitative or just coded categories — a postal code or a Likert label is qualitative, and calls for chi-square instead.