GP1 — Reading Conditional Probability from a Two-Way Table
For every variant: (a) the denominator of \( P(A \mid B) \) is always the marginal total for B — the row or column total for the conditioning event, not the grand total and not the intersection cell. (b) the conditional probability is the intersection cell divided by that marginal total.
Variant 0 — Study location × quiz result (n = 120)
| Passes quiz | Does not pass | Total | |
|---|---|---|---|
| Library | 48 | 12 | 60 |
| Elsewhere | 36 | 24 | 60 |
| Total | 84 | 36 | 120 |
- (a) Denominator = row total for "library" = 60.
- (b) \( P(\text{passes} \mid \text{library}) = 48/60 = \mathbf{0.80} \)
Variant 1 — Transportation × on-time arrival (n = 200)
| On time | Late | Total | |
|---|---|---|---|
| Public transit | 72 | 48 | 120 |
| Drives | 56 | 24 | 80 |
| Total | 128 | 72 | 200 |
- (a) Denominator = row total for "public transit" = 120.
- (b) \( P(\text{on time} \mid \text{transit}) = 72/120 = \mathbf{0.60} \)
Variant 2 — Membership type × weekly frequency (n = 150)
| 3+ times/week | Less frequent | Total | |
|---|---|---|---|
| Student member | 54 | 36 | 90 |
| Non-student | 24 | 36 | 60 |
| Total | 78 | 72 | 150 |
- (a) Denominator = row total for "student member" = 90.
- (b) \( P(\text{3+ times/week} \mid \text{student}) = 54/90 = \mathbf{0.60} \)
Variant 3 — Diet type × energy level (n = 100)
| High energy | Low energy | Total | |
|---|---|---|---|
| Vegetarian | 28 | 12 | 40 |
| Non-vegetarian | 36 | 24 | 60 |
| Total | 64 | 36 | 100 |
- (a) Denominator = row total for "vegetarian" = 40.
- (b) \( P(\text{high energy} \mid \text{vegetarian}) = 28/40 = \mathbf{0.70} \)
Variant 4 — Work status × course completion (n = 200)
| Completes on time | Does not complete | Total | |
|---|---|---|---|
| Works full-time | 48 | 72 | 120 |
| Not full-time | 56 | 24 | 80 |
| Total | 104 | 96 | 200 |
- (a) Denominator = row total for "works full-time" = 120.
- (b) \( P(\text{completes on time} \mid \text{full-time}) = 48/120 = \mathbf{0.40} \)
GP2 — General Multiplication Rule (Generator)
Generated by generateMultiplicationRuleProblem(). The approach for every instance:
- Identify \( P(B) \) and \( P(A \mid B) \) from the problem statement.
- Apply the General Multiplication Rule: \( P(A \cap B) = P(B) \cdot P(A \mid B) \).
Example: \( P(B) = 0.40 \), \( P(A \mid B) = 0.25 \).
\[ P(A \cap B) = 0.40 \times 0.25 = 0.10 \]Common distractors and why they are wrong:
- Adding instead of multiplying: \( 0.40 + 0.25 = 0.65 \) — this gives a union-related value, not a joint probability.
- Using only one factor: Reporting \( P(B) = 0.40 \) or \( P(A \mid B) = 0.25 \) alone — both factors are required for the joint probability.
- Reversing the formula: \( P(A \cap B) / P(B) = P(A \mid B) \) goes backward — we already know the conditional and need the joint, so we multiply, not divide.
GP3 — Tree Diagram Path Probability (Generator)
Generated by generateTreeDiagramProblem(). The approach for every instance:
- Identify the target path (e.g., Stage 1 = X, Stage 2 = Y).
- Read the Stage 1 probability \( P(X) \) and the conditional Stage 2 probability \( P(Y \mid X) \) from the branch labels.
- Multiply: joint probability of path = \( P(X) \times P(Y \mid X) \).
Example: \( P(\text{Stage 1} = X) = 0.60 \), \( P(\text{Stage 2} = Y \mid X) = 0.20 \).
\[ P(X \cap Y) = 0.60 \times 0.20 = 0.12 \]Common distractors and why they are wrong:
- Summing path probabilities: \( 0.60 + 0.20 = 0.80 \) — branch probabilities are multiplied along a path, not added.
- Using the wrong branch: Reading \( P(Y \mid X') \) instead of \( P(Y \mid X) \) — always follow the specified Stage 1 branch to its Stage 2 sub-branches.
- Forgetting the Stage 1 probability: Reporting only \( P(Y \mid X) = 0.20 \) — this is conditional on X, not a joint probability. Multiply by \( P(X) \).
GP4 — Independence Analysis (non-regenerable)
Data: 200 students. Video games: 90 play regularly. Exercise: 80 exercise regularly. Both: 36.
Part (a):
\[ P(\text{video games} \cap \text{exercise}) = \frac{36}{200} = 0.18 \]Part (b) — Multiplication test:
\[ P(\text{video games}) = \frac{90}{200} = 0.45, \quad P(\text{exercise}) = \frac{80}{200} = 0.40 \] \[ P(\text{video games}) \cdot P(\text{exercise}) = 0.45 \times 0.40 = 0.18 = P(\text{both}) \]The product equals the joint probability exactly. The events are independent. This is the surprising result — intuition might expect gamers to exercise less, but the data say otherwise.
Part (c) — Conditional probability confirmation:
\[ P(\text{video games} \mid \text{exercise}) = \frac{36}{80} = 0.45 = P(\text{video games}) \]Knowing a student exercises gives no additional information about whether they play video games. Independence confirmed.
Part (d) — Evaluating the classmate's claims:
- Claim 1 ("not mutually exclusive"): Correct. Since 36 students do both, \( P(A \cap B) = 0.18 \neq 0 \), so the events are not mutually exclusive.
- Claim 2 ("co-occurrence proves independence"): Flawed reasoning. Co-occurrence (positive joint probability) rules out mutual exclusivity but does not imply independence. Independence requires the multiplication test. In this case the test does confirm independence, but the reasoning given (co-occurrence = independence) is wrong — it is possible for events to co-occur and still be dependent.