GP-1 — Direction and Strength from r (Variants 0–4)
Variant 0 (\( r = 0.86 \), study hours vs. exam score):
- (a) Direction: Positive — \( r = 0.86 > 0 \); as study hours increase, exam scores tend to increase.
- (b) Strength: Strong — \( |r| = 0.86 \geq 0.8 \).
Variant 1 (\( r = -0.72 \), outdoor temperature vs. hot beverage sales):
- (a) Direction: Negative — \( r = -0.72 < 0 \); as temperature increases, hot beverage sales tend to decrease.
- (b) Strength: Moderate — \( |r| = 0.72 \in [0.5, 0.8) \).
Variant 2 (\( r = 0.34 \), shoe size vs. vocabulary score):
- (a) Direction: Positive — \( r = 0.34 > 0 \).
- (b) Strength: Weak — \( |r| = 0.34 < 0.5 \).
Variant 3 (\( r = -0.91 \), exercise minutes vs. resting heart rate):
- (a) Direction: Negative — more exercise associated with lower resting heart rate.
- (b) Strength: Strong — \( |r| = 0.91 \geq 0.8 \).
Variant 4 (\( r = 0.58 \), sleep hours vs. productivity):
- (a) Direction: Positive — \( r = 0.58 > 0 \).
- (b) Strength: Moderate — \( |r| = 0.58 \in [0.5, 0.8) \).
Common mistakes: (1) Reading the sign of \( r \) incorrectly — a negative \( r \) means negative direction, not a weak relationship. (2) Applying the wrong strength threshold — always compare \( |r| \) (the magnitude) to the thresholds, not the signed value.
GP-2 — Computing and Interpreting \( r^2 \) (Variants 0–4)
Variant 0 (\( r = 0.86 \), height vs. weight):
- (a) \( r^2 = 0.86^2 = 0.7396 \approx 0.74 \)
- (b) Correct: "74% of the variability in weight is explained by the linear relationship with height." Direction: x (height) explains variability in y (weight).
Variant 1 (\( r = -0.72 \), temperature vs. coffee sales):
- (a) \( r^2 = (-0.72)^2 = 0.5184 \approx 0.52 \). Note: squaring removes the sign — \( r^2 \) is always non-negative.
- (b) Correct: "52% of the variability in coffee sales is explained by the linear relationship with temperature."
Variant 2 (\( r = 0.43 \), attendance vs. GPA):
- (a) \( r^2 = 0.43^2 = 0.1849 \approx 0.18 \)
- (b) Correct: "18% of the variability in GPA is explained by the linear relationship with attendance — 82% remains unexplained."
Variant 3 (\( r = -0.91 \), exercise frequency vs. resting heart rate):
- (a) \( r^2 = 0.91^2 = 0.8281 \approx 0.83 \)
- (b) Correct: "83% of the variability in resting heart rate is explained by the linear relationship with exercise frequency."
Variant 4 (\( r = 0.65 \), sleep hours vs. productivity):
- (a) \( r^2 = 0.65^2 = 0.4225 \approx 0.42 \)
- (b) Correct: "42% of the variability in productivity is explained by the linear relationship with sleep hours."
The C6 trap: Never report \( r \) directly as a percentage of variability explained. With \( r = 0.65 \), the variability explained is 42%, not 65%. The squaring step is mandatory and makes a large difference for moderate correlations.
GP-3 — Correlation vs. Causation Scenarios (C4, C7)
Scenario 1 — Firefighters and property damage (\( r = 0.91 \)):
- (a) No — there is a confounding variable. The strength of the correlation (\( r = 0.91 \)) is irrelevant to the causation question.
- (b) Fire size is the confounding variable: larger fires both attract more firefighters and cause more damage. The newspaper's causal claim is unwarranted.
Scenario 2 — Physical activity and depression (\( r = -0.78 \)):
- (a) Cannot determine — reverse causation or confounding are plausible. The correlation is consistent with exercise reducing depression, but the direction of causation is not established by this data alone.
- (b) Reverse causation: people who feel better psychologically may have more energy to exercise. Alternatively, a confounder such as overall health status could drive both higher activity and lower depression scores.
Scenario 3 — Pencil length and exam grades (\( r = 0.45 \)):
- (a) No — both variables are caused by a third variable (studious behavior). Students who study more use their pencils more and earn higher grades.
GP-4 — Compute \( r^2 \) and Interpret (Generator)
Solutions for this problem are generated dynamically. Refer to the explanation shown after each attempt in the lesson.
General approach: (1) square \( r \) to obtain \( r^2 \); (2) multiply by 100 for the percentage; (3) use the correct phrasing: "x explains r² × 100% of the variability in y."
Key distractor to watch for: The generator presents \( r \) itself as one option for \( r^2 \). This is the C6 error — never report \( r \) as the percentage of variability explained without squaring first.