Problem 1 — Variable Classification and Method Selection (Variants 0–4)
Variant 0 (weekly study hours → final exam score, n = 25):
- (a) Variable 1 — Quantitative. Study hours are measured numerically in hours per week; arithmetic differences (e.g., 10 hours vs. 15 hours) are meaningful.
- (b) Variable 2 — Quantitative. Exam score is measured on a numeric scale (0–100); averages and differences are meaningful.
- (c) Method — Pearson correlation / linear regression. Both variables are quantitative → Pearson correlation applies directly.
Variant 1 (smoking status × exercise frequency, n = 100):
- (a) Variable 1 — Qualitative. Smoking status is a binary category (smoker / non-smoker); no arithmetic meaning between categories.
- (b) Variable 2 — Qualitative. Exercise frequency is a binary category (regular / infrequent); named categories without numeric meaning.
- (c) Method — Chi-square test of independence. Both variables are qualitative → chi-square applies directly.
Variant 2 (training hours → injury count, n = 40):
- (a) Variable 1 — Quantitative. Training hours are continuous numerical measurements.
- (b) Variable 2 — Quantitative. Injury count is a numerical variable (whole-number counts); arithmetic differences are meaningful.
- (c) Method — Pearson correlation / linear regression. Two quantitative variables → Pearson correlation.
Variant 3 (smartphone brand × age group, n = 150):
- (a) Variable 1 — Qualitative. Brand names (Brand A / B / C) are three named categories; no arithmetic meaning applies.
- (b) Variable 2 — Qualitative. Age group (18–30 / 31–50 / 51+) is recorded as named categorical brackets, not individual ages.
- (c) Method — Chi-square test of independence. Both variables are qualitative (nominal/ordinal categories) → chi-square.
Variant 4 (caloric intake → body weight, n = 45):
- (a) Variable 1 — Quantitative. Caloric intake is measured in kcal — a continuous numerical measurement.
- (b) Variable 2 — Quantitative. Body weight is measured in kg — a continuous physical measurement.
- (c) Method — Pearson correlation / linear regression. Two quantitative variables → Pearson correlation.
Problem 2 — Interpreting Given Results (Variants 0–4)
Variant 0 (study hours / exam score: r = 0.72, r² = 0.518, reject H₀):
- Conclusion: There is sufficient evidence of a positive linear relationship between weekly study hours and final exam scores in the population.
- Effect size: \(r^2 = 0.518\) — study hours explain 51.8% of the variance in exam scores. This is a strong practical effect (\(r^2 \geq 0.35\)).
Variant 1 (smoking status × exercise frequency: χ²(1) = 4.167, V = 0.204, reject H₀):
- Conclusion: There is sufficient evidence that smoking status and exercise frequency are not independent in the population.
- Effect size: \(V = 0.204\) — this is a small association (0.1–0.3). The result is statistically significant, but the association is modest in magnitude.
Variant 2 (TV hours / GPA: r = −0.38, r² = 0.144, reject H₀):
- Conclusion: There is sufficient evidence of a negative linear relationship between weekly TV hours and GPA in the population.
- Effect size: \(r^2 = 0.144\) — TV hours explain only 14.4% of GPA variance. This is a weak practical effect (0.04–0.15).
- Key point: The result is statistically significant (p ≈ 0.003) but practically weak. Statistical significance and practical importance are not the same thing.
Variant 3 (education level × newspaper reading: χ²(1) = 2.197, fail to reject H₀):
- Conclusion: There is insufficient evidence to conclude that education level and daily newspaper reading are not independent in the population.
- Key point: Failing to reject H₀ does not mean we "accept" H₀ or "prove" independence. We simply lack sufficient evidence of dependence. The variables may still be related in the population — the data just don't provide enough evidence to detect it.
Variant 4 (commute time / work satisfaction: r = −0.22, r² = 0.048, fail to reject H₀):
- Conclusion: There is insufficient evidence of a linear relationship between commute time and work satisfaction in the population.
- Effect size context: Even if the result had been significant, \(r^2 = 0.048\) would be weak — commute time explains only 4.8% of variation in work satisfaction.
Problem 3 — Study A / Study B Error Analysis
(a) Study A error:
The researcher concludes that because the test failed to reject H₀, students in the two programs "must have the same GPA." This is the accept-H₀ fallacy. Failing to reject H₀ does not prove the null hypothesis — it only means the data do not provide sufficient evidence against it. The two populations may still have different GPAs; the study simply lacked power to detect any difference or relationship. We never "accept" H₀; we only "fail to reject" it.
(b) Study B error:
Work-life balance measured on a 1–10 numerical scale is a quantitative variable (numerical ratings where arithmetic differences, e.g. 7 vs. 4, are meaningful). Chi-square requires two qualitative (categorical) variables. Applying chi-square to a continuous scale requires artificially binning the data — which discards information and introduces arbitrary cut-points. The appropriate method is Pearson correlation, since both work-life balance score and the second variable (if quantitative) are numerical measurements.
Problem 4 — Generator Problems
Solutions for Problem 4 are generated dynamically by the generateBivariateScenario() function. Each generated problem includes a complete solution in the solution field, covering all four steps: (1) variable type classification, (2) method identification, (3) full conclusion using correct template language, and (4) effect size interpretation with threshold label.