GP1 — t-Interval Construction (Variants A–E)
Each variant follows the same four-step workflow: compute df → look up t* → compute SE and E → state the interval.
Variant A (sleep duration, \(\bar{x} = 7.2\) hrs, \(s = 0.9\) hrs, \(n = 12\), 95% CI):
- \(df = n - 1 = 11\).
- \(t^* = 2.201\) (df = 11, 95% CI).
- \(\text{SE} = 0.9/\sqrt{12} = 0.2598, \quad E = 2.201 \times 0.2598 \approx 0.572\)
- 95% CI: \(7.2 \pm 0.572 = (6.628,\ 7.772) \approx (6.63,\ 7.77)\) hrs.
Variant B (plant growth, \(\bar{x} = 14.3\) cm, \(s = 2.1\) cm, \(n = 8\), 90% CI):
- \(df = 7, \quad t^* = 1.895\).
- \(\text{SE} = 2.1/\sqrt{8} = 0.7425, \quad E = 1.895 \times 0.7425 \approx 1.407\)
- 90% CI: \(14.3 \pm 1.407 = (12.893,\ 15.707) \approx (12.89,\ 15.71)\) cm.
Variant C (blood glucose, \(\bar{x} = 95.4\) mg/dL, \(s = 12.7\) mg/dL, \(n = 15\), 99% CI):
- \(df = 14, \quad t^* = 2.977\).
- \(\text{SE} = 12.7/\sqrt{15} = 3.280, \quad E = 2.977 \times 3.280 \approx 9.762\)
- 99% CI: \(95.4 \pm 9.762 = (85.638,\ 105.162) \approx (85.64,\ 105.16)\) mg/dL.
Variant D (reaction time, \(\bar{x} = 285\) ms, \(s = 42\) ms, \(n = 10\), 95% CI):
- \(df = 9, \quad t^* = 2.262\).
- \(\text{SE} = 42/\sqrt{10} = 13.285, \quad E = 2.262 \times 13.285 \approx 30.050\)
- 95% CI: \(285 \pm 30.050 = (254.95,\ 315.05)\) ms.
Variant E (protein content, \(\bar{x} = 8.7\) g, \(s = 1.2\) g, \(n = 6\), 90% CI):
- \(df = 5, \quad t^* = 2.015\).
- \(\text{SE} = 1.2/\sqrt{6} = 0.4899, \quad E = 2.015 \times 0.4899 \approx 0.987\)
- 90% CI: \(8.7 \pm 0.987 = (7.713,\ 9.687) \approx (7.71,\ 9.69)\) g.
Common error — df = n: Using \(df = n\) instead of \(df = n - 1\) yields a t* that is too small (closer to z*), producing an interval that is too narrow. Always subtract 1 because one degree of freedom is spent estimating \(\sigma\) with \(s\).
GP2 — When to Use t vs. z (Variants A–E)
The decision rule: if \(\sigma\) is known, use z regardless of n. If \(\sigma\) is unknown and \(n < 30\), use t. If \(\sigma\) is unknown but \(n \geq 30\), both are acceptable (z is most common).
Variant A (nutritionist, n = 20, σ unknown): Use t. σ unknown and n < 30. Correct interpretation: "We are 95% confident the true mean sodium content lies between [L] and [U] mg — this describes the reliability of the procedure, not the probability that μ is in this specific interval."
Variant B (factory QC, σ = 2.3 g known, n = 15): Use z. σ is known, so the extra uncertainty of estimating it is absent. We use z* = 1.96 (95% CI), not a t-value.
Variant C (researcher, n = 45, σ unknown): Use z (or t — both acceptable). With n ≥ 30 and σ unknown, the CLT justifies z, but t with df = 44 is equally valid (they converge). In practice, z is most commonly reported here.
Variant D (pharmacist, n = 8, σ unknown, population normal): Use t. Small n, σ unknown. The normality assumption for the population makes the t-interval valid even at n = 8.
Variant E (historian, n = 12, σ unknown): Use t. df = 11. Correct interpretation: "If we repeated this sampling procedure many times, about 90% of the resulting confidence intervals would capture the true mean — this one may or may not."
GP3 — z vs. t Width Comparison
Data: \(\bar{x} = 42.6\), \(s = 5.8\), \(n = 16\), 95% CI.
- \(\text{SE} = 5.8/\sqrt{16} = 5.8/4 = 1.450\)
- z-based (incorrect): \(E_z = 1.96 \times 1.450 = 2.842, \quad \text{CI}_z = (39.758,\ 45.442)\)
- t-based (correct): \(df = 15, \ t^* = 2.131, \quad E_t = 2.131 \times 1.450 = 3.090, \quad \text{CI}_t = (39.510,\ 45.690)\)
(a) The t-interval is wider (width 6.180 vs. 5.684).
(b) The t-interval is wider because \(t^* = 2.131 > z^* = 1.96\) — heavier tails of the t-distribution require pulling the critical values farther out to capture the same probability.
(c) The researcher should report the t-interval. Using z* with unknown σ underestimates uncertainty and produces false precision. The t-interval is honest about what the data actually tell us.
GP4 — Generated Problems
Solutions for GP4 are embedded in the generator output. The key steps are always:
- Compute \(df = n - 1\).
- Use the t* provided in the problem (it comes from the T_CRIT table for that df and confidence level).
- Compute \(SE = s/\sqrt{n}\), then \(E = t^* \times SE\).
- State the interval: \(\bar{x} \pm E\).