Problem 1 — t-Test Decisions (Variants A, B, C)
Each variant below shows a small-sample t-test worked in full. The key steps are always: (1) identify df = n − 1, (2) compute SE = s/√n, (3) compute t, (4) locate t in the correct df row of the t-table to bound p, (5) compare to α.
Variant A (Nutritionist; n = 10, x̄ = 2180 kcal, s = 280 kcal, μ₀ = 2000, two-tailed, α = 0.05):
- H₀: \(\mu = 2000\); Ha: \(\mu \neq 2000\)
- \(\text{SE} = 280 / \sqrt{10} = 280 / 3.162 \approx 88.54\)
- \(t = (2180 - 2000) / 88.54 = 180 / 88.54 \approx 2.03\), \(df = 9\)
- t-table, df = 9 row: \(t^* = 1.833\) (one-tail 0.05); \(t^* = 2.262\) (one-tail 0.025, i.e., two-tail 0.05). Since \(1.833 < 2.03 < 2.262\): \(0.05 < p < 0.10\).
- Decision: \(p > 0.05\). Fail to reject H₀. Insufficient evidence that mean caloric intake differs from 2000 kcal.
Variant B (Battery lab; n = 12, x̄ = 23.4 h, s = 3.2 h, μ₀ = 25 h, two-tailed, α = 0.05):
- H₀: \(\mu = 25\); Ha: \(\mu \neq 25\)
- \(\text{SE} = 3.2 / \sqrt{12} = 3.2 / 3.464 \approx 0.924\)
- \(t = (23.4 - 25) / 0.924 = -1.6 / 0.924 \approx -1.73\), \(df = 11\)
- t-table, df = 11: \(t^* = 1.363\) (one-tail 0.10); \(t^* = 1.796\) (one-tail 0.05). Since \(|t| = 1.73\) and \(1.363 < 1.73 < 1.796\): \(0.10 < p < 0.20\).
- Decision: \(p > 0.05\). Fail to reject H₀. Insufficient evidence that mean battery life differs from 25 hours.
Variant C (Doctor; n = 16, x̄ = 37.3°C, s = 0.5°C, μ₀ = 37.0°C, two-tailed, α = 0.05):
- H₀: \(\mu = 37.0\); Ha: \(\mu \neq 37.0\)
- \(\text{SE} = 0.5 / \sqrt{16} = 0.5 / 4 = 0.125\)
- \(t = (37.3 - 37.0) / 0.125 = 0.3 / 0.125 = 2.40\), \(df = 15\)
- t-table, df = 15: \(t^* = 2.131\) (one-tail 0.025 = two-tail 0.05); \(t^* = 1.753\) (one-tail 0.05). Since \(2.40 > 2.131\): \(p < 0.05\).
- Decision: Reject H₀. There is sufficient evidence at α = 0.05 to conclude the mean temperature differs from 37.0°C.
Common mistakes: (1) Using df = n instead of df = n − 1. (2) Reading the one-tail column for a two-tailed test — for a two-tailed test at α = 0.05, use the one-tail 0.025 column. (3) Reporting an exact p-value from the t-table: the t-table gives bounds, not exact p-values. State "0.05 < p < 0.10" rather than "p = 0.07."
Problem 2 — Proportion Test Conditions and z Statistic (Variants A, B, C)
Variant A (n = 150, x = 36, p₀ = 0.20, two-tailed, α = 0.05):
- Conditions: \(np_0 = 150(0.20) = 30 \geq 5\) ✓; \(n(1 - p_0) = 120 \geq 5\) ✓.
- \(\hat{p} = 36/150 = 0.24\)
- \(\text{SE} = \sqrt{0.20 \times 0.80 / 150} = \sqrt{0.001067} \approx 0.03266\)
- \(z = (0.24 - 0.20) / 0.03266 \approx 1.22\)
- \(p = 2(1 - \Phi(1.22)) = 2(1 - 0.8888) = 2(0.1112) \approx 0.222\)
- Decision: \(p = 0.222 > 0.05\). Fail to reject H₀. Insufficient evidence the smoking rate differs from 20%.
Variant B (n = 80, x = 42, p₀ = 0.60, two-tailed, α = 0.05):
- Conditions: \(np_0 = 80(0.60) = 48 \geq 5\) ✓; \(n(1-p_0) = 32 \geq 5\) ✓.
- \(\hat{p} = 42/80 = 0.525\)
- \(\text{SE} = \sqrt{0.60 \times 0.40 / 80} = \sqrt{0.003} \approx 0.05477\)
- \(z = (0.525 - 0.60) / 0.05477 \approx -1.37\)
- \(p = 2(1 - \Phi(1.37)) \approx 2(0.0853) = 0.171\)
- Decision: \(p = 0.171 > 0.05\). Fail to reject H₀. Insufficient evidence the graduate employment rate differs from 60%.
Variant C (n = 200, x = 82, p₀ = 0.35, two-tailed, α = 0.01):
- Conditions: \(np_0 = 200(0.35) = 70 \geq 5\) ✓; \(n(1-p_0) = 130 \geq 5\) ✓.
- \(\hat{p} = 82/200 = 0.41\)
- \(\text{SE} = \sqrt{0.35 \times 0.65 / 200} = \sqrt{0.001138} \approx 0.03373\)
- \(z = (0.41 - 0.35) / 0.03373 \approx 1.78\)
- \(p = 2(1 - \Phi(1.78)) = 2(0.0375) = 0.075\)
- Decision: \(p = 0.075 > 0.01\). Fail to reject H₀ at α = 0.01. Insufficient evidence the commuter rate differs from 35%.
Common mistake: Using \(\hat{p}\) instead of \(p_0\) in the SE denominator. The SE formula for a proportion test is \(\sqrt{p_0(1-p_0)/n}\) — the null value goes in the denominator because we compute the SE assuming H₀ is true.
Problem 3 — Choosing t vs. z vs. Proportion Test
Scenario A (Factory QC; n = 10, x̄ = 97 g, s = 4 g, μ₀ = 100 g, left-tailed):
- Correct test: One-sample t-test (left-tailed). Reason: σ is unknown (only s = 4 g given), n < 30. Use t with df = 9.
- H₀: μ = 100; Ha: μ < 100.
- The engineer's suspicion is directional ("below the claim"), so left-tailed is appropriate.
Scenario B (Pet ownership; n = 120, x = 54, p₀ = 0.40, two-tailed):
- Correct test: One-sample z-test for a proportion (two-tailed). Reason: the question is about a population proportion, not a mean. Conditions: \(np_0 = 48 \geq 5\) ✓; \(n(1-p_0) = 72 \geq 5\) ✓.
- H₀: p = 0.40; Ha: p ≠ 0.40.
Common mistake: Choosing the t-test for Scenario B because "σ is unknown." The t-vs-z distinction applies only to tests about a population mean, not a proportion. For proportions (when conditions are met), always use the z-distribution.
Problem 4 — CI Equivalence: Same Conclusion, Two Methods
Scenario: n = 16, x̄ = 2150 kcal, s = 300 kcal, μ₀ = 2000, α = 0.05 (two-tailed), df = 15. (This is the same setup as the lesson's Section 4 Example 1.)
Method 1 — Hypothesis Test (p-value approach):
- \(\text{SE} = 300 / \sqrt{16} = 300 / 4 = 75\) kcal
- \(t = (2150 - 2000) / 75 = 150 / 75 = 2.00\)
- t-table, df = 15: \(t^* = 2.131\) at two-tail 0.05; \(t^* = 1.753\) at two-tail 0.10. Since \(1.753 < 2.00 < 2.131\): \(0.05 < p < 0.10\).
- Decision: \(p > 0.05\). Fail to reject H₀.
Method 2 — 95% Confidence Interval:
- \(t^* = 2.131\) for df = 15, 95% confidence (same critical value as the two-tail 0.05 test)
- \(E = 2.131 \times 75 = 159.8\) kcal
- \(\text{CI} = 2150 \pm 159.8 = (1990.2,\; 2309.8)\) kcal
- \(\mu_0 = 2000\) falls inside \((1990.2,\; 2309.8)\) since \(1990.2 < 2000 < 2309.8\). → Fail to reject H₀. ✓
Why they agree: For a two-tailed test at level α, the null value μ₀ falls outside the (1 − α) CI if and only if p < α. When p is in (0.05, 0.10), μ₀ is inside the 95% CI — both conclude "fail to reject." This equivalence always holds for two-tailed tests.
Variant C from Problem 1 (n = 16, x̄ = 37.3°C, s = 0.5°C, μ₀ = 37.0°C) provides a "reject" example for comparison:
- Test: t = 2.40 > 2.131, so p < 0.05 → Reject H₀.
- \(\text{CI} = 37.3 \pm 2.131 \times 0.125 = 37.3 \pm 0.266 = (37.034,\; 37.566)\)
- μ₀ = 37.0 falls outside (37.034, 37.566) → CI also concludes: reject H₀. ✓
Common mistake: Using \(\hat{p}\) (or x̄) instead of \(\mu_0\) when checking whether the CI excludes the null. The check is: does μ₀ fall inside or outside the interval? The CI is centered at x̄, not at μ₀.