EN FR

INF-3 Solutions: Confidence Intervals for a Population Mean (Small Sample)

Solutions Reference · ← Back to Lesson INF-3

Section 5 — Guided Practice Solutions

GP1 — t-Interval Construction (Variants A–E)

Each variant follows the same four-step workflow: compute df → look up t* → compute SE and E → state the interval.

Variant A (sleep duration, \(\bar{x} = 7.2\) hrs, \(s = 0.9\) hrs, \(n = 12\), 95% CI):

Variant B (plant growth, \(\bar{x} = 14.3\) cm, \(s = 2.1\) cm, \(n = 8\), 90% CI):

Variant C (blood glucose, \(\bar{x} = 95.4\) mg/dL, \(s = 12.7\) mg/dL, \(n = 15\), 99% CI):

Variant D (reaction time, \(\bar{x} = 285\) ms, \(s = 42\) ms, \(n = 10\), 95% CI):

Variant E (protein content, \(\bar{x} = 8.7\) g, \(s = 1.2\) g, \(n = 6\), 90% CI):

Common error — df = n: Using \(df = n\) instead of \(df = n - 1\) yields a t* that is too small (closer to z*), producing an interval that is too narrow. Always subtract 1 because one degree of freedom is spent estimating \(\sigma\) with \(s\).


GP2 — When to Use t vs. z (Variants A–E)

The decision rule: if \(\sigma\) is known, use z regardless of n. If \(\sigma\) is unknown and \(n < 30\), use t. If \(\sigma\) is unknown but \(n \geq 30\), both are acceptable (z is most common).

Variant A (nutritionist, n = 20, σ unknown): Use t. σ unknown and n < 30. Correct interpretation: "We are 95% confident the true mean sodium content lies between [L] and [U] mg — this describes the reliability of the procedure, not the probability that μ is in this specific interval."

Variant B (factory QC, σ = 2.3 g known, n = 15): Use z. σ is known, so the extra uncertainty of estimating it is absent. We use z* = 1.96 (95% CI), not a t-value.

Variant C (researcher, n = 45, σ unknown): Use z (or t — both acceptable). With n ≥ 30 and σ unknown, the CLT justifies z, but t with df = 44 is equally valid (they converge). In practice, z is most commonly reported here.

Variant D (pharmacist, n = 8, σ unknown, population normal): Use t. Small n, σ unknown. The normality assumption for the population makes the t-interval valid even at n = 8.

Variant E (historian, n = 12, σ unknown): Use t. df = 11. Correct interpretation: "If we repeated this sampling procedure many times, about 90% of the resulting confidence intervals would capture the true mean — this one may or may not."


GP3 — z vs. t Width Comparison

Data: \(\bar{x} = 42.6\), \(s = 5.8\), \(n = 16\), 95% CI.

(a) The t-interval is wider (width 6.180 vs. 5.684).

(b) The t-interval is wider because \(t^* = 2.131 > z^* = 1.96\) — heavier tails of the t-distribution require pulling the critical values farther out to capture the same probability.

(c) The researcher should report the t-interval. Using z* with unknown σ underestimates uncertainty and produces false precision. The t-interval is honest about what the data actually tell us.


GP4 — Generated Problems

Solutions for GP4 are embedded in the generator output. The key steps are always:

  1. Compute \(df = n - 1\).
  2. Use the t* provided in the problem (it comes from the T_CRIT table for that df and confidence level).
  3. Compute \(SE = s/\sqrt{n}\), then \(E = t^* \times SE\).
  4. State the interval: \(\bar{x} \pm E\).

Section 6 — Independent Practice Solutions

IP1 — Full t-Interval Construction (Variants A–E)

For each variant, always check conditions first: (1) population approximately normal or n large enough, (2) σ unknown, (3) independent random sample.

Variant A (ER wait times, \(\bar{x} = 47.3\) min, \(s = 12.4\) min, \(n = 18\), 95% CI):

Variant B (calcium content, \(\bar{x} = 84.6\) mg/L, \(s = 9.3\) mg/L, \(n = 11\), 90% CI):

Variant C (sleep hours, \(\bar{x} = 6.8\) hrs, \(s = 1.1\) hrs, \(n = 20\), 99% CI):

Variant D (weight loss, \(\bar{x} = 3.7\) kg, \(s = 1.6\) kg, \(n = 25\), 95% CI):

Variant E (test scores, \(\bar{x} = 68.4\), \(s = 8.9\), \(n = 14\), 90% CI):


IP2 & IP3 — Generated Problems

Solutions for IP2 and IP3 are embedded in each generator's output. For IP3: if σ is known, use z regardless of n; if σ is unknown and n < 30, use t with df = n − 1.


IP4 — Find the Error (Variants A–E)

Variant A (z* used with n = 12 and unknown σ): Error — when σ is unknown and n < 30, the correct distribution is t, not z. Using z* = 1.96 ignores the extra variability from estimating σ with s, producing an interval that is too narrow.

Variant B (df = n = 18 instead of df = 17): Error — degrees of freedom should be \(df = n - 1 = 17\), not n = 18. Using df = 18 yields a t* that is slightly too small, producing an overconfident (too-narrow) interval.

Variant C (P(\mu \in [L, U]) = 0.95): Error — probability trap. Once the interval is computed, μ is a fixed constant; the probability is either 0 or 1. The 95% refers to the long-run frequency of the procedure — about 95% of all such intervals built from repeated samples would capture the true μ.

Variant D ("the t-interval is wrong because it's wider"): Error — the t-interval is supposed to be wider. The t-distribution has heavier tails than the normal precisely because we're estimating σ with s. The extra width is honest: it correctly accounts for the additional uncertainty. A narrower interval here would be overconfident.

Variant E (t-interval used with n = 50 and σ known): Error — when σ is known, always use z regardless of sample size. The t-distribution is only needed when σ must be estimated from the data. Using t when σ is known is conservative but technically incorrect.


IP5 — Synthesis: Lead Concentration

Data: \(n = 22\) water samples, \(\bar{x} = 8.4\) ppb, \(s = 3.1\) ppb, \(\sigma\) unknown.

Part (a) — 95% t-interval:

Part (b) — z-based comparison (incorrect):

Part (c) — Why the distinction matters: With n = 22, t* = 2.080 while z* = 1.96 — a 6% difference in the critical value. For smaller samples this gap grows dramatically (e.g., at n = 6, t* ≈ 2.571 vs. z* = 1.96 for 95% CI — a 31% difference). Reporting the z-interval here understates uncertainty by assuming σ is known precisely. If this data were being used to compare lead levels against a safety threshold, using the incorrectly narrow z-interval could lead to a false conclusion that concentrations are safe when the honest t-interval suggests otherwise.

Section 7 — Mastery Check Solutions

Feynman Test — Explain t vs. z + Why df = n − 1

Model answer: The z-distribution assumes we know the population standard deviation σ exactly. In most real studies, σ is unknown and we estimate it from the sample using s. This introduces a second source of variability: not only does x̄ vary from sample to sample, but so does s. The t-distribution accounts for this by having heavier tails — giving wider critical values — which translates into wider (more honest) confidence intervals.

We use df = n − 1 rather than n because once we've computed the sample mean x̄, the n deviations \(x_i - \bar{x}\) are constrained: they must sum to zero. This means only n − 1 of them can vary freely. One degree of freedom is "spent" computing x̄, leaving n − 1 independent pieces of information about the spread. As df → ∞, this constraint becomes negligible and the t-distribution converges to the standard normal.


Apply Question — Energy Bars

Data: \(n = 9\), \(\bar{x} = 3.2\) g protein, \(s = 0.6\) g, 99% CI, \(\sigma\) unknown.

Interpretation: We are 99% confident the true mean protein content is between 2.529 g and 3.871 g per bar.


Error Analysis — Wrong df

The researcher used df = 15 but the sample has n = 15, so the correct df is 14, not 15.

With df = 14 and 90% CI: \(t^* = 1.761\) (vs. 1.753 they used with df = 15).

Section 8 — Boss Fight Solutions

Path A — The Analyst: Lake Champlain pH Study

Data: \(\bar{x} = 4.79\), \(s = 0.33\), \(n = 15\), 95% CI, \(\sigma\) unknown.

Step 1 — Check conditions:

Step 2 — Compute the CI:

Step 3 — Does the interval suggest pH < 4.5? No. The entire interval (4.607, 4.973) lies above 4.5. If the true mean were below 4.5, we'd expect our sample mean to fall lower than 4.79 and the CI to include values below 4.5. This interval gives no evidence that the lake pH is critically low.

Step 4 — Interpretation: "We are 95% confident the true mean pH of Lake Champlain at this location is between 4.607 and 4.973. The interval lies entirely above the acidic threshold of 4.5, suggesting the lake is not in the critical range — though the lower bound of 4.61 is still quite acidic relative to neutral pH 7."


Path B — The Architect: Critique and Redesign

The junior researcher used z* = 1.96 with n = 20 and unknown σ.

Error identification: When σ is unknown and n < 30, the t-distribution must be used. Using z* = 1.96 ignores the extra uncertainty from estimating σ with s, producing an interval that is too narrow (overconfident).

Corrected CI: Data: \(\bar{x} = 9.3\) days, \(s = 3.8\) days, \(n = 20\), 90% CI.

Comparison: The researcher's z-based CI would be: \(E_z = 1.645 \times 0.850 = 1.399\), giving (7.901, 10.699). The corrected t-interval is wider by 0.071 days on each side (making the entire interval 0.142 days wider in total width) — a meaningful difference when reporting clinical recovery times. The z-interval gives a false sense of precision not warranted by these data.

Why it matters here: For medical recovery data, understating uncertainty could affect treatment protocol decisions. The additional width of the t-interval honestly reflects that with only 20 patients and no known σ, there is more uncertainty than the z-interval admits.

Section 9 — Challenge Problems Solutions

C1 — Why df = n − 1 (Intuitive Derivation)

Suppose you have n = 4 data values and you've computed \(\bar{x} = 10\). The deviations from the mean must sum to zero: \((x_1 - \bar{x}) + (x_2 - \bar{x}) + (x_3 - \bar{x}) + (x_4 - \bar{x}) = 0\).

If you know x₁ = 8, x₂ = 11, x₃ = 13 and the mean is 10, then x₄ is forced: x₄ = 4×10 − 8 − 11 − 13 = 8. You have no freedom to choose x₄.

In general: once you fix x̄ and the first n − 1 values, the last value is completely determined. Only n − 1 of the n deviations are free to vary independently. Therefore the sample variance s² — which is built from these deviations — has only n − 1 degrees of freedom.

When we plug s into the CI formula to substitute for σ, we're using a quantity with n − 1 independent pieces of information. The t-distribution with df = n − 1 correctly accounts for this reduced independence.


C2 — t-Table Convergence

t* values at 95% confidence:

dft* (95%)Difference from z* = 1.96
112.706+10.746
52.571+0.611
102.228+0.268
202.086+0.126
292.045+0.085
1.9600.000

Key observation: Even at df = 29 (n = 30), t* = 2.045 remains meaningfully larger than z* = 1.96 — a 4.3% difference. At df = 5, the gap is 31%. The convergence is slow, especially in the lower tail region that matters for critical values. This is why the t vs. z distinction remains practically important even at moderate sample sizes, not just at n < 10.

At what df does t* first drop below 2.05? From our table: df = 25 gives t* = 2.060, df = 29 gives t* = 2.045 — so somewhere around df = 28–29. Full t-tables show the crossover below 2.05 occurs at approximately df = 26 (t* = 2.056). The precise value where t* = z* = 1.960 is only reached at df = ∞.


C3 — Overlapping CIs Preview (Variants A–E)

This challenge previews INF-6 (two-sample inference). The key intuition: non-overlapping CIs are strong evidence that the two population means differ; overlapping CIs suggest the means may be similar, but overlap alone does not confirm equivalence. Formal tests (two-sample t) are needed for definitive conclusions.

For each variant:

This is a conceptual preview — no specific arithmetic is required for these variants. Focus on the logic: confidence intervals describe plausible values for each mean; comparison requires reasoning about whether those plausible ranges share common ground.