Guided Practice 1: Computing Z-Scores (Variant Bank)
Context for all variants: \( \mu = 120 \) calls, \( \sigma = 15 \) calls. Formula: \( z = \dfrac{x - \mu}{\sigma} \).
Variant A (147 calls):
\[ z = \frac{147 - 120}{15} = \frac{27}{15} = +1.80 \]
This agent handled 1.80 standard deviations more calls than the daily mean — a notably high-volume day.
Variant B (105 calls):
\[ z = \frac{105 - 120}{15} = \frac{-15}{15} = -1.00 \]
This agent handled exactly 1 standard deviation fewer calls than the daily mean — below average but not unusual.
Variant C (120 calls):
\[ z = \frac{120 - 120}{15} = \frac{0}{15} = 0.00 \]
This agent handled exactly the mean number of calls — right at average.
Variant D (93 calls):
\[ z = \frac{93 - 120}{15} = \frac{-27}{15} = -1.80 \]
This agent handled 1.80 standard deviations fewer calls than the mean — a notably low-volume day, symmetrically opposite to Variant A.
Variant E (158 calls):
\[ z = \frac{158 - 120}{15} = \frac{38}{15} \approx +2.53 \]
This agent handled 2.53 standard deviations more calls than the mean — an unusually high-volume day that would be flagged as exceptional.
Common mistakes: (1) Forgetting to subtract the mean before dividing — dividing the raw score by \( \sigma \) gives a meaningless ratio. (2) Dropping the sign — the sign carries essential directional information. A negative z-score means below the mean, not an error. (3) Using \( s \) for a population problem or \( \sigma \) for a sample problem.
Guided Practice 2: Interpreting Percentiles (MCQ)
Correct answer: "A student who scored 68 performed better than approximately 50% of all test-takers."
Why the other options are wrong:
- Option B ("in the top 68%"): Confuses the raw score value (68 points) with the percentile rank. This student is at \( P_{50} \), so 50% scored below them — not 68%.
- Option C ("a student at P75 scored 75 out of 100"): \( P_{75} = 79 \) means the value at the 75th percentile is 79 points. The subscript \( k \) in \( P_k \) is the percentage of data below, never the value itself.
- Option D ("the median is 79"): \( P_{50} = 68 \) is the median. The median is the 50th percentile, not the 75th.
Guided Practice 3: Empirical Rule Application (Variant Bank)
Context for all variants: body temperatures approximately normally distributed with \( \mu = 37.0 \)°C, \( \sigma = 0.4 \)°C.
Variant A — Between 36.2°C and 37.8°C:
\( 36.2 = 37.0 - 0.8 = \mu - 2\sigma \) and \( 37.8 = 37.0 + 0.8 = \mu + 2\sigma \).
By the Empirical Rule: approximately 95% of healthy adults have a temperature in this range.
Variant B — Between 36.6°C and 37.4°C:
\( 36.6 = 37.0 - 0.4 = \mu - \sigma \) and \( 37.4 = 37.0 + 0.4 = \mu + \sigma \).
By the Empirical Rule: approximately 68% of healthy adults have a temperature in this range.
Variant C — 38.2°C, how many SDs above the mean?
\[ z = \frac{38.2 - 37.0}{0.4} = \frac{1.2}{0.4} = +3.00 \]
This temperature is exactly 3 SDs above the mean. By the Empirical Rule, only approximately 0.15% of adults have a temperature this high — highly unusual, consistent with a fever.
Variant D — Percentage above 37.4°C:
\( 37.4 = \mu + \sigma \). Approximately 68% fall within 1 SD, so approximately 32% fall outside. By symmetry, approximately \( 32\%/2 = 16\% \) are above \( \mu + \sigma = 37.4 \)°C.
Variant E — Interval containing approximately 99.7% of adults:
\( \mu - 3\sigma = 37.0 - 1.2 = 35.8 \)°C and \( \mu + 3\sigma = 37.0 + 1.2 = 38.2 \)°C.
Approximately 99.7% of adults have temperatures between 35.8°C and 38.2°C.
Common mistakes: (1) Applying the Empirical Rule to a non-normal distribution. (2) Forgetting that "within 1 SD" means 68% total — some students take 68% as the one-sided area. (3) For one-sided tails: always halve the outside percentage by symmetry (32%/2 = 16%, not 32%).
Guided Practice 4: Classifying Distribution Shape (MCQ)
Correct answer: "Right-skewed; the median is a more appropriate measure of centre."
Reasoning: Mean (\$112,000) > Median (\$84,000). When the mean is pulled above the median, high-value outliers are dragging it to the right — the distribution is right-skewed. For right-skewed data, the median is resistant to extreme values and is more representative of the typical salary than the mean.
Why the other options are wrong:
- Left-skewed / mean appropriate: In left-skewed data, mean < median (the mean is pulled left by low outliers). Here mean > median, so the direction is wrong and the measure-of-centre recommendation is wrong.
- Right-skewed / mean appropriate: The direction is correct, but for right-skewed data with extreme high values, the mean overstates the typical salary. The median is the better choice.
- Symmetric: If symmetric, mean ≈ median. A difference of \$28,000 between mean and median is too large to be dismissed as rounding — this is clear right skew driven by very high executive salaries.