D.25 Answers: Tests for one mean
Answers to exercises in Sect. 27.13.
Answer to Exercise 27.1:
1. \(H_0\): \(\mu=7725\); \(H_1\): \(\mu\ne7725\) (two tailed).
2. \(\bar{x} = 6753.64\) and \(\text{s.e.}(\bar{x}) = s/\sqrt{n} = 1142.123/\sqrt{11} = 344.363\).
3. \(t = (6753.64 - 7725)/344.363 = -2.821\), as in output.
This ‘large’; expect small \(P\)-value; software confirms this: two-tailed \(P=0.018\).
4. Moderate evidence (\(P = 0.018\)) that the mean energy intake
is not meeting the recommended daily energy intake
(mean: 6753.6kJ; std. dev.: 1142.1kJ).
Answer to Exercise 27.2:
\(H_0\): \(\mu=120\) and \(H_1\): \(\mu\ne 120\) (two-tailed), where \(\mu\) is the mean time in seconds.
Standard error: \(\text{s.e.}(\bar{x}) = 23.8/\sqrt{85} = 2.581472\).
\(t\)-score: \((60.3 - 120)/2.581472 = -23.13\),
which is huge; \(P\)-value will be really small.
Very strong evidence (\(P<0.001\)) that children do not spend 2 minutes (on average)
brushing their teeth (mean: 60.3s; std. dev.: 23.8s).
Answer to Exercise 27.3:
\(H_0\): \(\mu=50\) and \(H_1\): \(\mu>50\) (one-tailed), where \(\mu\) is the mean mental demand.
Standard error: \(\text{s.e.}(\bar{x}) = 22.05/\sqrt{22} = 4.701076\).
\(t\)-score: \((84 - 50)/4.701076 = 7.23\),
which is very large; \(P\)-value will be very small.
Very strong evidence (\(P<0.001\)) that the mean mental demand is greater than 50.
(Notice we say greater than, because of the RQ and the alternative hypothesis.)
Answer to Exercise 27.4:
Physical:* \(t = -1.28\);
Mental:* \(t = 1.80\).
The \(P\)-values both larger than 5%.
No evidence that the mean score for patients is different than the general population score.
Answer to Exercise 27.5:
\(H_0\): \(\mu=12\) and \(H_1\): \(\mu\ne 12\) (two-tailed), where \(\mu\) is the mean weight in grams.
Standard error: \(\text{s.e.}(\bar{x}) = 0.60652/\sqrt{43} = 0.09249343\).
\(t\)-score: \((14.9577 - 12)/0.09249343 = 31.98\),
which is huge; \(P\)-value will be very small.
Very strong evidence (\(P<0.001\)) that the mean weight of a Fun Size Cherry Ripe bar
is not 12 grams (mean: 14.9577; std. dev.: 0.067g),
and they may be larger.
Answer to Exercise 27.6:
\(H_0\): \(\mu=1000\) and \(H_1\): \(\mu\ne1000\),
where \(\mu\) is the population mean guess of the spill volume.
Standard error: 46.15526.
\(t\)-score: \((846.4 - 1000)/46.15526 = -3.33\),
which is very large (and negative),
so the \(P\)-value will be very small.
Very strong evidence that the mean guess
of blood volume is not 1000,ml, the actual value.
The sample is much larger than 25:
the test is statistically valid.
Answer to Exercise 27.7: Hypotheses have the form \(H_0\): \(\mu=\text{pre-determined target}\), and \(H_1\): \(\mu\ne\text{pre-determined target}\). \(t\)-scores: \(t_1 = 0.318\), \(t_2 = 2.347\), \(t_3 = -0.466\), \(t_4 = -0.726\). \(P\)-values will be large, except for second test. No evidence that the instruments are dodgy, except perhaps for the first instrument for mid-level LH concentrations. Should be statistically valid.
While assessing the means is useful, how variable the measurements are is also useful (but beyond us).