7 Repeated Sample t-test

In the last chapter, we assumed the two groups were independent. That is, each bag of chips was independent of the other. However, sometimes we are interested in change over time or measuring the a similar variables within individuals. While many of the steps we took in the last chapter remain the same (if you skipped it, please go back!), there are some important differences.

7.1 Creatine and Muscles

You have a theory that exercise converts adenosine triphosphate (ATP) into adenosine diphosphate (ADP). When we ingest creatine phosphate, we convert ADP back into ATP and can exercise for longer. Imagine you are interested in the impact of a new type of creatine on how much weight someone can lift. You hypothesise that individuals will lift a different amount after ingesting creatine regularly for one month. Formally, we might consider the hypothesis:

$H 0 : μ_{d} = 0$ where d represent the difference in scores within a pair $H 1 : μ_{d} \neq 0$

You decide to recruit 10 individuals from the local gym, measure how much they can bench press. Then, you give them 5g each day for one month and then re-measure their bench press strength. This pre-post design results in the following data:

kable(creatine) %>% 
  kable_material()

ID	Bench1	Bench2
1	137	147
2	180	177
3	169	187
4	155	174
5	172	186
6	170	167
7	154	175
8	144	155
9	172	199
10	160	155

The t-statistic is: $t = \frac{Δ {\overset{―}{x}}_{i}}{s e_{d i f f}}$

Our next step is to compute the difference scores. Note that higher difference scores reflect lifing more at T2, after ingesting the creatine:

kbl(creatine) %>% 
  kable_material()

ID	Bench1	Bench2	Difference
1	137	147	10
2	180	177	-3
3	169	187	18
4	155	174	19
5	172	186	14
6	170	167	-3
7	154	175	21
8	144	155	11
9	172	199	27
10	160	155	-5

Our difference scores are simply one score subtract the other:

$d i f f = Δ x_{i} = x_{i 1} - x_{i 2}$

The mean of our differences scores above is ${\overset{―}{x}}_{d i f f} =$ 10.9.

We can calculate the standard error of our differences scores in a similar way as the last chapter:

$s e_{d} = \frac{\sum (d_{i} - {\overset{―}{x}}_{d i f f})^{2} \frac{1}{N}}{\sqrt{n}}$

Calculating our squared difference scores, we get:

creatine %>% 
  select(ID, Difference, square_diff) %>% 
  kable() %>% 
  kable_styling(full_width = F)

ID	Difference	square_diff
1	10	0.81
2	-3	193.21
3	18	50.41
4	19	65.61
5	14	9.61
6	-3	193.21
7	21	102.01
8	11	0.01
9	27	259.21
10	-5	252.81

and the resulting sum of the squared differences is 1126.9. The sd of the difference scores is 11.1897771. Therefore:

$s e_{d i f f} = \frac{\sqrt{1126.9 \frac{1}{10 - 1}}}{\sqrt{10}} = 3.5385$

and:

$t = \frac{10.9}{3.5385} = 3.08$

We can check our results with a formal analysis in R. We would need to specify the argument paired = TRUE.

t.test(data=creatine_long, Weight~Time, paired=T)


    Paired t-test

data:  Weight by Time
t = -3.0804, df = 9, p-value = 0.01313
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -18.904684  -2.895316
sample estimates:
mean of the differences 
                  -10.9

Thus, we would conclude that our data regarding changes in bench press weight lifted is unlikely given a true null hypothesis, t = 3.08, p = 013.

7.2 Practice Problem

Practice Problem: You are a researchers for Clearly Contact Lenses. You are asked to determine if their contacts improve the confidence of their users. You recruit 8 glasses wearers and measure their confidence. Then, clearly provide the glasses wearers contact lenses. After one month of wearing contacts, you re-assess the individuals’ confidence rating. The following data are obtained:

ID	Glasses	Contacts
1	52	52
2	46	52
3	59	60
4	68	64
5	60	59
6	61	63
7	47	42
8	60	61

Paired sample t-tests have more statistical power when compared to independent samples t-test, given the same number of observations.

7.3 Conclusion

Test	Used for	Hypothesis	Formula
independent t-test	Testing if two group means are the same	$H 0 : μ_{1} = μ_{2}$	$t = \frac{\bar{x_{1}} - \bar{x_{2}}}{\sqrt{\frac{s_{p}^{2}}{n_{1}} + \frac{s_{p}^{2}}{n_{2}}}}$
paired t-test	Testing changes in mean score	$H 0 : μ_{d i f f} = 0$	$t = \frac{Δ x_{i}}{s e_{d i f f}}$