4.2 t-tests
4.2.1 Independent samples t-test
Say we want to test whether men and women differ in the degree to which they are dominant. Let’s create a boxplot first and then check the means and the standard deviations:
ggplot(data = powercc, mapping = aes(x = gender, y = dominance)) +
  geom_boxplot()
powercc %>%
  group_by(gender) %>%
  summarize(mean_dominance = mean(dominance),
            sd_dominance = sd(dominance))
## # A tibble: 2 x 3
##   gender mean_dominance sd_dominance
##   <chr>           <dbl>        <dbl>
## 1 female          0.614        0.247
## 2 male            0.646        0.296
Men score slightly higher than women, but we want to know whether this difference is significant. An independent samples t-test can provide the answer (the men and the women in our experiment are the independent samples), but we need to check an assumption first: are the variances of the two independent samples equal?
install.packages("car") # for the test of equal variances, we need a package called car
library(car)
# Levene's test of equal variances.
# Low p-value means the variances are not equal.
# First argument = continuous dependent variable, second argument = categorical independent variable.
leveneTest(powercc$dominance, powercc$gender)
## Levene's Test for Homogeneity of Variance (center = median)
##        Df F value Pr(>F)
## group   1  2.1915  0.141
##       141
The null hypothesis of equal variances is not rejected (p = 0.14), so we can continue with a t-test that assumes equal variances:
# Test whether the means of dominance differ between genders.
# Indicate whether the test should assume equal variances or not (set var.equal = FALSE for a test that does not assume equal variances).
t.test(powercc$dominance ~ powercc$gender, var.equal = TRUE)
##
## Two Sample t-test
##
## data: powercc$dominance by powercc$gender
## t = -0.6899, df = 141, p-value = 0.4914
## alternative hypothesis: true difference in means between group female and group male is not equal to 0
## 95 percent confidence interval:
## -0.12179092 0.05877722
## sample estimates:
## mean in group female mean in group male
## 0.6142857 0.6457926
You could report this as follows: “Men (M = 0.65, SD = 0.30) and women (M = 0.61, SD = 0.25) did not differ significantly in the degree to which they rated themselves as dominant (t(141) = -0.69, p = 0.49).”
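To make the numbers in the t-test output less of a black box, here is a minimal sketch that computes the equal-variance t statistic by hand and compares it with t.test(). It uses a small simulated data frame called toy, which is hypothetical and not the powercc data, so the values will differ from the output above.

```r
# Hypothetical toy data (NOT the powercc data set), simulated for illustration.
set.seed(1)
toy <- data.frame(
  gender = rep(c("female", "male"), times = c(12, 10)),
  dominance = c(rnorm(12, mean = 0.60, sd = 0.25),
                rnorm(10, mean = 0.65, sd = 0.30))
)

x <- toy$dominance[toy$gender == "female"]
y <- toy$dominance[toy$gender == "male"]
n_x <- length(x)
n_y <- length(y)

# Pooled variance: weighted average of the two sample variances.
pooled_var <- ((n_x - 1) * var(x) + (n_y - 1) * var(y)) / (n_x + n_y - 2)

# t statistic: difference in means divided by its standard error.
t_manual <- (mean(x) - mean(y)) / sqrt(pooled_var * (1 / n_x + 1 / n_y))
df_manual <- n_x + n_y - 2
p_manual <- 2 * pt(-abs(t_manual), df = df_manual) # two-sided p-value

# The built-in test with var.equal = TRUE should give the same numbers.
res <- t.test(dominance ~ gender, data = toy, var.equal = TRUE)
all.equal(unname(res$statistic), t_manual)
all.equal(res$p.value, p_manual)
```

The sign of t depends on the order of the groups: R sorts the group labels alphabetically, so the statistic is the female mean minus the male mean.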
4.2.2 Dependent samples t-test
Say we want to test whether people are more willing to spend on conspicuous items than on inconspicuous items. Let’s check the means and the standard deviations first:
powercc %>% # no need to group! we're not splitting up our sample into subgroups
  summarize(mean_cc = mean(cc), sd_cc = sd(cc),
            mean_icc = mean(icc), sd_icc = sd(icc))
## # A tibble: 1 x 4
##   mean_cc sd_cc mean_icc sd_icc
##     <dbl> <dbl>    <dbl>  <dbl>
## 1    6.01  1.05     3.60  0.988
The means are higher for conspicuous products than for inconspicuous products, but we want to know whether this difference is significant and therefore perform a dependent samples t-test (each participant rates both conspicuous and inconspicuous products, so these ratings are dependent):
t.test(powercc$cc, powercc$icc, paired = TRUE) # Test whether the means of cc and icc are different. Indicate that this is a dependent samples t-test with paired = TRUE.
##
## Paired t-test
##
## data: powercc$cc and powercc$icc
## t = 25.064, df = 142, p-value < 2.2e-16
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 2.214575 2.593816
## sample estimates:
## mean of the differences
## 2.404196
You could report this as follows: “People indicated they were willing to pay more (t(142) = 25.06, p < .001) for conspicuous products (M = 6.01, SD = 1.05) than for inconspicuous products (M = 3.60, SD = 0.99).”
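A dependent samples t-test is the same thing as a one sample t-test on the per-participant differences, which is why the output reports a "mean of the differences". The sketch below illustrates this with simulated ratings. The toy data frame is hypothetical and only borrows the column names cc and icc from the example.

```r
# Hypothetical toy data (NOT the powercc data set), simulated for illustration.
set.seed(2)
toy <- data.frame(cc = rnorm(15, mean = 6, sd = 1))
toy$icc <- toy$cc - rnorm(15, mean = 2.4, sd = 1)

# One difference score per participant.
diffs <- toy$cc - toy$icc

# Paired test on the two columns ...
paired <- t.test(toy$cc, toy$icc, paired = TRUE)
# ... and a one sample test of the differences against 0.
one_sample <- t.test(diffs, mu = 0)

# t by hand: mean difference divided by its standard error.
t_manual <- mean(diffs) / (sd(diffs) / sqrt(length(diffs)))

# All three routes should agree.
all.equal(unname(paired$statistic), unname(one_sample$statistic))
all.equal(unname(paired$statistic), t_manual)
all.equal(paired$p.value, one_sample$p.value)
```

The degrees of freedom are the number of participants minus one, because each participant contributes a single difference score.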
4.2.3 One sample t-test
Say we want to test whether the average willingness to pay for the conspicuous items was significantly higher than 5 (the midpoint of the scale):
t.test(powercc$cc, mu = 5) # Indicate the variable whose mean we want to compare with a specific value (5).
##
## One Sample t-test
##
## data: powercc$cc
## t = 11.499, df = 142, p-value < 2.2e-16
## alternative hypothesis: true mean is not equal to 5
## 95 percent confidence interval:
## 5.833886 6.180100
## sample estimates:
## mean of x
## 6.006993
It’s indeed significantly higher than 5. You could report this as follows: “The average WTP for conspicuous products (M = 6.01, SD = 1.05) was significantly above 5 (t(142) = 11.50, p < .001).”
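By default, t.test() runs a two-sided test (the output says "true mean is not equal to 5"). If the hypothesis is directional, as in "higher than 5", the alternative argument of t.test() lets you request a one-sided test. The sketch below uses a simulated vector wtp, which is hypothetical and not the powercc data.

```r
# Hypothetical willingness-to-pay ratings, simulated for illustration.
set.seed(3)
wtp <- rnorm(20, mean = 6, sd = 1)

# Two-sided test (the default): is the mean different from 5?
two_sided <- t.test(wtp, mu = 5)

# One-sided test: is the mean greater than 5?
one_sided <- t.test(wtp, mu = 5, alternative = "greater")

# The t statistic and degrees of freedom are identical in both tests.
all.equal(unname(two_sided$statistic), unname(one_sided$statistic))

# Only the p-value changes: the one-sided p-value is the upper-tail
# probability of the observed t.
p_upper <- pt(unname(one_sided$statistic), df = unname(one_sided$parameter),
              lower.tail = FALSE)
all.equal(one_sided$p.value, p_upper)
```

When the sample mean lies above the reference value, the one-sided p-value is half the two-sided one, so a result that is significant in the two-sided test is also significant in the one-sided test in that direction.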