aov.w {misty} | R Documentation |
Repeated Measures Analysis of Variance (Within-Subject ANOVA)
Description
This function performs an one-way repeated measures analysis of variance (within subject ANOVA) including paired-samples t-tests for multiple comparison and provides descriptive statistics, effect size measures, and a plot showing error bars for difference-adjusted Cousineau-Morey within-subject confidence intervals with jittered data points including subject-specific lines.
Usage
aov.w(formula, data, print = c("all", "none", "LB", "GG", "HF"),
posthoc = FALSE, conf.level = 0.95,
p.adj = c("none", "bonferroni", "holm", "hochberg", "hommel", "BH", "BY", "fdr"),
hypo = TRUE, descript = TRUE, epsilon = TRUE, effsize = FALSE,
na.omit = TRUE, plot = FALSE, point.size = 4, adjust = TRUE,
error.width = 0.1, xlab = NULL, ylab = NULL, ylim = NULL,
breaks = ggplot2::waiver(), jitter = TRUE, line = TRUE,
jitter.size = 1.25, jitter.width = 0.05, jitter.height = 0,
jitter.alpha = 0.1, title = "", subtitle = "Confidence Interval",
digits = 2, p.digits = 4, as.na = NULL, write = NULL, append = TRUE,
check = TRUE, output = TRUE, ...)
Arguments
formula |
a formula of the form |
data |
a matrix or data frame containing the variables in the
formula |
print |
a character vector indicating which sphericity correction
to use, i.e., |
posthoc |
logical: if |
conf.level |
a numeric value between 0 and 1 indicating the confidence level of the interval. |
p.adj |
a character string indicating an adjustment method for
multiple testing based on |
hypo |
logical: if |
descript |
logical: if |
epsilon |
logical: if |
effsize |
logical: if |
na.omit |
logical: if |
plot |
logical: if |
point.size |
a numeric value indicating the |
adjust |
logical: if |
error.width |
a numeric value indicating the horizontal bar width of the error bar. |
xlab |
a character string specifying the labels for the x-axis. |
ylab |
a character string specifying the labels for the y-axis. |
ylim |
a numeric vector of length two specifying limits of the limits of the y-axis. |
breaks |
a numeric vector specifying the points at which tick-marks are drawn at the y-axis. |
jitter |
logical: if |
line |
logical: if |
jitter.size |
a numeric value indicating the |
jitter.width |
a numeric value indicating the amount of horizontal jitter. |
jitter.height |
a numeric value indicating the amount of vertical jitter. |
jitter.alpha |
a numeric value indicating the opacity of the jittered data points. |
title |
a character string specifying the text for the title for the plot. |
subtitle |
a character string specifying the text for the subtitle for the plot. |
digits |
an integer value indicating the number of decimal places to be used for displaying descriptive statistics and confidence interval. |
p.digits |
an integer value indicating the number of decimal places to be used for displaying the p-value. |
as.na |
a numeric vector indicating user-defined missing values,
i.e. these values are converted to |
write |
a character string naming a text file with file extension
|
append |
logical: if |
check |
logical: if |
output |
logical: if |
... |
further arguments to be passed to or from methods. |
Details
- Sphericity
The F-Test of the repeated measures ANOVA is based on the assumption of sphericity, which is defined as the assumption that the variance of differences between repeated measures are equal in the population. The Mauchly's test is commonly used to test this hypothesis. However, test of assumptions addresses an irrelevant hypothesis because what matters is the degree of violation rather than its presence (Baguley, 2012a). Moreover, the test is not recommended because it lacks statistical power (Abdi, 2010). Instead, the Box index of sphericity (
\varepsilon
) should be used to assess the degree of violation of the sphericity assumption. The\varepsilon
parameter indicates the degree to which the population departs from sphericity with\varepsilon = 1
indicating that sphericity holds. As the departure becomes more extreme,\varepsilon
approaches its lower bound\hat{\varepsilon}_{lb}
:\hat{\varepsilon}_{lb} = \frac{1}{J - 1}
where
J
is the number of levels of the within-subject factor. Box (1954a, 1954b) suggested a measure for sphericity, which applies to a population covariance matrix. Greenhouse and Geisser (1959) proposed an estimate for\varepsilon
known as\hat{\varepsilon}_{gg}
that can be computed from the sample covariance matrix, whereas Huynh and Feldt (1976) proposed an alternative estimate\hat{\varepsilon}_{hf}
. These estimates can be used to correct the effect and error df of the F-test. Simulation studies showed that\hat{\varepsilon}_{gg} \leq \hat{\varepsilon}_{hf}
and that\hat{\varepsilon}_{gg}
tends to be conservative underestimating\varepsilon
, whereas\hat{\varepsilon}_{hf}
tends to be liberal overestimating\varepsilon
and occasionally exceeding one. Baguley (2012a) recommended to compute the average of the conservative estimate\hat{\varepsilon}_{gg}
and the liberal estimate\hat{\varepsilon}_{hf}
to assess the sphericity assumption. By default, the function prints results depending on the average\hat{\varepsilon}_{gg}
and\hat{\varepsilon}_{hf}
:If the average is less than 0.75 results of the F-Test based on Greenhouse-Geiser correction factor (
\hat{\varepsilon}_{gg}
) is printed.If the average is less greater or equal 0.75, but less than 0.95 results of the F-Test based on Huynh-Feldt correction factor (
\hat{\varepsilon}_{hf}
) is printed.If the average is greater or equal 0.95 results of the F-Test without any corrections are printed.
- Missing Data
The function uses listwise deletion by default to deal with missing data. However, the function also allows to use all available observations by conducting the repeated measures ANOVA in long data format when specifying
na.omit = FALSE
. Note that in the presence of missing data, the F-Test without any sphericity corrections may be reliable, but it is not clear whether results based on Greenhouse-Geiser or Huynh-Feldt correction are trustworthy given that pairwise deletion is used for estimating the variance-covariance matrix when computing\hat{\varepsilon}_{gg}
and the total number of subjects regardless of missing values (i.e., complete and incomplete cases) are used for computing\hat{\varepsilon}_{hf}
.- Within-Subject Confidence Intervals
The function provides a plot showing error bars for difference-adjusted Cousineau-Morey confidence intervals (Baguley, 2012b). The intervals matches that of a CI for a difference, i.e., non-overlapping CIs corresponds to an inferences of no statistically significant difference. The Cousineau-Morey confidence intervals without adjustment can be used by specifying
adjust = FALSE
.
Value
Returns an object of class misty.object
, which is a list with following
entries:
call |
function call |
type |
type of analysis |
data |
list with the data ( |
formula |
formula of the current analysis |
plot |
ggplot2 object for plotting the results |
args |
specification of function arguments |
result |
list with result tables, i.e., |
Author(s)
Takuya Yanagida takuya.yanagida@univie.ac.at
References
Abdi, H. (2010). The Greenhouse-Geisser correction. In N. J. Salkind (Ed.) Encyclopedia of Research Design (pp. 630-634), Sage. https://dx.doi.org/10.4135/9781412961288
Baguley, T. S. (2012a). Serious stats: A guide to advanced statistics for the behavioral sciences. Palgrave Macmillan.
Baguley, T. (2012b). Calculating and graphing within-subject confidence intervals for ANOVA. Behavior Research Methods, 44, 158-175. https://doi.org/10.3758/s13428-011-0123-7
Bakerman, R. (2005). Recommended effect size statistics for repeated measures designs. Behavior Research Methods, 37, 179-384. https://doi.org/10.3758/BF03192707
Box, G. E. P. (1954a) Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effects of Inequality of Variance in the One-way Classification. Annals of Mathematical Statistics, 25, 290–302.
Box, G. E. P. (1954b) Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, II. Effects of Inequality of Variance and of Correlation between Errors in the Two-way Classification. Annals of Mathematical Statistics, 25, 484–98.
Greenhouse, S. W., and Geisser, S. (1959). On methods in the analysis of profile data.Psychometrika, 24, 95-112. https://doi.org/10.1007/BF02289823
Huynh, H., and Feldt, L. S. (1976). Estimation of the box correction for degrees of freedom from sample data in randomized block and splitplot designs. Journal of Educational Statistics, 1, 69-82. https://doi.org/10.2307/1164736
Olejnik, S., & Algina, J. (2000). Measures of effect size for comparative studies: Applications, interpretations, and limitations. Contemporary Educational Psychology, 25, 241-286. https://doi.org/10.1006/ceps.2000.1040
Rasch, D., Kubinger, K. D., & Yanagida, T. (2011). Statistics in psychology - Using R and SPSS. John Wiley & Sons.
See Also
aov.b
, test.t
, test.z
,
cohens.d
, ci.mean.diff
, ci.mean
Examples
dat <- data.frame(time1 = c(3, 2, 1, 4, 5, 2, 3, 5, 6, 7),
time2 = c(4, 3, 6, 5, 8, 6, 7, 3, 4, 5),
time3 = c(1, 2, 2, 3, 6, 5, 1, 2, 4, 6))
# Example 1: Repeated measures ANOVA
aov.w(cbind(time1, time2, time3) ~ 1, data = dat)
# Example 2: Repeated measures ANOVA
# print results based on all sphericity corrections
aov.w(cbind(time1, time2, time3) ~ 1, data = dat, print = "all")
# Example 3: Repeated measures ANOVA
# print effect size measures
aov.w(cbind(time1, time2, time3) ~ 1, data = dat, effsize = TRUE)
# Example 4: Repeated measures ANOVA
# do not print hypotheses and descriptive statistics,
aov.w(cbind(time1, time2, time3) ~ 1, data = dat, descript = FALSE, hypo = FALSE)
## Not run:
# Example 5: Write results into a text file
aov.w(cbind(time1, time2, time3) ~ 1, data = dat, write = "RM-ANOVA.txt")
# Example 6: Repeated measures ANOVA
# plot results
aov.w(cbind(time1, time2, time3) ~ 1, data = dat, plot = TRUE)
# Load ggplot2 package
library(ggplot2)
# Save plot, ggsave() from the ggplot2 package
ggsave("Repeated_measures_ANOVA.png", dpi = 600, width = 4.5, height = 4)
# Example 7: Repeated measures ANOVA
# extract plot
p <- aov.w(cbind(time1, time2, time3) ~ 1, data = dat, output = FALSE)$plot
p
# Extract data
plotdat <- aov.w(cbind(time1, time2, time3) ~ 1, data = dat, output = FALSE)$data
# Draw plot in line with the default setting of aov.w()
ggplot(plotdat$long, aes(time, y, group = 1L)) +
geom_point(aes(time, y, group = id),
alpha = 0.1, position = position_dodge(0.05)) +
geom_line(aes(time, y, group = id),
alpha = 0.1, position = position_dodge(0.05)) +
geom_point(data = plotdat$ci, aes(variable, m), stat = "identity", size = 4) +
stat_summary(aes(time, y), fun = mean, geom = "line") +
geom_errorbar(data = plotdat$ci, aes(variable, m, ymin = low, ymax = upp), width = 0.1) +
theme_bw() + xlab(NULL) +
labs(subtitle = "Two-Sided 95
theme(plot.subtitle = element_text(hjust = 0.5),
plot.title = element_text(hjust = 0.5))
## End(Not run)