dunn.test {dunn.test} | R Documentation |
Dunn's Test
Description
Performs Dunn's test of multiple comparisons using rank sums
Usage
dunn.test (x, g=NA, method=p.adjustment.methods, kw=TRUE, label=TRUE,
wrap=FALSE, table=TRUE, list=FALSE, rmc=FALSE, alpha=0.05, altp=FALSE)
p.adjustment.methods
# c("none", "bonferroni", "sidak", "holm", "hs", "hochberg", "bh", "by")
Arguments
x |
a numeric vector, or a list of numeric vectors. Missing values are ignored. If the former, then groups must be specified using |
g |
a factor variable, numeric vector, or character vector indicating group. Missing values are ignored. |
method |
adjusts the p-value for multiple comparisons using the Bonferroni, Šidák, Holm, Holm-Šidák, Hochberg, Benjamini-Hochberg, or Benjamini-Yekutieli adjustment (see Details). The default is no adjustment for multiple comparisons. |
kw |
if TRUE then the results of the Kruskal-Wallis test are reported. |
label |
if TRUE then the factor labels are used in the output table. |
wrap |
does not break up tables to maintain nicely formatted output. If FALSE then output of large tables is broken up across multiple pages. |
table |
outputs results of Dunn's test in a table format, as qualified by the |
list |
outputs results of Dunn's test in a list format. |
rmc |
if TRUE then the reported test statistics and table are based on row minus column, rather than the default column minus row (i.e. the signs of the test statistic are flipped). |
alpha |
the nominal level of significance used in the step-up/step-down multiple comparisons procedures (Holm, Holm-Šidák, Hochberg, Benjamini-Hochberg, and Benjamini-Yekutieli). |
altp |
if TRUE then express p-values in alternative format. The default is to express p-value = P(Z |
Details
dunn.test
computes Dunn's test (1964) for stochastic dominance and reports the results among multiple pairwise comparisons after a Kruskal-Wallis test for stochastic dominance among k groups (Kruskal and Wallis, 1952). The interpretation of stochastic dominance requires an assumption that the CDF of one group does not cross the CDF of the other. dunn.test
makes m = k(k-1)/2 multiple pairwise comparisons based on Dunn's z-test-statistic approximations to the actual rank statistics. The null hypothesis for each pairwise comparison is that the probability of observing a randomly selected value from the first group that is larger than a randomly selected value from the second group equals one half; this null hypothesis corresponds to that of the Wilcoxon-Mann-Whitney rank-sum test. Like the rank-sum test, if the data can be assumed to be continuous, and the distributions are assumed identical except for a difference in location, Dunn's test may be understood as a test for median difference. dunn.test
accounts for tied ranks.
dunn.test
outputs both z-test-statistics for each pairwise comparison and the p-value = P(Z \ge
|z|) for each. Reject Ho based on p \le
\alpha
/2 (and in combination with p-value ordering for stepwise method
options). If you prefer to work with p-values expressed as p-value = P(|Z| \ge
|z|) use the altp=TRUE
option, and reject Ho based on p \le
\alpha
(and in combination with p-value ordering for stepwise method
options). These are exactly equivalent rejection decisions.
Several options are available to adjust p-values for multiple comparisons, including methods to control the family-wise error rate (FWER) and methods to control the false discovery rate (FDR):
none | no adjustment is made. Those comparisons rejected without adjustment at the \alpha |
level (two-sided test) are starred in the output table, and starred in the list when | |
using the list=TRUE option. |
|
bonferroni | the FWER is controlled using Dunn's (1961) Bonferroni adjustment, and adjusted |
p-values = max(1, pm). Those comparisons rejected with the Bonferroni | |
adjustment at the \alpha level (two-sided test) are starred in the output table, and |
|
starred in the list when using the list=TRUE option. |
|
sidak | the FWER is controlled using Šidák's (1967) adjustment, and adjusted |
p-values = max(1, 1 - (1 - p)^m). Those comparisons rejected with the Šidák | |
adjustment at the \alpha level (two-sided test) are starred in the output table, and |
|
starred in the list when using the list=TRUE option. |
|
holm | the FWER controlled using Holm's (1979) progressive step-up procedure to relax |
control on subsequent tests. p values are ordered from smallest to largest, and | |
adjusted p-values = max[1, p(m+1-i)], where i indexes the ordering. All tests after | |
and including the first test to not be rejected are also not rejected. | |
hs | the FWER is controlled using the Holm-Šidák adjustment (Holm, 1979): another |
progressive step-up procedure but assuming dependence between tests. p values | |
are ordered from smallest to largest, and adjusted | |
p-values = max[1, 1 - (1 - p)^(m+1-i)], where i indexes the ordering. All tests after | |
and including the first test to not be rejected are also not rejected. | |
hochberg | the FWER is controlled using Hochberg's (1988) progressive step-down |
procedure to increase control on successive tests. p values are ordered from largest– | |
smallest, and adjusted p-values = max[1, p*i], where i indexes the ordering. All | |
tests after and including the first to be rejected are also rejected. | |
bh | the FDR is controlled using the Benjamini-Hochberg adjustment (1995), a step- |
down procedure appropriate to independent tests or tests that are positively | |
dependent. p-values are ordered from largest to smallest, and adjusted | |
p-values = max[1, pm/(m+1-i)], where i indexes the ordering. All tests after and | |
including the first to be rejected are also rejected. | |
by | the FDR is controlled using the Benjamini-Yekutieli adjustment (2011), a step- |
down procedure appropriate to dependent tests. p-values are ordered from largest to | |
smallest, and adjusted p-values = max[1, pmC/(m+1-i)], where i indexes the | |
ordering, and the constant C = 1 + 1/2 + . . . + 1/m. All tests after and including the | |
first to be rejected are also rejected. | |
Because the sequential step-up/step-down tests rejection decisions depend on both the p-values and their ordering, those tests rejected using "holm"
, "hs"
, "hochberg"
, "bh"
, or "by"
at the indicated \alpha
level are starred in the output table, and starred in the list when using the list=TRUE
option.
Value
dunn.test
returns:
chi2 |
a scalar of the Kruskal-Wallis test statistic adjusted for ties. |
Z |
a vector of all m of Dunn z test statistics. |
P |
a vector of p-values corresponding to |
altP |
a vector of p-values corresponding to |
P.adjust |
a vector of p-values corresponding to |
altP.adjust |
a vector of p-values corresponding to |
comparisons |
a vector of strings labeling each pairwise comparison, as qualified by the |
Author(s)
Alexis Dinno (alexis.dinno@pdx.edu)
Please contact me with any questions, bug reports or suggestions for improvement. Fixing bugs will be facilitated by sending along:
[1] | a copy of the data (de-labeled or anonymized is fine), |
[2] | a copy of the command syntax used, and |
[3] | a copy of the exact output of the command. |
References
Benjamini, Y. and Hochberg, Y. (1995) Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological). 57, 289–300. <doi:10.1111/j.2517-6161.1995.tb02031.x>.
Benjamini, Y. and Yekutieli, D. (2001) The control of the false discovery rate in multiple testing under dependency. Annals of Statistics. 29, 1165–1188. <doi:10.1214/aos/1013699998>.
Dunn, O. J. (1961) Multiple comparisons among means. Journal of the American Statistical Association. 56, 52–64. <doi:10.1080/01621459.1961.10482090>.
Dunn, O. J. (1964) Multiple comparisons using rank sums. Technometrics. 6, 241–252. <doi:10.1080/00401706.1964.10490181>.
Hochberg, Y. (1988) A sharper Bonferroni procedure for multiple tests of significance. Biometrika. 75, 800–802. <doi:10.1093/biomet/75.4.800>.
Holm, S. (1979) A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics. 6, 65–70.
Kruskal, W. H. and Wallis, A. (1952) Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association. 47, 583–621. <doi:10.1080/01621459.1952.10483441>.
Šidák, Z. (1967) Rectangular confidence regions for the means of multivariate normal distributions. Journal of the American Statistical Association. 62, 626–633. <doi:10.1080/01621459.1967.10482935>.
Examples
## Example cribbed and modified from the kruskal.test documentation
## Hollander & Wolfe (1973), 116.
## Mucociliary efficiency from the rate of removal of dust in normal
## subjects, subjects with obstructive airway disease, and subjects
## with asbestosis.
x <- c(2.9, 3.0, 2.5, 2.6, 3.2) # normal subjects
y <- c(3.8, 2.7, 4.0, 2.4) # with obstructive airway disease
z <- c(2.8, 3.4, 3.7, 2.2, 2.0) # with asbestosis
dunn.test(x=list(x,y,z))
x <- c(x, y, z)
g <- factor(rep(1:3, c(5, 4, 5)),
labels = c("Normal",
"COPD",
"Asbestosis"))
dunn.test(x, g)
## Example based on home care data from Dunn (1964)
data(homecare)
attach(homecare)
dunn.test(occupation, eligibility, method="hs", list=TRUE)
## Air quality data set illustrates differences in different
## multiple comparisons adjustments
attach(airquality)
dunn.test(Ozone, Month, kw=FALSE, method="bonferroni")
dunn.test(Ozone, Month, kw=FALSE, method="hs")
dunn.test(Ozone, Month, kw=FALSE, method="bh")
detach(airquality)