ad.test.combined {kSamples} | R Documentation |
Combined Anderson-Darling k-Sample Tests
Description
This function combines several independent Anderson-Darling -sample tests
into one overall test of the hypothesis that the independent samples within
each block come from a common unspecified distribution, while the common
distributions may vary from block to block. Both versions of the
Anderson-Darling test statistic are provided.
Usage
ad.test.combined(..., data = NULL,
method = c("asymptotic", "simulated", "exact"),
dist = FALSE, Nsim = 10000)
Arguments
... |
Either a sequence of several lists, say or a list of such lists, or a formula, like y ~ g | b, where y is a numeric response vector, g is a factor with levels indicating different treatments and b is a factor indicating different blocks; y, g, b are or equal length. y is split separately for each block level into separate samples according to the g levels. The same g level may occur in different blocks. The variable names may correspond to variables in an optionally supplied data frame via the data = argument, |
data |
= an optional data frame providing the variables in formula input |
method |
=
of the final distribution vector. Otherwise, it reverts to the
simulation method using the provided |
dist |
|
Nsim |
|
Details
If is the Anderson-Darling criterion for the i-th block of
samples,
its standardized test statistic is
, with
and
representing mean and standard deviation of
. This statistic
is used to test the hypothesis that the samples in the i-th block all come
from the same but unspecified continuous distribution function
.
The combined Anderson-Darling criterion is
and
is the standardized form,
where
and
represent the mean and standard deviation of
.
The statistic
is used to simultaneously
test whether the samples
in each block come from the same continuous distribution function
.
The unspecified common distribution function
may change
from block to block. According to the reference article, two versions
of the test statistic and its corresponding combinations are provided.
The for each block of
independent samples may change from block to block.
NA values are removed and the user is alerted with the total NA count. It is up to the user to judge whether the removal of NA's is appropriate.
The continuity assumption can be dispensed with if we deal with
independent random samples, or if randomization was used in allocating
subjects to samples or treatments, independently from block to block, and if we view
the simulated or exact -values conditionally, given the tie patterns
within each block. Of course, under such randomization any conclusions
are valid only with respect to the blocks of subjects that were randomly allocated.
The asymptotic
-value calculation assumes distribution continuity. No adjustment
for lack thereof is known at this point. The same comment holds for the means
and standard deviations of respective statistics.
Value
A list of class kSamples
with components
test.name |
|
M |
number of blocks of samples being compared |
n.samples |
list of |
nt |
|
n.ties |
vector giving the number of ties in each the |
ad.list |
list of |
mu |
vector of means of the |
sig |
vector of standard deviations of the |
ad.c |
2 x 3 (2 x 4) matrix containing
|
mu.c |
mean of |
sig.c |
standard deviation of |
warning |
logical indicator, |
null.dist1 |
simulated or enumerated null distribution of version 1
of |
null.dist2 |
simulated or enumerated null distribution of version 2
of |
method |
the |
Nsim |
the number of simulations used for each block of samples. |
Note
This test is useful in analyzing treatment effects in randomized (incomplete) block experiments and in examining performance equivalence of several laboratories when presented with different test materials for comparison.
References
Scholz, F. W. and Stephens, M. A. (1987), K-sample Anderson-Darling Tests, Journal of the American Statistical Association, Vol 82, No. 399, 918–924.
See Also
Examples
## Create two lists of sample vectors.
x1 <- list( c(1, 3, 2, 5, 7), c(2, 8, 1, 6, 9, 4), c(12, 5, 7, 9, 11) )
x2 <- list( c(51, 43, 31, 53, 21, 75), c(23, 45, 61, 17, 60) )
# and a corresponding data frame datx1x2
x1x2 <- c(unlist(x1),unlist(x2))
gx1x2 <- as.factor(c(rep(1,5),rep(2,6),rep(3,5),rep(1,6),rep(2,5)))
bx1x2 <- as.factor(c(rep(1,16),rep(2,11)))
datx1x2 <- data.frame(A = x1x2, G = gx1x2, B = bx1x2)
## Run ad.test.combined.
set.seed(2627)
ad.test.combined(x1, x2, method = "simulated", Nsim = 1000)
# or with same seed
# ad.test.combined(list(x1, x2), method = "simulated", Nsim = 1000)
# ad.test.combined(A~G|B,data=datx1x2,method="simulated",Nsim=1000)