R: Test of Marginal Proportion for Clustered Data

proptestClust {htestClust}

R Documentation

Test of Marginal Proportion for Clustered Data

Description

proptestClust can be used for testing the null that the marginal proportion (probability of success) is equal to certain given values in clustered data with potentially informative cluster size.

Usage

proptestClust(x, id, p = NULL, alternative = c("two.sided", "less", "greater"),
             variance = c("sand.null", "sand.est", "emp", "MoM"), conf.level = 0.95)

Arguments

`x`	a vector of binary indicators denoting success/failure of each observation, or a two-dimensional table (or matrix) with 2 columns giving the aggregate counts of failures and successes (respectively) across clusters.
`id`	a vector which identifies the clusters; ignored if `x` is a matrix or table. The length of `id` must be the same as the length of `x`.
`p`	the null hypothesized value of the marginal proportion. Must be a single number greater than 0 and less than 1.
`alternative`	a character string specifying the alternative hypothesis. Must be one of "`two.sided`", "`greater`", or "`less`". You can specify just the initial letter.
`variance`	character string specifying the method of variance estimation. Must be one of "`sand.null`", "`sand.est`", "`emp`", or "`MoM`".
`conf.level`	confidence level of the returned confidence interval. Must be a single number between 0 and 1.

Details

If p is not given, the null tested is that the underlying marginal probability of success is .5.

The variance argument allows the user to specify the method of variance estimation, selecting from the sandwich estimate evaluated at the null hypothesis (sand.null), the sandwich estimate evaluated at the cluster-weighted proportion (sand.est), the empirical estimate (emp), or the method of moments estimate (MoM).

Value

A list with class "htest" containing the following components:

`statistic`	the value of the test statistic.
`p.value`	the p-value of the test.
`estimate`	the estimated marginal proportion.
`null.value`	the value of `p` under the null hypothesis.
`conf.int`	a confidence interval for the true marginal proportion.
`alternative`	a character string describing the alternative hypothesis.
`method`	a character string indicating the test performed and method of variance estimation.
`data.name`	a character string giving the name of the data and the total number of clusters.
`M`	the number of clusters.

References

Gregg, M., Datta, S., Lorenz, D. (2020) Variance Estimation in Tests of Clustered Categorical Data with Informative Cluster Size. Statistical Methods in Medical Research, doi:10.1177/0962280220928572.

Examples

data(screen8)
## using vectors
## suppose math proficiency is determined by score >= 65
## is the marginal proportion of students proficient in math at least 75%?
screen8$math.p <- 1*(screen8$math>=65)
proptestClust(screen8$math.p, screen8$sch.id, p = .75, alternative = "great")

## using table and empirical variance; two-sided CI
## (note that "failure" counts are the first column and "success" counts are the second column)
mathp.tab <- table(screen8$sch.id, screen8$math.p)
proptestClust(mathp.tab, variance="emp", p=.75)

## when all clusters have a size of 1, results will be in general correspondence with
## that of the classical analogue test
set.seed(123)
x <- rbinom(100, size = 1, p = 0.7)
id <- 1:100
proptestClust(x, id)
prop.test(sum(x), length(x))

[Package htestClust version 0.2.2 Index]