meantest {PEtests}R Documentation

Two-sample mean tests for high-dimensional data

Description

This function implements five two-sample mean tests on high-dimensional mean vectors. Let XRp\mathbf{X} \in \mathbb{R}^p and YRp\mathbf{Y} \in \mathbb{R}^p be two pp-dimensional populations with mean vectors (μ1,μ2)(\boldsymbol{\mu}_1, \boldsymbol{\mu}_2) and covariance matrices (Σ1,Σ2)(\mathbf{\Sigma}_1, \mathbf{\Sigma}_2), respectively. The problem of interest is to test the equality of the two mean vectors of the two populations:

H0m:μ1=μ2.H_{0m}: \boldsymbol{\mu}_1 = \boldsymbol{\mu}_2.

Suppose {X1,,Xn1}\{\mathbf{X}_1, \ldots, \mathbf{X}_{n_1}\} are i.i.d. copies of X\mathbf{X}, and {Y1,,Yn2}\{\mathbf{Y}_1, \ldots, \mathbf{Y}_{n_2}\} are i.i.d. copies of Y\mathbf{Y}. We denote dataX=(X1,,Xn1)Rn1×p(\mathbf{X}_1, \ldots, \mathbf{X}_{n_1})^\top\in\mathbb{R}^{n_1\times p} and dataY=(Y1,,Yn2)Rn2×p(\mathbf{Y}_1, \ldots, \mathbf{Y}_{n_2})^\top\in\mathbb{R}^{n_2\times p}.

Usage

meantest(dataX,dataY,method='pe.comp',delta=NULL)

Arguments

dataX

an n1n_1 by pp data matrix

dataY

an n2n_2 by pp data matrix

method

the method type (default = 'pe.comp'); chosen from

  • 'clx': the ll_\infty-norm-based mean test, proposed in Cai et al. (2014);
    see meantest.clx for details.

  • 'cq': the l2l_2-norm-based mean test, proposed in Chen and Qin (2010);
    see meantest.cq for details.

  • 'pe.cauchy': the PE mean test via Cauchy combination;
    see meantest.pe.cauchy for details.

  • 'pe.comp': the PE mean test via the construction of PE components;
    see meantest.pe.comp for details.

  • 'pe.fisher': the PE mean test via Fisher's combination;
    see meantest.pe.fisher for details.

delta

This is needed only in method='pe.comp'; see meantest.pe.comp for details. The default is NULL.

Value

method the method type

stat the value of test statistic

pval the p-value for the test.

References

Chen, S. X. and Qin, Y. L. (2010). A two-sample test for high-dimensional data with applications to gene-set testing. Annals of Statistics, 38(2):808–835.

Cai, T. T., Liu, W., and Xia, Y. (2014). Two-sample test of high dimensional means under dependence. Journal of the Royal Statistical Society: Series B: Statistical Methodology, 76(2):349–372.

Yu, X., Li, D., Xue, L., and Li, R. (2022). Power-enhanced simultaneous test of high-dimensional mean vectors and covariance matrices with application to gene-set testing. Journal of the American Statistical Association, (in press):1–14.

Examples

n1 = 100; n2 = 100; pp = 500
set.seed(1)
X = matrix(rnorm(n1*pp), nrow=n1, ncol=pp)
Y = matrix(rnorm(n2*pp), nrow=n2, ncol=pp)
meantest(X,Y)

[Package PEtests version 0.1.0 Index]