covtest.pe.fisher {PEtests}R Documentation

Two-sample PE covariance test for high-dimensional data via Fisher's combination

Description

This function implements the two-sample PE covariance test via Fisher's combination. Suppose {X1,,Xn1}\{\mathbf{X}_1, \ldots, \mathbf{X}_{n_1}\} are i.i.d. copies of X\mathbf{X}, and {Y1,,Yn2}\{\mathbf{Y}_1, \ldots, \mathbf{Y}_{n_2}\} are i.i.d. copies of Y\mathbf{Y}. Let pLCp_{LC} and pCLXp_{CLX} denote the pp-values associated with the l2l_2-norm-based covariance test (see covtest.lc for details) and the ll_\infty-norm-based covariance test (see covtest.clx for details), respectively. The PE covariance test via Fisher's combination is defined as

TFisher=2log(pLC)2log(pCLX).T_{Fisher} = -2\log(p_{LC})-2\log(p_{CLX}).

It has been proved that with some regularity conditions, under the null hypothesis H0c:Σ1=Σ2,H_{0c}: \mathbf{\Sigma}_1 = \mathbf{\Sigma}_2, the two tests are asymptotically independent as n1,n2,pn_1, n_2, p\rightarrow \infty, and therefore TFisherT_{Fisher} asymptotically converges in distribution to a χ42\chi_4^2 distribution. The asymptotic pp-value is obtained by

p-value=1Fχ42(TFisher),p\text{-value} = 1-F_{\chi_4^2}(T_{Fisher}),

where Fχ42()F_{\chi_4^2}(\cdot) is the cdf of the χ42\chi_4^2 distribution.

Usage

covtest.pe.fisher(dataX,dataY)

Arguments

dataX

an n1n_1 by pp data matrix

dataY

an n2n_2 by pp data matrix

Value

stat the value of test statistic

pval the p-value for the test.

References

Yu, X., Li, D., and Xue, L. (2022). Fisher’s combined probability test for high-dimensional covariance matrices. Journal of the American Statistical Association, (in press):1–14.

Examples

n1 = 100; n2 = 100; pp = 500
set.seed(1)
X = matrix(rnorm(n1*pp), nrow=n1, ncol=pp)
Y = matrix(rnorm(n2*pp), nrow=n2, ncol=pp)
covtest.pe.fisher(X,Y)

[Package PEtests version 0.1.0 Index]