ScoreTest_SPA {SPAtest} | R Documentation |
Score test based on saddlepoint approximation
Description
Performs score test using saddlepoint approximation to estimate the null distribution.
Usage
ScoreTest_SPA(genos,pheno,cov,obj.null,method=c("fastSPA","SPA"),minmac=5,
Cutoff=2,alpha=5*10^-8,missing.id=NA,beta.out=FALSE,beta.Cutoff=5*10^-7,log.p=FALSE)
Arguments
genos |
A vector or matrix containing the genotypes or dosages. If matrix is provided then rows should correspond to SNPs and columns should correspond to subjects. Optional, but needed if |
pheno |
A vector containing the outcomes (phenotypes). Optional, but needed if |
cov |
A matrix or data frame containing the covariates. Optional, but needed if |
obj.null |
An object of class " |
method |
String specifying the p-value calculation method. Possible values are " |
minmac |
Minimum minor allele count threshold to run SPA test, default value is |
Cutoff |
An integer or the string "BE" denoting the standard deviation cutoff to be used. If |
alpha |
Significance level for the test(s), default value is |
missing.id |
Missing value indicator. Numeric or |
beta.out |
Logical indicating whether log odds ratios (beta parameters) are to be estimated, default value is |
beta.Cutoff |
Maximum p-value threshold for beta parameters to be estimated, default value is |
log.p |
Whether to return natural log-transformed p-values, default value is |
Details
genos
can have discrete 0, 1, 2
values or continuous values between [0,2]
. The genotype or dosage values can represent any of the major allele, minor allele, reference allele or alternate allele counts (or dosages), as long as it is consistent throughout the subjects.
genos
can have missing values denoted by the missing.id
argument. Such missing values will be imputed using mean imputation. pheno
or cov
cannot have missing values.
pheno
and cov
are ignored if obj.null
is provided. If both obj.null
and cov
is missing, or obj.null
is missing and cov=NULL
, then the vector rep(1,n)
is assigned to cov
, where n
is the number of subjects.
method
= "SPA" is the basic saddlepoint approximation based test without the partially normal approximation improvement.
method
= "fastSPA" utilizes the partially normal approximation approach for improved efficiency, especially for rare variants.
Beta parameters are estimated using Firth's method, and thus computationally expensive. Therefore, it is recommended that beta parameters are only to be estimated when the p-value is very small (denoted by beta.Cutoff
). The code for beta estimation is as implemented by Clement Ma in the EPACTS software.
Value
p.value |
p-value or natural log-transformed p-value based on the saddlepoint approximation. |
p.value.NA |
p-value or natural log-transformed p-value based on the normal approximation (traditional score test). |
Is.converge |
"TRUE" or "FALSE" denoting whether the root-finding algorithm for the saddlepoint equation has converged. |
beta |
Genotype log-odds ratio estimate. |
SEbeta |
Standard error for the genotype log-odds ratio. |
Author(s)
Rounak Dey, deyrnk@umich.edu
References
Dey, R. et al., 2017. A Fast and Accurate Algorithm to Test for Binary Phenotypes and Its Application to PheWAS. The American Journal of Human Genetics, Vol 101 (1), 37-49.
Ma, C. et al., 2013. Recommended Joint and Meta-Analysis Strategies for Case-Control Association Testing of Single Low-Count Variants. Genetic Epidemiology, Vol 37 (6), 539-550.
See Also
ScoreTest_wSaddleApprox_NULL_Model
Examples
## Not run:
ScoreTest_SPA(genos,pheno,cov,obj.null,method=c("fastSPA","SPA"),
minmac=5,Cutoff=2,alpha=5*10^-8,missing.id=NA,beta.out=FALSE,beta.Cutoff=5*10^-7,log.p=FALSE)
## End(Not run)