SS_gwas {RAINBOWR} | R Documentation |
Calculate some summary statistics of GWAS (genome-wide association studies) for simulation study
Description
Calculate some summary statistics of GWAS (genome-wide association studies) for simulation study
Usage
SS_gwas(
res,
x,
map.x,
qtn.candidate,
gene.set = NULL,
n.top.false.block = 10,
sig.level = c(0.05, 0.01),
method.thres = "BH",
inflator.plus = 2,
LD_length = 150000,
cor.thres = 0.35,
window.size = 0,
saveName = NULL,
plot.ROC = TRUE
)
Arguments
res |
Data frame of GWAS results where the first column is the marker names, the second and third column is the chromosome amd map position, and the forth column is -log10(p) for each marker. |
x |
A N (lines) x M (markers) marker genotype data (matrix), coded as [-1, 0, 1] = [aa, Aa, AA]. |
map.x |
Data frame with the marker names in the first column. The second and third columns contain the chromosome and map position. |
qtn.candidate |
A vector of causal markers. You should assign where those causal markers are positioned in our marker genotype, rather than physical position of those causal markers. |
gene.set |
If you have information of gene (or haplotype block), and if you used it to perform kernel-based GWAS, you should assign your gene information to gene.set in the form of a "data.frame" (whose dimension is (the number of gene) x 2). In the first column, you should assign the gene name. And in the second column, you should assign the names of each marker, which correspond to the marker names of "x" argument. |
n.top.false.block |
We will calculate the mean of -log10(p) values of top 'n.top.false.block' blocks to evaluate the inflation level of results. The default is 10. |
sig.level |
Significance level for the threshold. The default is 0.05. |
method.thres |
Method for detemining threshold of significance. "BH" and "Bonferroni are offered. |
inflator.plus |
If 'the -log10(p) value for each marker' exceeds ('the inflation level' + 'inflator.plus'), that marker is regarded as significant. |
LD_length |
SNPs within the extent of LD are regareded as one set. This LD_length determines the size of LD block, and 2 x LD_length (b.p.) will be the size of LD block. |
cor.thres |
SNPs within the extent of LD are regareded as one set. This cor.thres also determines the size of LD block, and the region with square of correlation coefficients >= cor.thres is regareded as one LD block. More precisely, the regions which satisfies both LD_length and cor.thres condition is rearded as one LD block. |
window.size |
If you peform SNP-set analysis with slinding window, we can consider the effect of window size by this argument. |
saveName |
When drawing any plot, you can save plots in png format. In saveName, you should substitute the name you want to save. When saveName = NULL, the plot is not saved. |
plot.ROC |
If this argunent is TRUE, ROC (Reciever Operating Characteristic) curve will be drawn with AUC (Area Under the Curve). |
Value
- $log.p
-log10(p)) values of the causals.
- $qtn.logp.order
The rank of -log10(p) of causals.
- $thres
A vector which contains the information of threshold.
- $overthres
The number of markers which exceed the threshold.
- $AUC
Area under the curve.
- $AUC.relax
Area under the curve calculated with LD block units.
- $FDR
False discovery rate. 1 - Precision.
- $FPR
False positive rate.
- $FNR
False negative rate. 1 - Recall.
- $Recall
The proportion of the number of causals dected by GWAS to the number of causals you set.
- $Precision
The proportion of the number of causals dected by GWAS to the number of markers detected by GWAS.
- $Accuracy
The accuracy of GWAS results.
- $Hm
Harmonic mean of Recall and Precision.
- $haplo.name
The haplotype block name which correspond to causals.
- $mean.false
The mean of -log10(p) values of top 'n.top.false.block' blocks.
- $max.trues
Maximum of the -log10(p) values of the region near causals.