perm.significance {MVisAGe}R Documentation

A Function for Computing a Vector of Pearson Correlation Coefficients

Description

This function computes Pearson correlation coefficients on a row-by-row basis for two numerical input matrices of the same size.

Usage

perm.significance(exp.mat, cn.mat, gene.annot, method = "pearson",
  digits = 5, num.perms = 100, random.seed = NULL,
  alternative = "greater")

Arguments

exp.mat

A matrix of gene-level expression data (rows = genes, columns = samples). Missing values are not permitted.

cn.mat

A matrix of gene-level DNA copy number data (rows = genes, columns = samples). Both genes and samples should appear in the same order as exp.mat. Missing values are not permitted.

gene.annot

A three-column matrix containing gene position information. Column 1 = chromosome number written in the form 'chr1' (note that chrX and chrY should be written chr23 and chr24), Column 2 = position (in base pairs), Column 3 = cytoband. Genes should appear in the same order as exp.mat and cn.mat.

method

A character string (either "pearson" or "spearman") specifying the method used to calculate the correlation coefficient (default = "pearson").

digits

Used with signif() to specify the number of significant digits (default = 5).

num.perms

Number of permutations used to assess significance (default = 1e2).

random.seed

Random seed (default = NULL).

alternative

A character string ("greater" or "less") that specifies the direction of the alternative hypothesis, either rho > 0 or rho < 0 (default = "greater").

Value

Returns a five-column matrix. The first three columns are the same as gene.annot. The fourth column contains gene-specific Pearson or Spearman correlation coefficients based on the entries in each row of exp.mat and cn.mat, respectively (column name = "R"). The fifth column contains squared Pearson correlation coefficients (column name = "R^2"). The sixth column contains the permutation-based right-tailed p-value of the correlation coefficient (column name = "perm_pValue"). The seventh column contains Benjamini-Hochberg q-values corresponding to the p-values. Genes with constant gene expression or DNA copy number are removed because they have zero variance.

Examples

exp.mat = tcga.exp.convert(exp.mat)

 cn.mat = tcga.cn.convert(cn.mat)

 prepped.data = data.prep(exp.mat, cn.mat, gene.annot, sample.annot, log.exp = FALSE)

 perm.significance(prepped.data[["exp"]], prepped.data[["cn"]], prepped.data[["gene.annot"]])


[Package MVisAGe version 0.2.1 Index]