quickLook {dinamic}R Documentation

Find DiNAMIC's null distribution

Description

Find DiNAMIC's null distribution

Usage

quickLook(
  x,
  marker.data,
  annot.file,
  num.perms,
  num.iters,
  gain.loss = "gain",
  reformat.annot = FALSE,
  random.seed = NULL
)

Arguments

x

An n by m numeric matrix containing DNA copy number data from n subjects at m markers.

marker.data

A dataframe containing marker position data for markers in the autosomes. Column 1 contains the chromosome number for each marker, and column 2 contains the position (in base pairs) for each markers. Additional columns, if present, represent information about the markers (e.g. probe names).

annot.file

A cytoband annotation dataframe. Each row corresponds to a distinct cytoband, and column 1 contains the chromosome number, column 2 contains the start position (in base pairs), column 3 contains the end position (in base pairs), and column 4 contains the cytoband name (e.g. p21.3). Additional columns may be present, but they are not used.

num.perms

A positive integer that represents the number of cyclic shifts used to create the empirical distribution.

num.iters

A positive integer that represents the number of distinct gain (loss) loci that will be assessed. See "Details" for more information.

gain.loss

A character string that indicates whether recurrent gains (gain.loss = "gain") or recurrent losses (gain.loss = "loss") are assessed.

reformat.annot

A logical value that indicates whether annot.file needs to be reformatted (default = FALSE). See the "Note" section of makeCytoband for additional information.

random.seed

An optional random seed (default = NULL).

Details

This function applies the "Quick Look" version of DiNAMIC's cyclic shift procedure to assess the statistical significance of recurrent DNA copy number aberrations. Either recurrent gains (gain.loss = "gain") or recurrent losses (gain.loss = "loss") are assessed using a null distribution based on num.perms cyclic shifts of x. Iterative calls to DiNAMIC's peeling procedure (implemented here in the peeling function) allow users to assess the statistical significance of num.iters distinct gains (losses). As noted in Bioinformatics (2011) 27(5) 678 - 685, the "Quick Look" procedure calculates the null distribution once, and the same distribution is used to assess the statistical significance of the most aberrant gain or loss after each iteration of the peeling procedure. This approach is less computationally intensive than "Detailed Look" because the null distribution is only computed once, but simulations suggest that it provides less power to detect recurrent gains (losses). The resulting p-values are corrected for multiple comparisons because the null distribution is based on computing max(colSums(x)) or min(colSums(x)).

Value

A matrix with num.iters rows. The entries of each row correspond to the marker that is being assessed. More specifically, the entries are (1) the chromosome number, (2) the marker position (in base pairs), (3) additional marker information present in marker.data, (4) the marker number, and (5) the p-value obtained from the null distribution, (6) the endpoints of the peak interval (in base pairs), as described in Bioinformatics (2011) 27(5) 678 - 685.

Examples

quickLook(wilms.data, wilms.markers, annot.file, 100, 3)


[Package dinamic version 1.0.1 Index]