BCPlaid {biclust}R Documentation

The Plaid Model Bicluster algorithm

Description

Performs Plaid Model Biclustering as described in Turner et al., 2003. This is an improvement of original 'Plaid Models for Gene Expression Data' (Lazzeroni and Owen, 2002). This algorithm models data matrices to a sum of layers, the model is fitted to data through minimization of error.

Usage

## S4 method for signature 'matrix,BCPlaid'
biclust(x, method=BCPlaid(), cluster="b", fit.model = y ~ m + a + b,
  background = TRUE, background.layer = NA, background.df = 1, row.release = 0.7, 
  col.release = 0.7, shuffle = 3, back.fit = 0, max.layers = 20, iter.startup = 5,
  iter.layer = 10, verbose = TRUE)

Arguments

x

The data matrix where biclusters have to be found

method

Here BCPlaid, to perform Plaid algorithm

cluster

'r', 'c' or 'b', to cluster rows, columns or both (default 'b')

fit.model

Model (formula) to fit each layer. Usually, a linear model is used, that estimates three parameters: m (constant for all elements in the bicluster), a(contant for all rows in the bicluster) and b (constant for all columns). Thus, default is: y ~ m + a + b.

background

If 'TRUE' the method will consider that a background layer (constant for all rows and columns) is present in the data matrix.

background.layer

If background='TRUE' a own background layer (Matrix with dimension of x) can be specified.

background.df

Degrees of Freedom of backround layer if background.layer is specified.

shuffle

Before a layer is added, it's statistical significance is compared against a number of layers obtained by random defined by this parameter. Default is 3, higher numbers could affect time performance.

iter.startup

Number of iterations to find starting values

iter.layer

Number of iterations to find each layer

back.fit

After a layer is added, additional iterations can be done to refine the fitting of the layer (default set to 0)

row.release

Scalar in [0,1](with interval recommended [0.5-0.7]) used as threshold to prune rows in the layers depending on row homogeneity

col.release

As above, with columns

max.layers

Maximum number of layer to include in the model

verbose

If 'TRUE' prints extra information on progress.

Value

Returns an Biclust object.

Author(s)

Adaptation of original code from Heather Turner from Rodrigo Santamaria rodri@usal.es. rodri@usal.es

References

Heather Turner et al, "Improved biclustering of microarray data demonstrated through systematic performance tests",Computational Statistics and Data Analysis, 2003, vol. 48, pages 235-254.

Lazzeroni and Owen, "Plaid Models for Gene Expression Data", Standford University, 2002.

Examples

  #Random matrix with embedded bicluster
  test <- matrix(rnorm(5000),100,50)
  test[11:20,11:20] <- rnorm(100,3,0.3)
  res<-biclust(test, method=BCPlaid())
  res

  #microarray matrix
  data(BicatYeast)
  res<-biclust(BicatYeast, method=BCPlaid(), verbose=FALSE)
  res
  

[Package biclust version 2.0.3.1 Index]