FLLat.BIC {FLLat}R Documentation

Optimal Tuning Parameters for the Fused Lasso Latent Feature Model

Description

Returns the optimal values of the fused lasso tuning parameters for the Fused Lasso Latent Feature (FLLat) model by minimizing the BIC. Also returns the fitted FLLat model for the optimal values of the tuning parameters.

Usage

FLLat.BIC(Y, J=min(15,floor(ncol(Y)/2)), B="pc", thresh=10^(-4), maxiter=100,
          maxiter.B=1, maxiter.T=1)

Arguments

Y

A matrix of data from an aCGH experiment (usually in the form of log intensity ratios) or some other type of copy number data. Rows correspond to the probes and columns correspond to the samples.

J

The number of features in the FLLat model. The default is the smaller of either 1515 or the number of samples divided by 22.

B

The initial values for the features. Can be one of "pc" (the first J principal components of Y), "rand" (a random selection of J columns of Y), or a user specified matrix of initial values, where rows correspond to the probes and columns correspond to the features. The default is "pc".

thresh

The threshold for determining when the solutions have converged. The default is 10410^{-4}.

maxiter

The maximum number of iterations for the outer loop of the FLLat algorithm. The default is 100100.

maxiter.B

The maximum number of iterations for the inner loop of the FLLat algorithm for estimating the features BB. The default is 11. Increasing this may decrease the number of iterations for the outer loop but may still increase total run time.

maxiter.T

The maximum number of iterations for the inner loop of the FLLat algorithm for estimating the weights Θ\Theta. The default is 11. Increasing this may decrease the number of iterations for the outer loop but may still increase total run time.

Details

This function returns the optimal values of the fused lasso tuning parameters, λ1\lambda_1 and λ2\lambda_2, for the FLLat model. The optimal values are chosen by first re-parameterizing λ1\lambda_1 and λ2\lambda_2 in terms of λ0\lambda_0 and a proportion α\alpha such that λ1=αλ0\lambda_1=\alpha\lambda_0 and λ2=(1α)λ0\lambda_2=(1-\alpha)\lambda_0. The values of α\alpha are fixed to be {0.1,0.3,0.5,0.7,0.9}\{0.1, 0.3, 0.5, 0.7, 0.9\} and for each value of α\alpha we consider a range of λ0\lambda_0 values. The optimal values of λ0\lambda_0 and α\alpha (and consequently λ1\lambda_1 and λ2\lambda_2) are chosen by minimizing the following BIC-type criterion over this two dimensional grid:

(SL)×log(RSSSL)+kα,λ0log(SL),(SL)\times\log\left(\frac{RSS}{SL}\right) + k_{\alpha,\lambda_0}\log(SL),

where SS is the number of samples, LL is the number probes, RSSRSS denotes the residual sum of squares and kα,λ0k_{\alpha, \lambda_0} denotes the sum over all the features of the number of unique non-zero elements in each estimated feature.

Note that for extremely large data sets, this function may take some time to run.

For more details, please see Nowak and others (2011) and the package vignette.

Value

A list with components:

lam0

The optimal value of λ0\lambda_0.

alpha

The optimal value of α\alpha.

lam1

The optimal value of λ1\lambda_1.

lam2

The optimal value of λ2\lambda_2.

opt.FLLat

The fitted FLLat model for the optimal values of the tuning parameters.

Author(s)

Gen Nowak gen.nowak@gmail.com, Trevor Hastie, Jonathan R. Pollack, Robert Tibshirani and Nicholas Johnson.

References

G. Nowak, T. Hastie, J. R. Pollack and R. Tibshirani. A Fused Lasso Latent Feature Model for Analyzing Multi-Sample aCGH Data. Biostatistics, 2011, doi: 10.1093/biostatistics/kxr012

See Also

FLLat

Examples

## Load simulated aCGH data.
data(simaCGH)

## Run FLLat.BIC to choose optimal tuning parameters for J = 5 features.
result.bic <- FLLat.BIC(simaCGH,J=5)

## Plot the features for the optimal FLLat model.
plot(result.bic$opt.FLLat)

## Plot a heatmap of the weights for the optimal FLLat model.
plot(result.bic$opt.FLLat,type="weights")

[Package FLLat version 1.2-1 Index]