R: Multiple Latent Components Clustering

mlcc.kmeans {varclust}

R Documentation

Multiple Latent Components Clustering - kmeans algorithm

Description

Performs k-means based subspace clustering. Center of each cluster is some number of principal components. This number can be fixed or estimated by PESEL. Similarity measure between variable and a cluster is calculated using BIC.

Usage

mlcc.kmeans(X, number.clusters = 2, stop.criterion = 1,
  max.iter = 30, max.subspace.dim = 4, initial.segmentation = NULL,
  estimate.dimensions = TRUE, show.warnings = FALSE)

Arguments

`X`	A matrix with only continuous variables.
`number.clusters`	An integer, number of clusters to be used.
`stop.criterion`	An integer indicating how many changes in partitions triggers stopping the algorithm.
`max.iter`	An integer, maximum number of iterations of k-means loop.
`max.subspace.dim`	An integer, maximum dimension of subspaces.
`initial.segmentation`	A vector, initial segmentation of variables to clusters.
`estimate.dimensions`	A boolean, if TRUE (value set by default) subspaces dimensions are estimated.
`show.warnings`	A boolean, if set to TRUE all warnings are displayed, default value is FALSE.

Value

A list consisting of:

`segmentation`	a vector containing the partition of the variables
`pcas`	a list of matrices, basis vectors for each cluster (subspace)

References

Bayesian dimensionality reduction with PCA using penalized semi-integrated likelihood, Piotr Sobczyk, Malgorzata Bogdan, Julie Josse

Examples


sim.data <- data.simulation(n = 50, SNR = 1, K = 5, numb.vars = 50, max.dim = 3)
mlcc.res <- mlcc.kmeans(sim.data$X, number.clusters = 5, max.iter = 20, max.subspace.dim = 3)
show.clusters(sim.data$X, mlcc.res$segmentation)

[Package varclust version 0.9.4 Index]