| bclust {flexclust} | R Documentation |
Bagged Clustering
Description
Cluster the data in x using the bagged clustering
algorithm. A partitioning cluster algorithm such as
cclust is run repeatedly on bootstrap samples from the
original data. The resulting cluster centers are then combined using
the hierarchical cluster algorithm hclust.
Usage
bclust(x, k = 2, base.iter = 10, base.k = 20, minsize = 0,
dist.method = "euclidian", hclust.method = "average",
FUN = "cclust", verbose = TRUE, final.cclust = FALSE,
resample = TRUE, weights = NULL, maxcluster = base.k, ...)
## S4 method for signature 'bclust,missing'
plot(x, y, maxcluster = x@maxcluster, main = "", ...)
## S4 method for signature 'bclust,missing'
clusters(object, newdata, k, ...)
## S4 method for signature 'bclust'
parameters(object, k)
Arguments
x |
Matrix of inputs (or object of class |
k |
Number of clusters. |
base.iter |
Number of runs of the base cluster algorithm. |
base.k |
Number of centers used in each repetition of the base method. |
minsize |
Minimum number of points in a base cluster. |
dist.method |
Distance method used for the hierarchical
clustering, see |
hclust.method |
Linkage method used for the hierarchical
clustering, see |
FUN |
Partitioning cluster method used as base algorithm. |
verbose |
Output status messages. |
final.cclust |
If |
resample |
Logical, if |
weights |
Vector of length |
maxcluster |
Maximum number of clusters memberships are to be computed for. |
object |
Object of class |
main |
Main title of the plot. |
... |
Optional arguments top be passed to the base method
in |
y |
Missing. |
newdata |
An optional data matrix with the same number of columns as the cluster centers. If omitted, the fitted values are used. |
Details
First, base.iter bootstrap samples of the original data in
x are created by drawing with replacement. The base cluster
method is run on each of these samples with base.k
centers. The base.method must be the name of a partitioning
cluster function returning an object with the same slots as the
return value of cclust.
This results in a collection of iter.base * base.centers
centers, which are subsequently clustered using the hierarchical
method hclust. Base centers with less than
minsize points in there respective partitions are removed
before the hierarchical clustering. The resulting dendrogram is
then cut to produce k clusters.
Value
bclust returns objects of class
"bclust" including the slots
hclust |
Return value of the hierarchical clustering of the
collection of base centers (Object of class |
cluster |
Vector with indices of the clusters the inputs are assigned to. |
centers |
Matrix of centers of the final clusters. Only useful, if the hierarchical clustering method produces convex clusters. |
allcenters |
Matrix of all |
Author(s)
Friedrich Leisch
References
Friedrich Leisch. Bagged clustering. Working Paper 51, SFB “Adaptive Information Systems and Modeling in Economics and Management Science”, August 1999. https://epub.wu.ac.at/1272/1/document.pdf
Sara Dolnicar and Friedrich Leisch. Winter tourist segments in Austria: Identifying stable vacation styles using bagged clustering techniques. Journal of Travel Research, 41(3):281-292, 2003.
See Also
Examples
data(iris)
bc1 <- bclust(iris[,1:4], 3, base.k=5)
plot(bc1)
table(clusters(bc1, k=3))
parameters(bc1, k=3)