dpseudoF {drclust} | R Documentation |
double pseudoF (Calinski-Harabsz) index
Description
A pseudoF version for double partitioning, for the choice of the number of clusters of the units and variables (rows and columns of the data matrix). It is a diagnostic tool for inspecting simultaneously the optimal number of unit-clusters and variable-clusters.
Usage
dpseudoF(data, maxK, maxQ)
Arguments
data |
Units x variables numeric data matrix. |
maxK |
Maximum number of clusters for the units to be tested. |
maxQ |
Maximum number of clusters for the variables to be tested. |
Value
dpseudoF |
matrix containing the pF value for each pair of K and Q within the specified range |
Author(s)
Ionel Prunila, Maurizio Vichi
References
R. Rocci, M. Vichi (2008)" Two-mode multi-partitioning" <doi:10.1016/j.csda.2007.06.025>
T. Calinski & J. Harabasz (1974). A dendrite method for cluster analysis. Communications in Statistics, 3:1, 1-27
Examples
# Iris data
# Loading the numeric variables of iris data
iris <- as.matrix(iris[,-5])
dpeudoF <- dpseudoF(iris, maxK=10, maxQ = 3)
[Package drclust version 0.1 Index]