R: double pseudoF (Calinski-Harabsz) index

dpseudoF {drclust}

R Documentation

double pseudoF (Calinski-Harabsz) index

Description

A pseudoF version for double partitioning, for the choice of the number of clusters of the units and variables (rows and columns of the data matrix). It is a diagnostic tool for inspecting simultaneously the optimal number of unit-clusters and variable-clusters.

Usage

dpseudoF(data, maxK, maxQ)

Arguments

`data`	Units x variables numeric data matrix.
`maxK`	Maximum number of clusters for the units to be tested.
`maxQ`	Maximum number of clusters for the variables to be tested.

Value

dpseudoF

matrix containing the pF value for each pair of K and Q within the specified range

Author(s)

Ionel Prunila, Maurizio Vichi

References

R. Rocci, M. Vichi (2008)" Two-mode multi-partitioning" <doi:10.1016/j.csda.2007.06.025>

T. Calinski & J. Harabasz (1974). A dendrite method for cluster analysis. Communications in Statistics, 3:1, 1-27

Examples

# Iris data 
# Loading the numeric variables of iris data
iris <- as.matrix(iris[,-5]) 

dpeudoF <- dpseudoF(iris, maxK=10, maxQ = 3)

[Package drclust version 0.1 Index]