OTRIMLE for a range of numbers of clusters with density-based cluster quality statistic


Computes Optimally Tuned Robust Improper Maximum Likelihood Clustering (OTRIMLE, see otrimle) together with the density-based cluster quality statistic Q (Hennig and Coretto 2021) for each number of clusters in a given range.


otrimleg(dataset, G=1:6, multicore=TRUE, ncores=detectCores(logical=FALSE)-1,
   erc=20, beta0=0, fixlogicd=NULL, monitor=1, dmaxq=qnorm(0.9995))



dataset: the dataset. Something that can be coerced into an observations-by-variables matrix.


G: vector of integers (normally starting from 1). Numbers of clusters to be considered.


multicore: logical. If TRUE, parallel computing is used through the function mclapply from package parallel; read the warnings there before using it. It does not work on Windows.


ncores: integer. Number of cores for parallelisation.


erc: a number greater than or equal to one specifying the maximum allowed ratio between within-cluster covariance matrix eigenvalues. See otrimle.


beta0: a non-negative constant, penalty term for noise, passed as beta to otrimle; see the documentation there.


fixlogicd: numeric or NULL. Value for the logarithm of the improper constant density logicd, see rimle, which is run instead of otrimle if this is not NULL. NULL means that otrimle determines it from the data.


monitor: 0 or 1. If 1, progress messages are printed on screen.


dmaxq: numeric. Passed as maxq to kerndensmeasure. The interval considered for the one-dimensional density estimator is (-maxq,maxq).
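
The default values of multicore, ncores and dmaxq can be inspected before calling otrimleg. The following is a minimal sketch in base R and is not part of the package: the names physical, ncores_safe and use_multicore are illustrative, and falling back to a single core is an assumption about what is sensible when the core count is unknown or equal to one.

```r
library(parallel)  # ships with base R; provides detectCores() and mclapply()

# Number of physical cores; detectCores() can return NA on some platforms.
physical <- detectCores(logical = FALSE)

# Mirror the default ncores = detectCores(logical=FALSE)-1, but never go below 1
# (assumption: use a single core if the count is unknown or equals 1).
ncores_safe <- if (is.na(physical)) 1L else max(1L, physical - 1L)

# mclapply relies on forking, which is unavailable on Windows.
use_multicore <- .Platform$OS.type != "windows" && ncores_safe > 1L

# Default dmaxq: the 0.9995 quantile of the standard normal, about 3.29,
# so the default density-estimation interval is roughly (-3.29, 3.29).
dmaxq <- qnorm(0.9995)
```

These values could then be passed on, for example as otrimleg(x, multicore=use_multicore, ncores=ncores_safe, dmaxq=dmaxq).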


For estimating the number of clusters, this function is meant to be called by otrimlesimg. The output of otrimleg is not meant to be used directly for estimating the number of clusters; see Hennig and Coretto (2021).


otrimleg returns a list containing the components solution, iloglik, ibic, criterion, logicd, noiseprob, denscrit, ddpm. All of these are lists or vectors of which the component number is the number of clusters.


solution: list of output objects of otrimle or rimle.


iloglik: vector of improper likelihood values from otrimle.


ibic: vector of improper BIC values (small is good) computed from iloglik and the numbers of parameters. Note that the behaviour of the improper likelihood is not compatible with the standard use of the BIC, so this is experimental and should not be trusted for choosing the number of clusters.


criterion: vector of values of the OTRIMLE criterion, see otrimle.


noiseprob: vector of estimated noise proportions, exproportion[1] from otrimle.


denscrit: vector of density-based cluster quality statistics Q (Hennig and Coretto 2021) as provided by the measure component of kerndensmeasure.


ddpm: list of the vectors of cluster-wise density-based cluster quality measures as provided by the ddpm component of kerndensmeasure.
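
Because every component is indexed by the number of clusters, the vector components can be collected into one table for inspection. The sketch below assumes the otrimle package is installed and reuses the banknote subset from this page's example. The object names fit, summary_by_G, sol2 and ddpm2 are illustrative only, and the table is meant for inspection, not for choosing the number of clusters.

```r
# Guarded so the sketch is a no-op when the otrimle package is not installed.
if (requireNamespace("otrimle", quietly = TRUE)) {
  library(otrimle)
  data(banknote)

  # Same subset as in the example of this help page.
  selectdata <- c(1:30, 101:110, 117:136, 160:161)
  x <- banknote[selectdata, 5:7]

  G <- 1:2
  fit <- otrimleg(x, G = G, multicore = FALSE, monitor = 0)

  # Vector components are indexed by the number of clusters.
  summary_by_G <- data.frame(
    G         = G,
    iloglik   = fit$iloglik[G],
    ibic      = fit$ibic[G],
    criterion = fit$criterion[G],
    noiseprob = fit$noiseprob[G],
    denscrit  = fit$denscrit[G]
  )
  print(summary_by_G)

  # List components hold one entry per number of clusters.
  sol2  <- fit$solution[[2]]  # otrimle (or rimle) output for two clusters
  ddpm2 <- fit$ddpm[[2]]      # cluster-wise density-based quality measures
}
```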


Christian Hennig christian.hennig@unibo.it https://www.unibo.it/sitoweb/christian.hennig/en/


See Also

otrimle, rimle, otrimlesimg, kerndensmeasure


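   library(otrimle)
   data(banknote)
   # Subset of observations and three measurement columns of the banknote data
   selectdata <- c(1:30,101:110,117:136,160:161)
   x <- banknote[selectdata,5:7]
   obanknote <- otrimleg(x,G=1:2,multicore=FALSE)  # sequential run for G = 1, 2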

