EDOtrans {EDOtrans}R Documentation

Euclidean distance-optimized data transformation

Description

The package provides the necessary functions for performing the EDO data transformation.

Usage

EDOtrans(Data, Cls, PlotIt = FALSE, FitAlg = "normalmixEM", Criterion = "LR",
                     MaxModes = 8, MaxCores = getOption("mc.cores", 2L), Seed)

Arguments

Data

the data as a vector.

Cls

the class information, if any, as a vector of similar length as instances in the data.

PlotIt

whether to plot the fit directly.

FitAlg

which fit algorithm to use: "ClusterRGMM" = GMM from ClusterR, "densityMclust" from mclust, "DO" from DistributionOptimization (slow), "MCMC" = NMixMCMC from mixAK, or "normalmixEM" from mixtools.

Criterion

which criterion should be used to establish the number of modes from the best GMM fit: "AIC", "BIC", "FM", "GAP", "LR" (likelihood ratio test), "NbClust" (from NbClust), "SI" (Silverman).

MaxModes

for automated GMM assessment: the maximum number of modes to be tried.

MaxCores

for automated GMM assessment: the maximum number of processor cores used under Unix.

Seed

seed parameter set internally.

Value

Returns a list of transformed data and class assignments.

DataEDO

the EDO transformed data.

EDOfactor

the factor by which each data value has been divided.

Cls

the class information for each data instance.

Author(s)

Jorn Lotsch and Alfred Ultsch

References

Lotsch, J., Ultsch, A. (2021): EDOtrans – an R Package for Euclidean distance-optimized data transformation.

Examples

## example 1
data(iris)
IrisEDOdata <- EDOtrans(Data = as.vector(iris[,1]), Cls = as.integer(iris$Species))

[Package EDOtrans version 0.2.5 Index]