MixSim {MixSim}R Documentation

Mixture Simulation

Description

Generates a finite mixture model with Gaussian components for prespecified levels of maximum and/or average overlaps.

Usage

MixSim(BarOmega = NULL, MaxOmega = NULL, K, p, sph = FALSE, hom = FALSE,
       ecc = 0.90, PiLow = 1.0, int = c(0.0, 1.0), resN = 100,
       eps = 1e-06, lim = 1e06)

Arguments

BarOmega

value of desired average overlap.

MaxOmega

value of desired maximum overlap.

K

number of components.

p

number of dimensions.

sph

covariance matrix structure (FALSE - non-spherical, TRUE - spherical).

hom

heterogeneous or homogeneous clusters (FALSE - heterogeneous, TRUE - homogeneous).

ecc

maximum eccentricity.

PiLow

value of the smallest mixing proportion (if 'PiLow' is not reachable with respect to K, equal proportions are taken; PiLow = 1.0 implies equal proportions by default).

int

mean vectors are simulated uniformly on a hypercube with sides specified by int = (lower.bound, upper.bound).

resN

maximum number of mixture resimulations.

eps

error bound for overlap computation.

lim

maximum number of integration terms (Davies, 1980).

Details

If 'BarOmega' is not specified, the function generates a mixture solely based on 'MaxOmega'; if 'MaxOmega' is not specified, the function generates a mixture solely based on 'BarOmega'.

If 'hom' is TRUE, only one of 'BarOmega' or 'MaxOmega' can be specified.

Value

Pi

vector of mixing proportions.

Mu

matrix consisting of components' mean vectors (K * p).

S

set of components' covariance matrices (p * p * K).

OmegaMap

matrix of misclassification probabilities (K * K); OmegaMap[i,j] is the probability that X coming from the i-th component is classified to the j-th component.

BarOmega

value of average overlap.

MaxOmega

value of maximum overlap.

rcMax

row and column numbers for the pair of components producing maximum overlap 'MaxOmega'.

fail

flag value; 0 represents successful mixture generation, 1 represents failure.

Author(s)

Volodymyr Melnykov, Wei-Chen Chen, and Ranjan Maitra.

References

Maitra, R. and Melnykov, V. (2010) “Simulating data to study performance of finite mixture modeling and clustering algorithms”, The Journal of Computational and Graphical Statistics, 2:19, 354-376.

Melnykov, V., Chen, W.-C., and Maitra, R. (2012) “MixSim: An R Package for Simulating Data to Study Performance of Clustering Algorithms”, Journal of Statistical Software, 51:12, 1-25.

Davies, R. (1980) “The distribution of a linear combination of chi-square random variables”, Applied Statistics, 29, 323-333.

See Also

overlap, pdplot, and simdataset.

Examples

set.seed(1234)

# controls average and maximum overlaps
(ex.1 <- MixSim(BarOmega = 0.05, MaxOmega = 0.15, K = 4, p = 5))
summary(ex.1)

# controls average overlap
(ex.2 <- MixSim(BarOmega = 0.05, K = 4, p = 5, hom = TRUE))
summary(ex.2)

# controls maximum overlap
(ex.3 <- MixSim(MaxOmega = 0.15, K = 4, p = 5, sph = TRUE))
summary(ex.3)

[Package MixSim version 1.1-8 Index]