R: dcem_test: Part of DCEM package.

dcem_test {DCEM}

R Documentation

dcem_test: Part of DCEM package.

Description

Demonstrates the execution of the package on the bundled dataset.

Usage

dcem_test()

Details

dcem_test performs the following steps in order:

Read the data from disk (from the file data/ionosphere_data.csv). The data folder is under the package installation folder.
The dataset details can be seen by typing ionosphere_data in the R console or at http://archive.ics.uci.edu/ml/datasets/Ionosphere.
Clean the data by removing columns. The data should be cleaned before use. Refer to trim_data to see which columns should be removed and how. The package provides a basic interface for removing columns.
Call dcem_star_train on the cleaned data.
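The steps above can be sketched roughly as follows. This is an illustrative approximation, not the body of dcem_test itself. It assumes the DCEM package is installed, loads the bundled ionosphere_data object instead of reading the CSV file from disk, drops columns with base R indexing rather than trim_data, and uses argument values (iteration_count = 50, num_clusters = 2) chosen only for illustration. In the UCI Ionosphere data, column 35 holds the class label and column 2 is all zeros, so those two columns are the ones removed here.

```r
# Minimal sketch of the three steps; runs only if the DCEM package is installed.
if (requireNamespace("DCEM", quietly = TRUE)) {
  # Step 1: load the bundled dataset (assumption: used here as a stand-in for
  # reading data/ionosphere_data.csv from the installation folder).
  data("ionosphere_data", package = "DCEM")

  # Step 2: remove the class label (column 35) and the all-zero column (column 2).
  # Base R indexing is used; trim_data is the package's own interface for this.
  cleaned <- ionosphere_data[, -c(2, 35)]

  # Step 3: train on the cleaned data. The argument values are illustrative
  # assumptions, not necessarily the ones dcem_test uses.
  fit <- DCEM::dcem_star_train(cleaned, iteration_count = 50, num_clusters = 2)
}
```

Cleaning matters because the training function expects numeric features only; leaving the label column in would mix class information into the clustering input.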

Accessing the output parameters

The function dcem_test() calls dcem_star_train. It returns a list of objects as output. This list contains the estimated parameters of the Gaussian(s) (posterior probabilities, meu, sigma and prior). The parameters can be accessed as follows, where sample_out is the list containing the output:

(1) Posterior probabilities: sample_out$prob A matrix of posterior probabilities.
(2) Meu: sample_out$meu

For multivariate data: It is a matrix of meu(s). Each row in the matrix corresponds to one meu.
(3) Co-variance matrices: sample_out$sigma

For multivariate data: List of co-variance matrices for the Gaussian(s).

Standard-deviation: sample_out$sigma

For univariate data: Vector of standard deviations for the Gaussian(s).
(4) Priors: sample_out$prior A vector of priors.
(5) Membership: sample_out$membership A dataframe of cluster membership for the data. Column numbers are data indices and values are the assigned clusters.
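The access patterns listed above can be sketched as follows. This is a hedged example rather than official usage: it assumes the DCEM package is installed, that sample_out is the value returned by dcem_test(), and that the mean estimates are stored under sample_out$meu, following the naming of the other fields.

```r
# Minimal sketch; runs only if the DCEM package is installed.
if (requireNamespace("DCEM", quietly = TRUE)) {
  sample_out <- DCEM::dcem_test()

  # Overview of the top-level components of the returned list.
  str(sample_out, max.level = 1)

  # (1) Posterior probabilities.
  print(dim(sample_out$prob))

  # (2) Means (assumed field name: meu); one row per Gaussian for multivariate data.
  print(sample_out$meu)

  # (3) Covariance matrices: a list with one entry per Gaussian for multivariate data.
  print(length(sample_out$sigma))

  # (4) Priors: one entry per Gaussian.
  print(sample_out$prior)

  # (5) Cluster sizes; unlist() flattens the membership data frame into a vector.
  print(table(unlist(sample_out$membership)))
}
```

Explicit print() calls are used because expressions inside an if block are not auto-printed at the console.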

References

Parichit Sharma, Hasan Kurban, Mehmet Dalkilic. DCEM: An R package for clustering big data via data-centric modification of Expectation Maximization. SoftwareX, 17, 100944. URL https://doi.org/10.1016/j.softx.2021.100944

[Package DCEM version 2.0.5 Index]