RCLSMIX-class {rebmix}R Documentation

Class "RCLSMIX"

Description

Object of class RCLSMIX.

Objects from the Class

Objects can be created by calls of the form new("RCLSMIX", ...). Accessor methods for the slots are a.o(x = NULL), a.Dataset(x = NULL), a.s(x = NULL), a.ntrain(x = NULL), a.P(x = NULL), a.ntest(x = NULL), a.Zt(x = NULL), a.Zp(x = NULL), a.CM(x = NULL), a.Accuracy(x = NULL), a.Error(x = NULL), a.Precision(x = NULL), a.Sensitivity(x = NULL), a.Specificity(x = NULL) and a.Chunks(x = NULL), where x stands for an object of class RCLSMIX.

Slots

x:

a list of objects of class REBMIX of length oo obtained by running REBMIX on g=1,,sg = 1, \ldots, s train datasets YtraingY_{\mathrm{train}g} all of length ntraingn_{\mathrm{train}g}. For the train datasets the corresponding class membership Ωg\bm{\Omega}_{g} is known. This yields ntrain=g=1sntraingn_{\mathrm{train}} = \sum_{g = 1}^{s} n_{\mathrm{train}g}, while YtrainqYtraing=Y_{\mathrm{train}q} \cap Y_{\mathrm{train}g} = \emptyset for all qgq \neq g. Each object in the list corresponds to one chunk, e.g., (y1j,y3j)(y_{1j}, y_{3j})^{\top}.

o:

number of chunks oo. Y={yj; j=1,,n}Y = \{\bm{y}_{j}; \ j = 1, \ldots, n\} is an observed dd-dimensional dataset of size nn of vector observations yj=(y1j,,ydj)\bm{y}_{j} = (y_{1j}, \ldots, y_{dj})^{\top} and is partitioned into train and test datasets. Vector observations yj\bm{y}_{j} may further be split into oo chunks when running REBMIX, e.g., for d=6d = 6 and o=3o = 3 the set of chunks substituting yj\bm{y}_{j} may be as follows (y1j,y3j)(y_{1j}, y_{3j})^{\top}, (y2j,y4j,y6j)(y_{2j}, y_{4j}, y_{6j})^{\top} and y5jy_{5j}.

Dataset:

a data frame containing test dataset YtestY_{\mathrm{test}} of length ntestn_{\mathrm{test}}. For the test dataset the corresponding class membership Ωg\bm{\Omega}_{g} is not known.

s:

finite set of size ss of classes Ω={Ωg; g=1,,s}\bm{\Omega} = \{\bm{\Omega}_{g}; \ g = 1, \ldots, s\}.

ntrain:

a vector of length ss containing numbers of observations in train datasets YtraingY_{\mathrm{train}g}.

P:

a vector of length ss containing prior probabilities P(Ωg)=ntraingntrainP(\bm{\Omega}_{g}) = \frac{n_{\mathrm{train}g}}{n_{\mathrm{train}}}.

ntest:

number of observations in test dataset YtestY_{\mathrm{test}}.

Zt:

a factor of true class membership Ωg\bm{\Omega}_{g} for the test dataset.

Zp:

a factor of predictive class membership Ωg\bm{\Omega}_{g} for the test dataset.

CM:

a table containing confusion matrix for multiclass classifier. It contains number xqgx_{qg} of test observations with the true class qq that are classified into the class gg, where q,g=1,,sq, g = 1, \ldots, s.

Accuracy:

proportion of all test observations that are classified correctly. Accuracy=g=1sxggntest\mathrm{Accuracy} = \frac{\sum_{g = 1}^{s} x_{gg}}{n_{\mathrm{test}}}.

Error:

proportion of all test observations that are classified wrongly. Error=1Accuracy\mathrm{Error} = 1 - \mathrm{Accuracy}.

Precision:

a vector containing proportions of predictive observations in class gg that are classified correctly into class gg. Precision(g)=xggq=1sxqg\mathrm{Precision}(g) = \frac{x_{gg}}{\sum_{q = 1}^{s} x_{qg}}.

Sensitivity:

a vector containing proportions of test observations in class gg that are classified correctly into class gg. Sensitivity(g)=xggq=1sxgq\mathrm{Sensitivity}(g) = \frac{x_{gg}}{\sum_{q = 1}^{s} x_{gq}}.

Specificity:

a vector containing proportions of test observations that are not in class gg and are classified into the non gg class. Specificity(g)=ntestq=1sxqgntestq=1sxgq\mathrm{Specificity}(g) = \frac{n_{\mathrm{test}} - \sum_{q = 1}^{s} x_{qg}}{n_{\mathrm{test}} - \sum_{q = 1}^{s} x_{gq}}.

Chunks:

a vector containing selected chunks.

Author(s)

Marko Nagode

References

D. M. Dziuda. Data Mining for Genomics and Proteomics: Analysis of Gene and Protein Expression Data. John Wiley & Sons, New York, 2010.


[Package rebmix version 2.16.0 Index]