mst {MSclust} | R Documentation |
Mixture of Multiple Scaled Student-t Distributions
Description
Fits the mixture of multiple scaled Student-t distributions to the given data.
Usage
mst(X,k,ini="km",sz=NULL,df.min=1,dfU="num",frm="dir",m="BFGS",stop=c(10^-5,200),VB=FALSE)
Arguments
X |
A matrix or data frame such that rows correspond to observations and columns correspond to variables. |
k |
The number of clusters. |
ini |
Using kmeans by default or |
sz |
If initialization is manual, this matrix contains the starting value for |
df.min |
Minimum proportion of good points in each group for the contaminated normal distribution. |
dfU |
Criterion to update the degrees of freedom. |
frm |
Direct by default or indirect, technique used to compute the density function. |
m |
Method for the optimization of the eigenvector matrix, see optim for other options. |
stop |
2-dimensional vector with the Aitken criterion stopping rule and Maximum number of iterations. |
VB |
If true, tracing information on the progress of the optimization is produced; see optim() for details and plotting of the log-likelihood versus iterations. |
Value
X |
Data used for clustering. |
n |
The number of observations in the data. |
d |
The number of features in the data. |
k |
Value corresponding to the number of components. |
cluster |
Vector of group membership as determined by the model. |
detect |
Detect if the point is bad or not per each principal component given the cluster membership. |
npar |
The number of parameters. |
mu |
Either a vector of length |
Lambda |
Orthogonal matrix whose columns are the normalized eigenvectors of Sigma. |
Gamma |
Diagonal matrix of the eigenvalues of Sigma. |
Sigma |
A symmetric positive-definite matrix representing the scale matrix of the distribution. |
df |
vector containing the degrees of freedom for each component. |
z |
The component membership of each observations. |
v |
The indicator if an observation is good or bad with respect to each dimension; 1 is good, and 0 means bad. |
weight |
The matrix of the expected value of the characteristic weights; corespond to the value of |
iter.stop |
The number of iterations until convergence for the model. |
loglik |
The log-likelihood corresponding to the model. |
AIC |
The Akaike's Information Criterion of the model. |
BIC |
The Bayesian Information Criterion of the model. |
ICL |
The Integrated Completed Likelihood of the model. |
KIC |
The Kullback Information Criterion of the model. |
KICc |
The Bias correction of the Kullback Information Criterion of the model. |
AWE |
The Approximate Weight of Evidence of the model. |
AIC3 |
Another version of Akaike's Information Criterion of the model. |
CAIC |
The Consistent Akaike's Information Criterion of the model. |
AICc |
The AIC version which is used when sample size |
CLC |
The Classification Likelihood Criterion of the model. |
Author(s)
Cristina Tortora and Antonio Punzo
References
Forbes, F. & Wraith, D. (2014). A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering. Statistics and Computing, 24(6), 971–984.
Examples
## Not run:
## Not run:
data(sim)
result <- mst(X = sim, k = 2)
plot(result)
## End(Not run)
## End(Not run)