FKM.gk.noise {fclust} | R Documentation |
Gustafson and Kessel - like fuzzy k-means with noise cluster
Description
Performs the Gustafson and Kessel - like fuzzy k-means clustering algorithm with noise cluster.
Differently from fuzzy k-means, it is able to discover non-spherical clusters.
The noise cluster is an additional cluster (with respect to the k standard clusters) such that objects recognized to be outliers are assigned to it with high membership degrees.
Usage
FKM.gk.noise (X, k, m, vp, delta, RS, stand, startU, index, alpha, conv, maxit, seed)
Arguments
X |
Matrix or data.frame |
k |
An integer value or vector specifying the number of clusters for which the |
m |
Parameter of fuzziness (default: 2) |
vp |
Volume parameter (default: |
delta |
Noise distance (default: average Euclidean distance between objects and prototypes from |
RS |
Number of (random) starts (default: 1) |
stand |
Standardization: if |
startU |
Rational start for the membership degree matrix |
index |
Cluster validity index to select the number of clusters: |
alpha |
Weighting coefficient for the fuzzy silhouette index |
conv |
Convergence criterion (default: 1e-9) |
maxit |
Maximum number of iterations (default: 1e+6) |
seed |
Seed value for random number generation (default: NULL) |
Details
If startU
is given, the argument k
is ignored (the number of clusters is ncol(startU)
).
If startU
is given, the first element of value
, cput
and iter
refer to the rational start.
If a cluster covariance matrix becomes singular, then the algorithm stops and the element of value
is NaN.
The Babuska et al. variant in FKM.gkb.noise
is recommended.
Value
Object of class fclust
, which is a list with the following components:
U |
Membership degree matrix |
H |
Prototype matrix |
F |
Array containing the covariance matrices of all the clusters |
clus |
Matrix containing the indexes of the clusters where the objects are assigned (column 1) and the associated membership degrees (column 2) |
medoid |
Vector containing the indexes of the medoid objects ( |
value |
Vector containing the loss function values for the |
criterion |
Vector containing the values of the cluster validity index |
iter |
Vector containing the numbers of iterations for the |
k |
Number of clusters |
m |
Parameter of fuzziness |
ent |
Degree of fuzzy entropy ( |
b |
Parameter of the polynomial fuzzifier ( |
vp |
Volume parameter |
delta |
Noise distance |
gam |
Weighting parameter for the fuzzy covariance matrices ( |
mcn |
Maximum condition number for the fuzzy covariance matrices ( |
stand |
Standardization (Yes if |
Xca |
Data used in the clustering algorithm (standardized data if |
X |
Raw data |
D |
Dissimilarity matrix ( |
call |
Matched call |
Author(s)
Paolo Giordani, Maria Brigida Ferraro, Alessio Serafini
References
Dave' R.N., 1991. Characterization and detection of noise in clustering. Pattern Recognition Letters, 12, 657-664.
Gustafson E.E., Kessel W.C., 1978. Fuzzy clustering with a fuzzy covariance matrix. Proceedings of the IEEE Conference on Decision and Control, pp. 761-766.
See Also
FKM.gkb.noise
, Fclust
, Fclust.index
, print.fclust
, summary.fclust
, plot.fclust
, unemployment
Examples
## Not run:
## unemployment data
data(unemployment)
## Gustafson and Kessel-like fuzzy k-means with noise cluster, fixing the number of clusters
clust=FKM.gk.noise(unemployment,k=3,delta=20,RS=10)
## Gustafson and Kessel-like fuzzy k-means with noise cluster, selecting the number of clusters
clust=FKM.gk.noise(unemployment,k=2:6,delta=20,RS=10)
## End(Not run)