goodall1 {nomclust} | R Documentation |
Goodall 1 (G1) Measure
Description
The function calculates a dissimilarity matrix based on the G1 similarity measure.
Usage
goodall1(data, var.weights = NULL)
Arguments
data |
A data.frame or a matrix with cases in rows and variables in column. |
var.weights |
A numeric vector setting weights to the used variables. One can choose the real numbers from zero to one. |
Details
The Goodall 1 similarity measure was presented in (Boriah et al., 2008). It is a simple modification of the original Goodall measure (Goodall, 1966). The measure assigns higher weights to infrequent matches.
Value
The function returns an object of the class "dist".
Author(s)
Zdenek Sulc.
Contact: zdenek.sulc@vse.cz
References
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.
Goodall V.D. (1966). A new similarity index based on probability. Biometrics, 22(4), p. 882.
See Also
anderberg
,
burnaby
,
eskin
,
gambaryan
,
goodall2
,
goodall3
,
goodall4
,
iof
,
lin
,
lin1
,
of
,
sm
,
smirnov
,
ve
,
vm
.
Examples
# sample data
data(data20)
# dissimilarity matrix calculation
prox.goodall1 <- goodall1(data20)
# dissimilarity matrix calculation with variable weights
weights.goodall1 <- goodall1(data20, var.weights = c(0.7, 1, 0.9, 0.5, 0))