gambaryan {nomclust} | R Documentation |
Gambaryan (GA) Measure
Description
The function calculates a dissimilarity matrix based on the GA similarity measure.
Usage
gambaryan(data)
Arguments
data |
A data.frame or a matrix with cases in rows and variables in columns. |
Details
The Gambaryan similarity measure was presented in (Gambaryan, 1964). The measure assigns low weight to matches where the matching value occurs in about half the dataset, i.e., in between being frequent and rare, see (Borian et al., 2008).
Value
The function returns an object of the class "dist".
Author(s)
Zdenek Sulc.
Contact: zdenek.sulc@vse.cz
References
Gambaryan P. (1964). A mathematical model of taxonomy.
SSR, 17(12), 47-53.
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.
See Also
anderberg
,
burnaby
,
eskin
,
goodall1
,
goodall2
,
goodall3
,
goodall4
,
iof
,
lin
,
lin1
,
of
,
sm
,
smirnov
,
ve
,
vm
.
Examples
# sample data
data(data20)
# dissimilarity matrix calculation
prox.gambaryan <- gambaryan(data20)