gambaryan {nomclust}R Documentation

Gambaryan (GA) Measure

Description

The function calculates a dissimilarity matrix based on the GA similarity measure.

Usage

gambaryan(data)

Arguments

data

A data.frame or a matrix with cases in rows and variables in columns.

Details

The Gambaryan similarity measure was presented in (Gambaryan, 1964). The measure assigns low weight to matches where the matching value occurs in about half the dataset, i.e., in between being frequent and rare, see (Borian et al., 2008).

Value

The function returns an object of the class "dist".

Author(s)

Zdenek Sulc.
Contact: zdenek.sulc@vse.cz

References

Gambaryan P. (1964). A mathematical model of taxonomy. SSR, 17(12), 47-53.

Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation. In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.

See Also

anderberg, burnaby, eskin, goodall1, goodall2, goodall3, goodall4, iof, lin, lin1, of, sm, smirnov, ve, vm.

Examples

# sample data
data(data20)

# dissimilarity matrix calculation
prox.gambaryan <- gambaryan(data20)


[Package nomclust version 2.8.0 Index]