cosine {lsa} | R Documentation |
Cosine Measure (Matrices)
Description
Calculates the cosine measure between two vectors or between all column vectors of a matrix.
Usage
cosine(x, y = NULL)
Arguments
x |
A vector or a matrix (e.g., a document-term matrix). |
y |
Optional: a vector with compatible dimensions to |
Details
cosine()
calculates a similarity matrix between all column
vectors of a matrix x
. This matrix might be a document-term
matrix, so columns would be expected to be documents and
rows to be terms.
When executed on two vectors x
and y
,
cosine()
calculates the cosine similarity between them.
Value
Returns a n*n
similarity matrix of cosine values, comparing all
n
column vectors against each other. Executed on two vectors, their
cosine similarity value is returned.
Note
The cosine measure is nearly identical with the pearson correlation
coefficient (besides a constant factor) cor(method="pearson")
.
For an investigation on the differences in the context of textmining see
(Leydesdorff, 2005).
Author(s)
Fridolin Wild f.wild@open.ac.uk
References
Leydesdorff, L. (2005) Similarity Measures, Author Cocitation Analysis,and Information Theory. In: JASIST 56(7), pp.769-772.
See Also
Examples
## the cosinus measure between two vectors
vec1 = c( 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0 )
vec2 = c( 0, 0, 1, 1, 1, 1, 1, 0, 1, 0, 0, 0 )
cosine(vec1,vec2)
## the cosine measure for all document vectors of a matrix
vec3 = c( 0, 1, 0, 1, 1, 0, 0, 1, 0, 0, 0, 0 )
matrix = cbind(vec1,vec2, vec3)
cosine(matrix)