simil {sdam}    R Documentation
Similarity between vectors in columns
Description
A function to compute similarity between vectors from columns in a data frame based on common attribute characteristics.
Usage
simil(x, vars, uniq, diag.incl, dichot, rm.isol, k)
Arguments
x | a list or a data frame object
vars | vector with column(s) in
uniq | unique entries? (optional and logical)
diag.incl | include also entries in matrix diagonal? (optional and logical)
dichot | dichotomize output? (optional and logical)
rm.isol | remove isolates in output? (optional and logical)
k | cut-off for dichotomization (if not specified,
Details
This function computes the similarity between two or more vectors, which can come from columns in a data frame or from list entries.
The similarity of artefacts or other units having common variable attributes specified in vars is computed by simple matching, and it represents a measure of proximity among the items to compare. Comparison takes an id column from x; otherwise, the first column is taken, provided that there are no duplicated entry names.
Both the dichotomization of the output and the removal of isolated items from the system of co-occurrence relations depend on functions from package "multiplex".
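The idea of simple matching can be illustrated with a minimal base R sketch. It assumes that the similarity between two items is the number of variables in vars on which both items hold the same value; the toy data and the helper simple_match() are hypothetical and are not part of "sdam", whose internals may differ.

```r
# Toy data: hypothetical items described by two attribute variables
items <- data.frame(
  id       = c("a", "b", "c", "d"),
  findspot = c("Artaxata", "Artaxata", "Garni", "Garni"),
  language = c("Greek", "Latin", "Greek", "Greek"),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not the sdam implementation): for each pair of items,
# count the variables in `vars` on which both have the same non-missing value
simple_match <- function(x, vars, id = names(x)[1]) {
  n <- nrow(x)
  m <- matrix(0, n, n, dimnames = list(x[[id]], x[[id]]))
  for (v in vars) {
    same <- outer(x[[v]], x[[v]], "==")  # pairwise equality on one variable
    same[is.na(same)] <- FALSE           # missing values never match
    m <- m + same
  }
  diag(m) <- 0                           # ignore self-comparisons
  m
}

sm <- simple_match(items, vars = c("findspot", "language"))
sm
```

In this toy case items c and d share both attributes, so their entry in sm is 2, while b and c share none. A cut-off such as (sm >= 2) * 1 would be one way to dichotomize a matrix of this kind, although the functions in "multiplex" may apply the cut-off differently.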
Value
A valued matrix of similarities among items in x.
Author(s)
Antonio Rivero Ostoic
See Also
edhw, get.edh, dichot, rm.isol, multigraph.
Examples
## Not run:
# get inscriptions from a Roman province
arm <- edhw(province="Armenia")
# select variables and return them as a data frame
armv <- edhw(x=arm, as="df",
             vars=c("findspot_ancient", "type_of_inscription", "type_of_monument", "language"))
# matrix of similarities based on two variables
simil(armv, vars=c("findspot_ancient", "language"))
## End(Not run)