wgt_jaccard_distance {fedmatch} | R Documentation |
Computing Weighted Jaccard Distance
Description
wgt_jaccard_distance computes the Weighted Jaccard distance between
pairs of strings. It is vectorized over its inputs, which must be two
character vectors of equal length.
Usage
wgt_jaccard_distance(string_1, string_2, corpus, nthreads = 1)
Arguments
string_1 | character vector
string_2 | character vector
corpus | corpus data.table, constructed with
nthreads | number of threads to use in the underlying C++ code
Details
See the vignette fuzzy_matching
for details on how the Weighted Jaccard similarity is computed.
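As a rough illustration of the general idea only, the base R sketch below computes a word-level weighted Jaccard distance. It is not the package's implementation: the function name weighted_jaccard_sketch, the whitespace tokenization, the hand-picked weights vector, and the choice of distance as one minus the weighted share of common tokens are all assumptions made for this example. The vignette remains the reference for how fedmatch actually tokenizes and weights strings.

```r
# Illustrative sketch only, not fedmatch's implementation.
# `weights` is a hypothetical named numeric vector mapping token -> weight;
# every token in the inputs is assumed to have an entry in it.
weighted_jaccard_sketch <- function(a, b, weights) {
  # Mirror the documented requirement of two equal-length vectors.
  stopifnot(is.character(a), is.character(b), length(a) == length(b))
  vapply(seq_along(a), function(i) {
    tok_a <- unique(strsplit(a[i], "\\s+")[[1]])
    tok_b <- unique(strsplit(b[i], "\\s+")[[1]])
    shared <- intersect(tok_a, tok_b)
    all_tokens <- union(tok_a, tok_b)
    # Distance = 1 - (weight of shared tokens / weight of all tokens).
    1 - sum(weights[shared]) / sum(weights[all_tokens])
  }, numeric(1))
}

# Made-up weights: rarer words get larger values.
weights <- c(bank = 0.2, of = 0.1, america = 1.5, england = 1.7)

d <- weighted_jaccard_sketch(
  c("bank of america", "bank of america"),
  c("bank of america", "bank of england"),
  weights
)
print(d)
```

With these weights, the identical pair has distance 0, while the second pair shares only the low-weight tokens "bank" and "of" and so ends up close to 1.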
Value
numeric vector of Weighted Jaccard distances, one for each pair of corresponding elements of string_1 and string_2.
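Examples
A minimal usage sketch, assuming the fedmatch package is installed. The input names are invented for illustration, and the corpus is built here with fedmatch::build_corpus() called on the two name vectors, which is an assumption about the intended workflow rather than something this page states. The call to wgt_jaccard_distance follows the signature shown under Usage.

```r
# Runs only if fedmatch is available; otherwise the block is skipped.
if (requireNamespace("fedmatch", quietly = TRUE)) {
  # Invented example data: two equal-length character vectors.
  names_1 <- c("bank of america", "first national bank", "acme corp")
  names_2 <- c("bank of amerca", "first natl bank", "acme corporation")

  # Assumption: build_corpus() accepts the two name vectors and returns
  # the corpus data.table expected by the `corpus` argument.
  corpus <- fedmatch::build_corpus(names_1, names_2)

  distances <- fedmatch::wgt_jaccard_distance(
    names_1, names_2,
    corpus = corpus,
    nthreads = 1
  )

  # One distance per pair of corresponding elements.
  stopifnot(length(distances) == length(names_1))
  print(distances)
}
```

Because the function works pairwise, element i of the result compares names_1[i] with names_2[i]; comparing every name against every other name would require expanding the inputs into all pairs first.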
[Package fedmatch version 2.0.6 Index]