gammaCK2par {fastLink} | R Documentation |
gammaCK2par
Description
Field comparisons for string variables. Two possible agreement patterns are considered: 0 total disagreement, 2 agreement. The distance between strings is calculated using a Jaro-Winkler distance.
Usage
gammaCK2par(matAp, matBp, n.cores, cut.a, method, w)
Arguments
matAp |
vector storing the comparison field in data set 1 |
matBp |
vector storing the comparison field in data set 2 |
n.cores |
Number of cores to parallelize over. Default is NULL. |
cut.a |
Lower bound for full match, ranging between 0 and 1. Default is 0.92 |
method |
String distance method, options are: "jw" Jaro-Winkler (Default), "dl" Damerau-Levenshtein, "jaro" Jaro, and "lv" Edit |
w |
Parameter that describes the importance of the first characters of a string (only needed if method = "jw"). Default is .10 |
Value
gammaCK2par
returns a list with the indices corresponding to each
matching pattern, which can be fed directly into tableCounts
and matchesLink
.
Author(s)
Ted Enamorado <ted.enamorado@gmail.com>, Ben Fifield <benfifield@gmail.com>, and Kosuke Imai
Examples
## Not run:
g1 <- gammaCK2par(dfA$firstname, dfB$lastname)
## End(Not run)