CompareAS16 {PPRL} | R Documentation |
Comparing bit vectors generated by CreateAS16
Description
Comparing all elements of two vectors of records with each other using Armknechts & Schnells methods "create" and "compare".
Usage
CompareAS16(IDA, dataA, IDB, dataB, password, t = 0.85)
Arguments
IDA |
A character vector or integer vector containing the IDs of the first data.frame. |
dataA |
A character vector containing the bit vectors that are to be created by Armknechts method "create". |
IDB |
A character vector or integer vector containing the IDs of the second data.frame. |
dataB |
A character vector containing the bit vectors that are to be created by Armknechts method "create". |
password |
A string containing the password used in the method "create". |
t |
A float containing the lower Tanimoto similarity threshold. |
Details
Two bit vectors generated by CreateAS16
are compared as described in the original publication.
Value
The function returns a data.frame with four columns containing all ID-pairs of all bit vectors, the estimated Tanimoto similarity and the classification (links/non-links).
Source
Armknecht, F., Schnell, R. (unpublished): Privacy Preserving Record Linkage Based on Bloom Filters and Codes. Working Paper.
See Also
Examples
# Load test data
testFile <- file.path(path.package("PPRL"), "extdata/testdata.csv")
testData <- read.csv(testFile, head = FALSE, sep = "\t",
colClasses = "character")
# Create Bloom Filter
testData <- CreateBF(ID = testData$V1, testData$V7, k = 20, padding = 1,
q = 2, l = 1000, password = "(H]$6Uh*-Z204q")
# Optional: Create the new Bloom filter, the output of this function is just
# to see the created bit vectors, it is not the input of CompareAS16.
testAS <- CreateAS16(testData$ID, testData$CLKs, password = "khäuds")
# Compare bit vectcors, input is not the out put of CreateAS16,
# but the original Bloom Filters. CreateAS16 is executated in CompareAS16.
res <- CompareAS16(testData$ID, testData$CLKs, testData$ID, testData$CLKs,
password = "khäuds", t = 0.85)