| RecLinkData.object {RecordLinkage} | R Documentation |
Record Linkage Data Object
Description
S3 class representing information about record pairs for Record
Linkage, as returned by functions compare.dedup and
compare.linkage.
Value
A list with at least the following elements:
data(fortype = "deduplication"):Object of class
"data.frame". Data frame of original records.data1,data2(fortype = "linkage"):Objects of class
"data.frame". Data frames of original records.pairs:Object of class
"data.frame"Data frame of data pairs. Each row represents the comparison pattern of two records, identified by columnsid1andid2. The other columns contain for each considered attribute a real number in the range [0..1] representing the degree of similarity. These columns are named according to the respective columns indata. The last column contains the matching status of the pair, coded as 1 for a match or 0 for a non-match.frequencies:Object of class
"numeric"Numeric vector with average frequency of values for each column included inpairs(reciprocal of number of distinct values).type:Object of class
"character"Identifies whether a linkage ("linkage") or a deduplication ("deduplication") project is represented..S3class:Internal slot.
The following elements are optional:
M:Object of class
"numeric"Vector of m-probabilities as calculated byemWeights.U:Object of class
"numeric"Vector of u-probabilities as calculated byemWeights.W:Object of class
"numeric"Vector of log-likelihood weights as calculated byemWeights, corresponding to binary comparison patterns as created bybincombinations.Wdata:Object of class
"numeric"Vector of log-likelihood weights as calculated byemWeights, corresponding to the rows ofpairs.
Author(s)
Andreas Borg, Murat Sariyar
See Also
"RecLinkData" for the S4 representation.
compare.dedup, which creates objects of this class.
"RLBigData", an alternative data structure suitable for
big data sets.