RecLinkData.object {RecordLinkage}    R Documentation
Record Linkage Data Object
Description
S3 class representing information about record pairs for Record Linkage, as returned by the functions compare.dedup and compare.linkage.
Value
A list with at least the following elements:
data (for type = "deduplication"): Object of class "data.frame". Data frame of original records.
data1, data2 (for type = "linkage"): Objects of class "data.frame". Data frames of original records.
pairs: Object of class "data.frame". Data frame of data pairs. Each row represents the comparison pattern of two records, identified by columns id1 and id2. The other columns contain, for each considered attribute, a real number in the range [0, 1] representing the degree of similarity. These columns are named according to the respective columns in data. The last column contains the matching status of the pair, coded as 1 for a match or 0 for a non-match.
frequencies: Object of class "numeric". Numeric vector with the average frequency of values for each column included in pairs (reciprocal of the number of distinct values).
type: Object of class "character". Identifies whether a linkage ("linkage") or a deduplication ("deduplication") project is represented.
.S3class: Internal slot.
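The layout of the required elements can be illustrated with a hand-built list in base R. This is only a sketch of the documented structure, not how compare.dedup builds the object: the toy records, the exact-agreement comparison, the assumed match status and the column name is_match are all assumptions made for illustration.

```r
# Toy data set (hypothetical records, for illustration only)
records <- data.frame(
  fname = c("ANNA", "ANNA", "PETER"),
  lname = c("MEIER", "MEYER", "MEIER"),
  stringsAsFactors = FALSE
)

# All unordered record pairs; each row holds the indices (id1, id2)
ids <- t(combn(nrow(records), 2))

# Exact comparison: 1 if both records agree on the attribute, 0 otherwise
agree <- function(col) {
  as.numeric(records[ids[, 1], col] == records[ids[, 2], col])
}

pairs <- data.frame(
  id1 = ids[, 1],
  id2 = ids[, 2],
  fname = agree("fname"),
  lname = agree("lname"),
  is_match = c(1, 0, 0)  # assumed status: only records 1 and 2 are duplicates
)

# Average frequency per attribute: reciprocal of the number of distinct values
frequencies <- sapply(records, function(x) 1 / length(unique(x)))

# Plain list mirroring the documented element names for a deduplication project
toy <- list(
  data = records,
  pairs = pairs,
  frequencies = frequencies,
  type = "deduplication"
)

str(toy)
```

Each row of toy$pairs is one comparison pattern, and toy$frequencies holds one value per compared attribute.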
The following elements are optional:
M: Object of class "numeric". Vector of m-probabilities as calculated by emWeights.
U: Object of class "numeric". Vector of u-probabilities as calculated by emWeights.
W: Object of class "numeric". Vector of log-likelihood weights as calculated by emWeights, corresponding to binary comparison patterns as created by bincombinations.
Wdata: Object of class "numeric". Vector of log-likelihood weights as calculated by emWeights, corresponding to the rows of pairs.
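A short sketch of how the optional elements might be obtained and inspected, assuming the RecordLinkage package is installed. RLdata500 and identity.RLdata500 are the example data shipped with the package; the blocking fields are an arbitrary choice made for this illustration.

```r
library(RecordLinkage)

# Example data set with known duplicates, shipped with the package
data(RLdata500)

# Build comparison patterns for a deduplication project;
# blocking on column 1 or on columns 5 to 7 keeps the number of pairs small
rpairs <- compare.dedup(
  RLdata500,
  identity = identity.RLdata500,
  blockfld = list(1, 5:7)
)

# Before weight calculation only the required elements are present
names(rpairs)

# emWeights returns the object with M, U, W and Wdata added
rpairs <- emWeights(rpairs)

# Wdata holds one weight per row of pairs
length(rpairs$Wdata) == nrow(rpairs$pairs)

# m- and u-probabilities
rpairs$M
rpairs$U
```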
Author(s)
Andreas Borg, Murat Sariyar
See Also
"RecLinkData" for the S4 representation.
compare.dedup, which creates objects of this class.
"RLBigData", an alternative data structure suitable for big data sets.