deduplicate_equivalence {reclin2}R Documentation

Deduplication using equivalence groups

Description

Deduplication using equivalence groups

Usage

deduplicate_equivalence(pairs, variable, selection, x = attr(pairs, "x"))

Arguments

pairs

a pairs object, such as generated by pair_blocking

variable

name of the variable to create in x that will contain the group labels.

selection

a logical variable with the same length as pairs has rows, or the name of such a variable in pairs. Pairs are only selected when select is TRUE. When missing it is assumed all pairs are selected.

x

the first data set; when missing attr(pairs, "x") is used.

Value

Returns x with a variable containing the group labels. Records with the same group label (should) correspond to the same entity.


[Package reclin2 version 0.5.0 Index]