geco_30over_3err {bstrl}R Documentation

Simulated Noisy Records

Description

A dataset containing several files of noisy simulated records. Records are simulated using GeCo (Tran, Vatsalan, and Cristen (2013)) and organized into files of 200 records each. The columns in each file consist of two ID columns for validating links:

Usage

data(geco_30over_3err)

Format

A list of 7 data.frames. Each data.frame has 200 rows and 16 columns.

Details

The columns also consist of fields used to perform linkage, into which 3 errors have been randomly inserted:

Linkage may be performed on either the full dataset or on only a subset of the fields.

See Also

geco_small


[Package bstrl version 1.0.2 Index]