geco_small {bstrl}R Documentation

Simulated Noisy Records (smaller set)

Description

A dataset containing several files of noisy simulated records. Records are simulated using GeCo (Tran, Vatsalan, and Cristen (2013)) and organized into files of 10 records each. These files are subsets of the larger dataset. The columns in each file consist of two ID columns for validating links:

Usage

data(geco_small)

Format

A list of 7 data.frames. Each data.frame has 10 rows and 16 columns.

Details

The columns also consist of fields used to perform linkage, into which 3 errors have been randomly inserted:

Linkage may be performed on either the full dataset or on only a subset of the fields.

See Also

geco_30over_3err


[Package bstrl version 1.0.2 Index]