combine_make_match_string {podcleaner}R Documentation

Mutate operation(s) in directory data.frame trade address column

Description

Creates a 'match.string' column in the provided Scottish post office directory data.frame composed of entry(/ies) full name and trade address pasted together. Missing trade address entry(/ies) are replaced with a random generated string.

Usage

combine_make_match_string(directory)

Arguments

directory

A Scottish post office directory in the form of a data.frame or other object that inherits from the data.frame class such as a tibble. Columns must at least include forename, surname, address.trade.number, address.trade.body.

Value

A data.frame of the same class as the one provided in directory; columns include at least forename, surname, address.trade.number, address.trade.body, match.string.

Details

The purpose of the 'match.string' column is to facilitates the matching of the general to trades directory down the line. It allows to calculate a string distance metric between each pair of entries and match those falling below a specified threshold.

See Also

combine_match_general_to_trades for the matching of the general to trades directory.


[Package podcleaner version 0.1.2 Index]