general_fix_structure {podcleaner}R Documentation

Mutate operation(s) in Scottish post office general directory data.frame column(s)

Description

Attempts to fix the structure of the raw Scottish post office general directory data.frame provided. For each entry, general_fix_structure attempts to fix parsing errors by moving pieces of information provided to the right columns; further attempts to separate trade from house address, separate multiple trade addresses as well as separate number from address body.

Usage

general_fix_structure(directory, verbose)

Arguments

directory

A Scottish post office general directory in the form of a data.frame or other object that inherits from the data.frame class such as a tibble. Columns must at least include occupation, addresses.

verbose

Whether the function should be executed silently (FALSE) or not (TRUE).

Value

A data.frame of the same class as the one provided in directory; columns include at least occupation, address.trade.number, address.trade.body, address.house.number and address.house.body. "house" suffix in occupation column is move to addresses, occupation information is repatriated from addresses to occupation column; addresses is split into trade and house address columns; additional records are created for each extra trade address identified.


[Package podcleaner version 0.1.2 Index]