R: state_match

state_match {messy.cats}

R Documentation

state_match

Description

A wrapper function for cat_match()hat only requires an inputted vector of messy states. state_match() uses a built in clean list of state names state.name as the reference clean vector.

Usage

state_match(messy_states, threshold = NA, p = 0)

Arguments

`messy_states`	Vector containing the messy state names that will be replaced by the closest match from `state.name`
`threshold`	The maximum distance that will form a match. If this argument is specified, any element in the messy vector that has no match closer than the threshold distance will be replaced with NA. Default: NA
`p`	Only used with method "jw", the Jaro-Winkler penatly size. Default: 0

Details

State names are often misspelled or abbreviated in datasets, especially datasets that have been manually digitized or created. state_match() is a warpper function of cat_match() that quickly solves this common issue of mispellings or different formats of country names across datasets. This wrapper function uses a built in clean list of country names state.name as the reference clean vector and matches your inputted messy vector of names to their nearest state in state.name.

Value

state_match() returns a cleaned version of the bad vector, with each element replaced by the most similar element of the good vector.

Examples

if(interactive()){
 #EXAMPLE1
 lst <- c("Indianaa", "Wisvconsin", "aLaska", "NewJersey", "Claifoarni")
 matched <- state_match(lst)
 }

[Package messy.cats version 1.0 Index]