bigPresidentData {dataMaid}R Documentation

Semi-artificial data about the US presidents (extended version)

Description

A dataset with information about the first 45 US presidents as well as a 46th person, who is not a US president, and a duplicate of one of the 45 actual presidents. The dataset was constructed to show the capabilities of dataMaid and therefore, it has been constructed to include errors and miscodings. Each observation in the dataset corresponds to a person. The dataset uses the non-standard class Name which is simply an attribute that has been added to two variables in order to show how dataMaid handles non-supported classes. Note that the dataset is an extended and more error-filled version of the dataset presidentData which is also included in the package.

Usage

bigPresidentData

Format

A data frame with 47 rows and 15 variables.

lastName

A Name type variable containing the last name of the president.

firstName

A Name type variable containing the first name of the president.

orderOfPresidency

A factor variable indicating the order of the presidents (with George Washington as number 1 and Donald Trump as number 45).

birthday

A Date variable with the birthday of the president.

dateOfDeath

A Date variable with the date of the president's death.

stateOfBirth

A character variable with the state in which the president was born.

party

A charcter variable with the party to which the president was associated.

presidencyBeginDate

A Date variable with the date of inauguration of the president.

presidencyEndDate

A Date variable with the date at which the presidency ends.

assassinationAttempt

A numeric variable indicating whether there was an assassination attempt (1) or not (0) on the president.

sex

A factor variable with the sex of the president.

ethnicity

A factor variable with the ethnicity of the president.

presidencyYears

A numeric variable with the duration of the presidency, in years.

ageAtInauguration

A character variable with the age at inauguration.

favoriteNumber

A complex type variable with a fictional favorite number for each president.

Source

Artificial dataset constructed based on the US president dataset available from Data Explorer.

References

Petersen AH, Ekstrøm CT (2019). “dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R.” _Journal of Statistical Software_, *90*(6), 1-38. doi: 10.18637/jss.v090.i06 ( doi: 10.18637/jss.v090.i06).

Examples

data(bigPresidentData)


[Package dataMaid version 1.4.1 Index]