bigPresidentData {dataMaid} | R Documentation |
Semi-artificial data about the US presidents (extended version)
Description
A dataset with information about the first 45 US presidents as well as a 46th
person, who is not a US president, and a duplicate of one of the 45 actual presidents.
The dataset was constructed to show the capabilities
of dataMaid
and therefore, it has been constructed to include errors and miscodings.
Each observation in the dataset corresponds to a person. The dataset uses the
non-standard class Name
which is simply an attribute that has been added to
two variables in order to show how dataMaid
handles non-supported classes. Note that the dataset
is an extended and more error-filled version of the dataset presidentData
which is
also included in the package.
Usage
bigPresidentData
Format
A data frame with 47 rows and 15 variables.
- lastName
A
Name
type variable containing the last name of the president.- firstName
A
Name
type variable containing the first name of the president.- orderOfPresidency
A factor variable indicating the order of the presidents (with George Washington as number 1 and Donald Trump as number 45).
- birthday
A Date variable with the birthday of the president.
- dateOfDeath
A Date variable with the date of the president's death.
- stateOfBirth
A character variable with the state in which the president was born.
- party
A charcter variable with the party to which the president was associated.
- presidencyBeginDate
A Date variable with the date of inauguration of the president.
- presidencyEndDate
A Date variable with the date at which the presidency ends.
- assassinationAttempt
A numeric variable indicating whether there was an assassination attempt (
1
) or not (0
) on the president.- sex
A factor variable with the sex of the president.
- ethnicity
A factor variable with the ethnicity of the president.
- presidencyYears
A numeric variable with the duration of the presidency, in years.
- ageAtInauguration
A character variable with the age at inauguration.
- favoriteNumber
A
complex
type variable with a fictional favorite number for each president.
Source
Artificial dataset constructed based on the US president dataset available from Data Explorer.
References
Petersen AH, Ekstrøm CT (2019). “dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R.” _Journal of Statistical Software_, *90*(6), 1-38. doi: 10.18637/jss.v090.i06 ( doi: 10.18637/jss.v090.i06).
Examples
data(bigPresidentData)