| stuatt {eeptools} | R Documentation |
Student Attributes from the Strategic Data Project Toolkit
Description
A synthetic dataset of student attributes from the Strategic Data Project which includes records with errors to practice data cleaning and implementing business rules for consistency in data.
Usage
stuatt
Format
A data frame with 87534 observations on the following 9 variables.
sida numeric vector of the unique student ID
school_yeara numeric vector of the school year
malea numeric vector indicating 1 = male
race_ethnicitya factor with levels
ABHM/OWbirth_datea numeric vector of the student birthdate
first_9th_school_year_reporteda numeric vector of the first year a student is reported in 9th grade
hs_diplomaa numeric vector
hs_diploma_typea factor with levels
Alternative DiplomaCollege Prep DiplomaStandard Diplomahs_diploma_datea factor with levels
12/2/200812/21/20084/14/20084/18/2008...
Details
This is the non-clean version of the data to allow for implementing business rules to clean data.
Source
Available from the Strategic Data Project online at https://sdp.cepr.harvard.edu/toolkit-effective-data-use
References
Visit the Strategic Data Project online at: https://sdp.cepr.harvard.edu/
Examples
data(stuatt)
head(stuatt)