| CENSUS {regclass} | R Documentation |
CENSUS data
Description
Information from the 2010 US Census
Usage
data("CENSUS")
Format
A data frame with 3534 observations on the following 39 variables.
ResponseRatea numeric vector, 0-100 representing the percentage of households in a block group that mailed in the form
Areaa numeric vector, land area in square miles
Urbana numeric vector, percentage of block group in Urbanized area (50000 or greater)
Suburbana numeric vector, percentage of block group in an Urban Cluster area (2500 to 49999)
Rurala numeric vector, percentage of block group in an Urban Cluster area (2500 to 49999)
Malea numeric vector, percentage of males
AgeLess5a numeric vector, percentage of individuals aged less than 5 years old
Age5to17a numeric vector
Age18to24a numeric vector
Age25to44a numeric vector
Age45to64a numeric vector
Age65plusa numeric vector
Hispanicsa numeric vector, percentage of individuals who identify as Hispanic
Whitesa numeric vector, percentage of individuals who identify as white (alone)
Blacksa numeric vector
NativeAmericansa numeric vector
Asiansa numeric vector
Hawaiiansa numeric vector
Othera numeric vector, percentage of individuals who identify as another ethnicity
RelatedHHa numeric vector, percentage of households where at least 2 members are related by birth, marriage, or adoption; same-sex couple households with no relatives of the householder present are not included
MarriedHHa numeric vector, percentage of households in which the householder and his or her spouse are listed as members of the same household; does not include same-sex married couples
NoSpouseHHa numeric vector, percentage of households with no spousal relationship present
FemaleHHa numeric vector, percentage of households with a female householder and no husband of householder present
AloneHHa numeric vector, percentage of households where householder is living alone
WithKidHHa numeric vector, percentage of households which have at least one person under the age of 18
MedianHHIncomeBlocka numeric vector, median income of households in the block group (from American Community Survey)
MedianHHIncomeCitya numeric vector, median income of households in the tract
OccupiedUnitsa numeric vector, percentage of housing units that are occupied
RentingHHa numeric vector, percentage of housing units occupied by renters
HomeownerHHa numeric vector, percentage of housing units occupied by the owner
MobileHomeUnitsa numeric vector, percentage of housing units that are mobile homes (from American Community Survey)
CrowdedUnitsa numeric vector, percentage of housing units with more than 1 person per room on average
NoPhoneUnitsa numeric vector, percentage of housing units without a landline
NoPlumbingUnitsa numeric vector, percentage of housing units without active plumbing
NewUnitsa numeric vector, percentage of housing units constructed in 2010 or later
Populationa numeric vector, number of people in the block group
NumHHa numeric vector, number of households in the block group
NumUnitsa numeric vector, number of housing units in the block group
logMedianHouseValuea numeric vector, the logarithm of the median home value in the block group
Details
The goal is to predict ResponseRate from the other predictors. ResponseRate is the percentage of households in a block group that mailed in the census forms. A block group is on average about 40 blocks, each typically bounded by streets, roads, or water. The number of block groups per county in the US is typically between about 5 and 165 with a median of about 20.
References
See https://www2.census.gov/programs-surveys/research/guidance/planning-databases/2014/pdb-block-2014-11-20a.pdf for variable definitions.