adult {rebmix} | R Documentation |
Adult Dataset
Description
The adult dataset, containing 48842 instances with 16 continuous, binary and discrete variables, was extracted by Barry Becker from the 1994 census bureau database.
Usage
data(adult)
Format
adult is a data frame with 48842 cases (rows) and 16 variables (columns) named:
- Type: binary, train or test.
- Age: continuous.
- Workclass: one of the 8 discrete values private, self-emp-not-inc, self-emp-inc, federal-gov, local-gov, state-gov, without-pay or never-worked.
- Fnlwgt: continuous; stands for final weight.
- Education: one of the 16 discrete values bachelors, some-college, 11th, hs-grad, prof-school, assoc-acdm, assoc-voc, 9th, 7th-8th, 12th, masters, 1st-4th, 10th, doctorate, 5th-6th or preschool.
- Education.Num: continuous.
- Marital.Status: one of the 7 discrete values married-civ-spouse, divorced, never-married, separated, widowed, married-spouse-absent or married-af-spouse.
- Occupation: one of the 14 discrete values tech-support, craft-repair, other-service, sales, exec-managerial, prof-specialty, handlers-cleaners, machine-op-inspct, adm-clerical, farming-fishing, transport-moving, priv-house-serv, protective-serv or armed-forces.
- Relationship: one of the 6 discrete values wife, own-child, husband, not-in-family, other-relative or unmarried.
- Race: one of the 5 discrete values white, asian-pac-islander, amer-indian-eskimo, other or black.
- Sex: binary, female or male.
- Capital.Gain: continuous.
- Capital.Loss: continuous.
- Hours.Per.Week: continuous.
- Native.Country: one of the 41 discrete values united-states, cambodia, england, puerto-rico, canada, germany, outlying-us(guam-usvi-etc), india, japan, greece, south, china, cuba, iran, honduras, philippines, italy, poland, jamaica, vietnam, mexico, portugal, ireland, france, dominican-republic, laos, ecuador, taiwan, haiti, columbia, hungary, guatemala, nicaragua, scotland, thailand, yugoslavia, el-salvador, trinadad&tobago, peru, hong or holand-netherlands.
- Income: binary, <=50k or >50k.
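To check the loaded data against the format described above, a small summary of each column's class and number of levels can help. The following is a minimal sketch, not part of rebmix: describe_columns is a hypothetical helper, and it assumes that the binary and discrete variables are stored as factors.

```r
# Hypothetical helper (not part of rebmix): report the class of each column
# and, for factor columns, the number of levels.
describe_columns <- function(df) {
  data.frame(
    variable = names(df),
    class = unname(vapply(df, function(x) class(x)[1], character(1))),
    n_levels = unname(vapply(
      df,
      function(x) if (is.factor(x)) nlevels(x) else NA_integer_,
      integer(1)
    )),
    stringsAsFactors = FALSE
  )
}

# Usage, assuming the rebmix package is installed:
# data(adult, package = "rebmix")
# describe_columns(adult)
```

If the assumption holds, the n_levels column should match the counts listed above, for example 8 for Workclass, and be NA for the continuous variables.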
Source
A. Asuncion and D. J. Newman. UCI Machine Learning Repository, 2007. http://archive.ics.uci.edu/ml/.
References
A. Asuncion and D. J. Newman. UCI Machine Learning Repository, 2007. http://archive.ics.uci.edu/ml/.
Examples
data(adult)
# Keep only complete cases (rows without missing values).
adult <- adult[complete.cases(adult),]
# Show level attributes for binary and discrete variables.
levels(adult[["Type"]])
levels(adult[["Workclass"]])
levels(adult[["Education"]])
levels(adult[["Marital.Status"]])
levels(adult[["Occupation"]])
levels(adult[["Relationship"]])
levels(adult[["Race"]])
levels(adult[["Sex"]])
levels(adult[["Native.Country"]])
levels(adult[["Income"]])
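Because Type marks each row as train or test, the data can be separated into the two subsets before modelling. The following is a minimal sketch under that assumption; split_by_type is a hypothetical helper, not a rebmix function, and it relies only on base R.

```r
# Hypothetical helper (not part of rebmix): split a data frame into one
# data frame per level of its Type column, dropping Type from the result.
split_by_type <- function(df) {
  stopifnot("Type" %in% names(df))
  other_columns <- setdiff(names(df), "Type")
  split(df[other_columns], df[["Type"]], drop = TRUE)
}

# Usage, assuming the rebmix package is installed and that the levels of
# Type are named "train" and "test" as listed under Format:
# data(adult, package = "rebmix")
# parts <- split_by_type(adult)
# table(parts$train[["Income"]])
```

The result is a named list, so the two subsets would be reached as parts$train and parts$test if the level names match those given under Format.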