okc_binary {applicable} R Documentation

OkCupid Binary Predictors

Description

OkCupid Binary Predictors

Details

The data, originally from Kim (2015), include training and test sets consistent with Kuhn and Johnson (2020). Predictors include ethnicity indicators and a set of keywords derived from text essay data.

Value

okc_binary_train, okc_binary_test

data frames with 61 columns

Source

Kim (2015), "OkCupid Data for Introductory Statistics and Data Science Courses", Journal of Statistics Education, Volume 23, Number 2. http://www.amstat.org/publications/jse/contents_2015.html

Kuhn and Johnson (2020), Feature Engineering and Selection, Chapman and Hall/CRC. https://bookdown.org/max/FES/ and https://github.com/topepo/FES

Examples

data(okc_binary)
str(okc_binary_train)

[Package applicable version 0.0.1.2 Index]