okc_binary {applicable}R Documentation

OkCupid Binary Predictors

Description

OkCupid Binary Predictors

Details

Data originally from Kim (2015) includes a training and test set consistent with Kuhn and Johnson (2020). Predictors include ethnicity indicators and a set of keywords derived from text essay data.

Value

okc_binary_train, okc_binary_test

data frame frames with 61 columns

Source

Kim (2015), "OkCupid Data for Introductory Statistics and Data Science Courses", Journal of Statistics Education, Volume 23, Number 2. https://www.tandfonline.com/doi/abs/10.1080/10691898.2015.11889737

Kuhn and Johnson (2020), Feature Engineering and Selection, Chapman and Hall/CRC . https://bookdown.org/max/FES/ and https://github.com/topepo/FES

Examples

data(okc_binary)
str(okc_binary_train)

[Package applicable version 0.1.0 Index]