| FI_test {FastImputation} | R Documentation |
Imputation Test Data
Description
Smaller simulated dataset drawn from the same distribution as FI_train and FI_true. This dataset is entirely the same as FI_true except this one has 5% of its values missing. Used with FastImputation.
Usage
data(FI_test)
Format
A data frame with 9 variables and 250 observations.
user_id_1Sequential user ids
bounded_below_2Multivariate normal, transformed using
exp(x)unbounded_3Multivariate normal
unbounded_4Multivariate normal
bounded_above_5Multivariate normal, transformed using
-exp(x)bounded_above_and_below_6Multivariate normal, transformed using
pnorm(x)unbounded_7Multivariate normal
unbounded_8Multivariate normal
categorical_9"A" if the first of three multivariate normal draws is greatest; "B" if the second is greatest; "C" if the third is greatest
Author(s)
Stephen R. Haptonstahl srh@haptonstahl.org
Source
All columns start as multivariate normal draws. Columns 2, 5, and 6 are transformed. Column 9 is the result of three multivariate normal columns being interpreted as one-hot encoding of a three-valued categorical variable.