readData {synMicrodata} | R Documentation |
Read the original datasets
Description
Reads the original input datasets from which the synthetic data generation model is learned. The input data may contain missing values; the package imputes them from the posterior predictive distribution, so the synthetic data output contains no missing values.
Usage
readData(Y_input, X_input, RandomSeed = 99)
Arguments
Y_input |
data.frame consisting of continuous variables of the original data.
It should consist only of |
X_input |
data.frame consisting of categorical variables of the original data.
It should consist only of |
RandomSeed |
random seed number. |
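As a rough illustration of the call documented above, the sketch below builds two small toy data frames and passes them to readData. The variable names (income, hours, sex, region) and all values are hypothetical, and storing the continuous variables as numeric columns and the categorical variables as factor columns is an assumption of this sketch. The call is guarded so that the block still runs when synMicrodata is not installed.

```r
# Toy inputs: hypothetical variable names and values, for illustration only.
set.seed(1)
n <- 100

# Continuous variables (assumed here to be numeric columns).
Y_input <- data.frame(
  income = rnorm(n, mean = 50, sd = 10),
  hours  = rnorm(n, mean = 40, sd = 5)
)

# Categorical variables (assumed here to be factor columns).
X_input <- data.frame(
  sex    = factor(sample(c("F", "M"), n, replace = TRUE)),
  region = factor(sample(c("N", "S", "E"), n, replace = TRUE))
)

# Introduce a few missing values; the Description states these are allowed.
Y_input$income[c(3, 17)] <- NA
X_input$region[c(5, 42)] <- NA

# Call readData only if the package is available.
if (requireNamespace("synMicrodata", quietly = TRUE)) {
  dat_obj <- synMicrodata::readData(Y_input, X_input, RandomSeed = 99)
  str(dat_obj, max.level = 1)  # list the top-level components
}
```

Both data frames must describe the same records, so they have the same number of rows here.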
Value
readData returns an object of class "readData_passed".
An object of class "readData_passed" is a list containing the following components:
n_sample |
number of records in the input dataset. |
p_Y |
number of continuous variables. |
Y_mat_std |
matrix with standardized values of |
mean_Y_input |
mean vectors of original |
sd_Y_input |
standard deviation vectors of original |
NA_Y_mat |
matrix indicating missing values in |
p_X |
number of categorical variables. |
D_l_vec |
numbers of levels of each categorical variable. |
X_mat_std |
matrix with the numeric-transformed values of |
levels_X_input |
list of levels of each categorical variable. |
NA_X_mat |
matrix indicating missing values in |
var_names |
list containing variable names of |
orig_data |
original dataset. |
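To make the components listed above more concrete, the following base R sketch computes quantities of the same kind from toy inputs: column means and standard deviations, a standardized matrix, missing-value indicator matrices, level counts, level lists, and integer codes for the categorical variables. This is not the package's implementation. The local names (mean_Y, sd_Y, Y_std, and so on) are hypothetical, and details such as how missing values are handled during standardization are assumptions of this sketch.

```r
# Toy inputs: hypothetical data, for illustration only.
Y_input <- data.frame(income = c(10, 20, NA, 40),
                      age    = c(30, NA, 50, 60))
X_input <- data.frame(sex    = factor(c("F", "M", NA, "F")),
                      region = factor(c("N", "S", "S", "E")))

# Counterparts of n_sample and p_Y.
n_sample <- nrow(Y_input)
p_Y      <- ncol(Y_input)

# Column means and standard deviations, ignoring missing values (assumption).
mean_Y <- colMeans(Y_input, na.rm = TRUE)
sd_Y   <- sapply(Y_input, sd, na.rm = TRUE)

# Standardized matrix: (value - mean) / sd, column by column; NAs stay NA.
Y_std <- scale(as.matrix(Y_input), center = mean_Y, scale = sd_Y)

# Logical matrix marking the missing cells of the continuous data.
NA_Y <- is.na(Y_input)

# Counterparts of p_X, D_l_vec and levels_X_input.
p_X      <- ncol(X_input)
D_l      <- sapply(X_input, nlevels)
levels_X <- lapply(X_input, levels)

# Integer codes of the factor levels, one column per categorical variable.
X_num <- sapply(X_input, as.integer)

# Logical matrix marking the missing cells of the categorical data.
NA_X <- is.na(X_input)
```

Keeping the means, standard deviations, and level lists alongside the transformed matrices is what allows standardized values and integer codes to be mapped back to the original scale and labels.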