prepare_set {dataPreparation}    R Documentation

Preparation pipeline

Description

Full pipeline for preparing your data set.

Usage

prepare_set(data_set, final_form = "data.table", verbose = TRUE, ...)

Arguments

data_set

Matrix, data.frame or data.table

final_form

"data.table" or "numerical_matrix" (defaults to "data.table")

verbose

Should the algorithm talk? (logical, defaults to TRUE)

...

Additional parameters to tune the pipeline (see Details)
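
As an illustration of final_form, the sketch below requests each of the two documented output forms and prints the class of the result. It assumes the dataPreparation package is installed and uses the tiny_messy_adult data set from the Examples section; the variable names as_table and as_matrix are only illustrative, and the whole block is skipped when the package is unavailable.

```r
# Sketch only: the pipeline runs only if dataPreparation is installed
if (requireNamespace("dataPreparation", quietly = TRUE)) {
  library(dataPreparation)
  data(tiny_messy_adult)

  # Default output form: a data.table
  as_table <- prepare_set(tiny_messy_adult, final_form = "data.table",
                          verbose = FALSE)
  print(class(as_table))

  # Reload the raw data so both calls start from the same input
  # (assumption: the pipeline may modify its input in place)
  data(tiny_messy_adult)

  # Alternative output form: a numerical matrix
  as_matrix <- prepare_set(tiny_messy_adult, final_form = "numerical_matrix",
                           verbose = FALSE)
  print(class(as_matrix))
}
```

Setting verbose = FALSE here only keeps the output short; leave it at TRUE to see what the pipeline reports while it runs.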

Details

Additional arguments are available to tune the pipeline:

Value

A data.table or a numerical matrix (according to final_form).
It will perform the following steps:

Examples

# Load ugly set
## Not run: 
data(tiny_messy_adult)

# Have a look at the set
head(tiny_messy_adult)

# Compute full pipeline
clean_adult <- prepare_set(tiny_messy_adult)

# With a reference date
adult_agg <- prepare_set(tiny_messy_adult, analysis_date = as.Date("2017-01-01"))

# Add aggregation by country
adult_agg <- prepare_set(tiny_messy_adult, analysis_date = as.Date("2017-01-01"), key = "country")

# With some new aggregation functions
power <- function(x) {sum(x^2)}
adult_agg <- prepare_set(tiny_messy_adult, analysis_date = as.Date("2017-01-01"), key = "country",
                        functions = c("min", "max", "mean", "power"))

## End(Not run)
# "## Not run:" means that this example is not run on CRAN because it takes a long time. But you can run it!
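
The custom aggregation function from the last example can be checked on its own before its name is passed to functions. The following is a minimal sketch in base R; it assumes that, like min, max and mean, an aggregation function is expected to reduce a numeric vector to a single value. The input vector x is made up for illustration.

```r
# Custom aggregation function from the example: sum of squares
power <- function(x) {
  sum(x^2)
}

# Made-up input, for illustration only
x <- c(1, 2, 3)

# Check that the function reduces the vector to a single value
result <- power(x)
stopifnot(length(result) == 1)

print(result)  # 1^2 + 2^2 + 3^2 = 14
```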

[Package dataPreparation version 1.1.1 Index]