R: Fold a dataset avoiding overlapping of different time series

filtered_fold_dt {dbnR}

R Documentation

Fold a dataset avoiding overlapping of different time series

Description

If the dataset to be folded contains several independent time series instances of the same process, folding it directly can create spurious rows that combine data from different series. Given an id variable that labels each instance of a time series in the dataset and a desired size, this function folds the dataset so that no row mixes data from different instances.
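
The idea can be illustrated with a minimal base R sketch on a toy data.frame. This is not the dbnR implementation: the helpers `fold_one` and `fold_by_id` and the `_t_0`/`_t_1` column suffixes are hypothetical names chosen for illustration only.

```r
# Hypothetical helper, not part of dbnR: fold one series into `size`
# time slices. Slice 0 is the most recent observation in each row.
fold_one <- function(df, size) {
  n <- nrow(df)
  if (n < size) return(NULL)  # too short to produce a complete row
  slices <- lapply(seq_len(size) - 1, function(lag) {
    out <- df[(size - lag):(n - lag), , drop = FALSE]
    names(out) <- paste0(names(out), "_t_", lag)
    rownames(out) <- NULL
    out
  })
  do.call(cbind, slices)
}

# Hypothetical helper: fold each series separately, then stack the
# results, so no row spans two series. The id column is dropped.
fold_by_id <- function(df, size, id_var) {
  vars <- setdiff(names(df), id_var)
  groups <- split(df[, vars, drop = FALSE], df[[id_var]])
  pieces <- Filter(Negate(is.null), lapply(groups, fold_one, size = size))
  res <- do.call(rbind, pieces)
  rownames(res) <- NULL
  res
}

# Two series ("a" and "b") of three observations each
toy <- data.frame(x = 1:6, id = rep(c("a", "b"), each = 3))

naive <- fold_one(toy["x"], size = 2)                 # ignores the id
filtered <- fold_by_id(toy, size = 2, id_var = "id")  # respects the id

naive[3, ]      # x_t_0 = 4 (series "b"), x_t_1 = 3 (series "a"): a mixed row
nrow(naive)     # 5
nrow(filtered)  # 4
```

In this toy case the naive fold keeps one row that joins the last observation of series "a" with the first of series "b", while folding per id discards it.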

Usage

filtered_fold_dt(dt, size, id_var, clear_id_var = TRUE)

Arguments

`dt`	data.table to be folded
`size`	the number of time slices to fold the data.table into
`id_var`	the variable that labels each individual instance of the time series
`clear_id_var`	logical that decides whether or not the id_var column is deleted from the result

Value

the folded data.table, without rows that mix data from different time series

Examples

dt <- dbnR::motor[201:2500]
dt[, n_sec := rep(seq(46), each = 50)] # Create 46 sequences of 50 instances each
f_dt <- dbnR::fold_dt(dt, size = 2)
dim(f_dt)
f_dt <- dbnR::filtered_fold_dt(dt, size = 2, id_var = "n_sec")
dim(f_dt)  # The filtered folded dt has one row fewer for each independent sequence

[Package dbnR version 0.7.9 Index]