SyntheticData1 {ctsfeatures}R Documentation

SyntheticData1

Description

Synthetic dataset containing 80 CTS generated from four different generating processes.

Usage

data(SyntheticData1)

Format

A tsibble with four columns, which are:

Value

The categorical values of the time series in the dataset.

Series

Integer values indicating the considered time series (there are 80 time series in the dataset).

Time

Integer values indicating the temporal indexes of the observations.

Class

Integer values indicating the class of each time series.

@details The column Value is the concatenation of 80 time series of length 600 taking three categorical values. Series 1-20, 21-40, 41-60 and 61-80 were generated from Markov Chains with different matrices of transition probabilities (see Scenario 1 in López-Oriona et al. (2023)). Therefore, there are 4 different classes in the dataset.

References

López-Oriona Á, Vilar JA, D’Urso P (2023). “Hard and soft clustering of categorical time series based on two novel distances with an application to biological sequences.” Information Sciences, 624, 467–492.


[Package ctsfeatures version 1.2.2 Index]