R: SyntheticData1

SyntheticData1 {ctsfeatures}

R Documentation

SyntheticData1

Description

Synthetic dataset containing 80 CTS generated from four different generating processes.

Usage

data(SyntheticData1)

Format

A tsibble with four columns, which are:

Value: The categorical values of the time series in the dataset.
Series: Integer values indicating the considered time series (there are 80 time series in the dataset).
Time: Integer values indicating the temporal indexes of the observations.
Class: Integer values indicating the class of each time series.

@details The column Value is the concatenation of 80 time series of length 600 taking three categorical values. Series 1-20, 21-40, 41-60 and 61-80 were generated from Markov Chains with different matrices of transition probabilities (see Scenario 1 in López-Oriona et al. (2023)). Therefore, there are 4 different classes in the dataset.

References

López-Oriona Á, Vilar JA, D’Urso P (2023). “Hard and soft clustering of categorical time series based on two novel distances with an application to biological sequences.” Information Sciences, 624, 467–492.

[Package ctsfeatures version 1.2.2 Index]