GCCclus {SLBDD}R Documentation

Clustering of Time Series Using the Generalized Cross Correlation Measure of Linear dependency

Description

Clustering of time series using the Generalized Cross Correlation (GCC) measure of linear dependency proposed in Alonso and Peña (2019).

Usage

GCCclus(x, lag, rs, thres, plot, printSummary = TRUE, lag.set, silh = 1)

Arguments

x

T by k data matrix: T data points in rows with each row being data at a given time point, and k time series in columns.

lag

Selected lag for computing the GCC between the pairs of series. Default value is computed inside the program.

rs

Relative size of the minimum group considered. Default value is 0.05.

thres

Percentile in the distribution of distances that define observations that are not considered outliers. Default value is 0.9.

plot

If the value is TRUE, a clustermatrix plot of distances and a dendogram are presented. Default is FALSE.

printSummary

If the value is TRUE, the function prints a summary table of the clustering. Default is TRUE.

lag.set

If lag is not specified and the user wants to use instead of lags from 1 to 'lag' a non consecutive set of lags they can be defined as lag.set = c(1, 4, 7).

silh

If silh = 1 standard silhoutte statistics and if silh = 2 modified procedure. Default value is 1.

Details

First, the matrix of Generalized Cross correlation (GCC) is built by using the subrutine GCCmatrix, then a hierarchical grouping is constructed and the number of clusters is selected by either the silhouette statistics or a modified silhouette statistics The modified silhouette statistics is as follows:

Value

A list containing:

References

Alonso, A. M. and Peña, D. (2019). Clustering time series by linear dependency. Statistics and Computing, 29(4):655–676.

Examples

data(TaiwanAirBox032017)
output <- GCCclus(TaiwanAirBox032017[1:50,1:8])


[Package SLBDD version 0.0.4 Index]