diss.CID {TSclust} | R Documentation |
Complexity-Invariant Distance Measure For Time Series
Description
Computes the distance based on the Euclidean distance corrected by the complexity estimation of the series.
Usage
diss.CID(x, y)
Arguments
x |
Numeric vector containing the first of the two time series. |
y |
Numeric vector containing the second of the two time series. |
Details
This distance is defined
CID(x,y) = ED(x,y) \times CF(x,y)
where CF(x,y)
is a complexity correction factor defined as:
max(CE(x), CE(y)) / min(CE(x), CE(y))
and CE(x)
is a compexity estimate of a time series x
. diss.CID
therefore increases the distance between series with different complexities. If the series have the same complexity estimate, the distance defenerates Euclidean distance. The complexity is defined in diss.CID
as:
CE(x) = \sqrt{ \sum_{t=1} (x_{t+1} - x_t)^2 }
Value
The computed dissimilarity.
Author(s)
Pablo Montero Manso, José Antonio Vilar.
References
Batista, G. E., Wang, X., & Keogh, E. J. (2011). A Complexity-Invariant Distance Measure for Time Series. In SDM (Vol. 31, p. 32).
Montero, P and Vilar, J.A. (2014) TSclust: An R Package for Time Series Clustering. Journal of Statistical Software, 62(1), 1-43. http://www.jstatsoft.org/v62/i01/.
See Also
Examples
n = 100
x <- rnorm(n) #generate sample series, white noise and a wiener process
y <- cumsum(rnorm(n))
diss.CID(x, y)
z <- rnorm(n)
w <- cumsum(rnorm(n))
series = rbind(x, y, z, w)
diss(series, "CID")