Information Technology Reference
In-Depth Information
n
r i
r
n
x i
=
x j
(12.12)
n
r (
j
=
i
1
) +
1
i
=
1
...
r
,
j
=
1
...
n
The PAA dimensionality reduction is intuitive and simple, yet has been shown
to rival more sophisticated dimensionality reduction techniques like Fourier trans-
forms and wavelets [ 15 ]. Having transformed a time series database into PAA, we
can apply our proposed recurrence plot-based time series distance measure on the
reduced representation. Since the computational complexity of our RRR distance
measure is quadratic in the length n of the time series, reducing the original time
series to r dimensions leads to a performance improvement of factor
2 . In our
(
n
/
r
)
experiments on the real-life vehicular data we use a compression rate of n
10,
which correspond to a speedup of two orders of magnitude or rather 100 times less
matrix entries to compute. However, this approach comes with the cost of missing
recurrences [ 23 ].
/
r
=
12.8 Adjustment Window Condition
Another approach to reduce the computational complexity of our proposed recurrence
plot-based (RRR) time series distance measure is to constrain the number of cells
that are evaluated in the distance matrix [ 30 ]. Constraints have been successfully
applied to the Dynamic Time Warping (DTW) distance to create tight lower bounds
which allow to prune similarity calculations [ 8 , 11 ]. The two most commonly used
constraints are the Itakura Parallelogram [ 6 ] and the Sakoe-Chiba Band [ 29 ], which
both speed up calculations by a constant factor, but still lead to quadratic complexity
if the window size
is a linear function of the time series.
Given the formal definition of (joint) cross recurrence (see Eqs. 12.2 and 12.4 ),
the Sakoe-Chiba Band is an adjustment window condition which corresponds to the
fact that time-axis fluctuations in usual cases never causes a too excessive timing
difference [ 29 ]:
w
|
i
j
|≤ w
(12.13)
d
x i ,
∈ R
,
=
...
,
=
...
y j
i
1
N
j
1
M
In general, constraints work well in domains where time series have only a small
variance, but perform poorly if time series are of events that start and stop at radically
different times [ 30 ]. Since this study considers time series that exhibit recurring
patterns at arbitrary positions, we refrain from applying constraints for the data
under study.
 
Search WWH ::




Custom Search