Database Reference
In-Depth Information
that end, we use techniques related to those for discovering correlations
across streams, illustrating the relationship between the two problems.
Let us first assume that someone gives us a window size. Then, the
problem we want to solve is the following:
Problem 8.1 (Fixed-window optimal patterns) Given a time se-
ries x t , t =1 , 2 ,... and a window size w , find the patterns that best
summarize the series at this window size.
w ,cho-
sen so that they capture “most” of the information in the series (in a
way that we will make precise later).
In practice, however, we do not know apriori the right window size.
Therefore, with respect to the second requirement (multi-scale), we want
to solve the following problem:
[ v i, 1 ,...,v i,w ] T
The patterns are w -dimensional vectors v i
R
Problem 8.2 (Optimal local patterns) Given a time series x t and
a set of windows
W
{
w 1 ,w 2 ,w 3 ,...
}
, find (i) the optimal patterns for
each of these, and (ii) the best window w to describe the key patterns
in the series.
:=
So how do we go about finding these patterns? An elementary concept
we need to introduce is time-delay coordinates . We are given a time series
x t , t =1 , 2 ,... with m points seen so far. Intuitively, when looking for
patterns of length w , we divide the series in consecutive, non-overlapping
subsequences of length w . Thus, if the original series is a m × 1matrix
(not necessarily materialized), we substitute it with a w × w matrix.
Instead of m scalar values we now have a sequence of m/w vectors with
dimension w . It is natural to look for patterns among these time-delay
vectors.
Definition 5.7 (Delay coordinates) Given a sequence denoted by
x
[ x 1 ,x 2 ,...,x t ,...,x m ] T and a delay (or window) w ,the delay co-
ordinates are a
w matrix with the t -th row equal to X ( w )
m/w
×
( t ) :=
[ x ( t 1) w +1 ,x ( t 1) w +2 ,...,x t w ] T .
Of course, neither x nor X ( w ) need to be fully materialized at any point
in time. In practice, we only need to store the last row of X ( w ) .
Also, note that we choose non-overlapping windows. We could also
use overlapping windows, in which case X ( w ) would have m−w +1rows,
with row t consisting of values x t ,x t +1 ,...,x t + w . In this case, there are
some subtle differences [23], akin to the differences between “standard”
wavelets and maximum-overlap or redundant wavelets [46]. However,
in practice non-overlapping windows are equally effective for pattern
Search WWH ::




Custom Search