Database Reference
In-Depth Information
{ f }:[2.1,0,2.1]
{ g }:[1.8,2.1,2.7]
{ h }:[1.5,0,1.5]
{ i }:[0,0,1.0]
{ f,g }:[1.26,0,1.89]
{ f,h }:[1.05,0,1.05]
{ g,h }:[1.35,0,1.35]
{ g,i }:[0,0,0.90]
{ f,g,h }:[0.945,0,0.945]
Fig. 14.9 The UF-stream structure for streaming uncertain data D 3
constructed for the streaming uncertain data in Table 14.7 with preMinsup = 0.9 <
1.0 = minsup .
As UF-streaming uses preMinsup , the UF-stream structure may contain some
false positives (i.e., every pattern Y having preMinsup
expSup ( Y , D w ) < minsup ,
where D w represents the streaming uncertain data in the current sliding window) in
addition to all truly frequent patterns (i.e., every pattern X having expSup ( X , D w )
minsup ). At the time when the user requests for frequent patterns from uncertain
streaming data, UF-streaming traverses the UF-stream structure and returns only
those truly frequent patterns.
8.3
TUF-streaming for the Time-Fading Model
Besides the sliding window model (used by both SUF-growth and UF-streaming),
there are also other stream processing models such as time-fading and landmark mod-
els. Leung and Jiang [ 38 ] proposed the TUF-streaming algorithm to mine frequent
patterns from probabilistic datasets of uncertain data in a fashion similar to UF-
streaming, except that TUF-streaming uses the time-fading model instead of the slid-
ing window model. When using the time-fading model, TUF-streaming puts heavier
weights on recent batches of streaming uncertain data than older batches. Specifi-
cally, like UF-streaming, the TUF-streaming algorithm also uses the “immediate”
mining mode to apply UF-growth (ref. Sect. 5.1) to every incoming batch of stream-
ing uncertain data with a preMinsup threshold to find “frequent” patterns from that
batch. When using the time-fading model, the corresponding TUF-stream structure
(i) captures all “frequent” patterns mined from all batches but weights recent batches
heavier than older batches (i.e., monotonically decreasing weights from recent to
older data). See Fig. 14.10 , in which α (where 0
1) is a time-fading factor.
While TUF-streaming handles batches of streaming uncertain data that come in
order, there are situations where batches may get delayed and thus arrived out of
the desired order. To deal with these situations, Jiang and Leung [ 26 ] extended the
TUF-streaming algorithm to mine frequent patterns from these delayed batches when
using the time-fading model.
Search WWH ::




Custom Search