Continuous Ranking Queries on Uncertain Streams - Ranking Queries on Uncertain Data

Database Reference

In-Depth Information

6.1.1 Top-k Probabilities in a Sliding Window

Consider a set of uncertain data streams

O = {

O 1 ,...,

O n }

and sliding window

W t

. W t

)

in the sliding window. In this subsection, we consider how to rank the data streams

according to their instances in sliding window W t

ω ( O )

ω (

O i )= {

O i [

− ω +

] ,...,

O i [

] }

is the set of instances of O i (

≤

ω ( O )

. When it is clear from con-

text, we write W t

and W t

ω ( O )

ω (

)

( O )

(

)

, respectively.

We reduce computing the top- k probability of a stream O into computing the

top- k probabilities of instances of O .

O i

simply as W

and W

O i

Definition 6.1 (Top- k probability of instance). For an instance o and a top- k query

Q k , the top-k probability of o , denoted by Pr k

(

)

, is the probability that o is ranked

)= { w ∈W | o ∈ Q k

(

) }

in the top- k lists in possible worlds. That is, Pr k

(

Following with Definitions 2.11 and 2.14, we have the following.

Corollary 6.1 (Top- k probability). For an uncertain data stream O, a sliding win-

dowW t

and a top-k query Q k ,

ω ( O )

Pr k

∑

o ∈ W t

Pr k

∑

o ∈ W t

Pr k

(

)

(

) .

ω ( O )

We sort all instances according to their scores. Let R denote the ranking order

of instances. For two instances o 1

o 2 if o 1 is ranked before (i.e.,

better than) o 2 in R . Clearly, the rank of an instance o of stream O in the possible

worlds depends on only the instances of other streams that are ranked better than o .

We capture those instances as the dominant set of o .

o 2 , we write o 1

≺

Definition 6.2 (Dominant set). Given a set of streams

, a sliding window W , and

a top- k query Q k , for an instance o of stream O

∈ O

, the dominant set of o is the

set of instances of streams in

O −{

}

that are ranked better than o , denoted by

o ∈

o ≺

(

)= {

( O−

) |

}

In a possible world w , an instance o is ranked the i -th place if and only if there

are

(

−

)

instances in DS

(

)

appearing in w , and each of those instances is from a

unique stream.

Based on this observation, for instance o and stream O such that o

O ,we

∈

denote by O ≺

o in a possible world w if there exists o ∈

O )

, o ≺

o , and o and

(

o appear in w . Apparently, we have

∑

o )

O )

O ≺

(

) ∩

(

o ∈

(

) ,

from unique streams

appear in a possible world. Then, the top- k probability of o can be written as

Let Pr

(

) ,

)

be the probability that i instances in DS

(

)

k − 1

i = 0 Pr ( DS ( o ) , i )=

k − 1

i = 0 Pr ( DS ( o ) , i ) .

Pr k

(

)

Ranking Queries on Uncertain Data

Search WWH ::

Custom Search

Home