Probabilistic Ranking Queries on Uncertain Data - Ranking Queries on Uncertain Data

Database Reference

In-Depth Information

5.5.2.1 Answering PT- k Queries

The following example illustrates how to evaluate PT- k queries.

Example 5.6 (Answering PT-k queries). Consider the uncertain tuples indexed in

Figure 5.3 again, and a PT-k query with k

45 .

To find the tuples satisfying the query, we only need to look at the prob-interval

containing p

3 and p

45 , which is b 3 =(

]

. In the U-list of b 3 , we find that

3 , which means that Pr 3

6 and Pr 3

t 3 .

6 . There-

fore, t 3 and t 4 can be added into the answer set without calculating their exact top-k

probabilities. In the L-list of b 3 , we find that t 2 .

U 3 =

3 and t 4 .

U 3 =

(

t 3 ) >

(

t 4 ) >

4 , which means Pr 4

L 3 =

(

t 2 ) ≤

4 .

Therefore, t 2 can be pruned.

Thus, only the top- 3 probability of t 1 needs to be calculated in order to further

verify if t 1 is an answer to the query. Since Pr 3

(

t 1 )=

5 , it can be added into the

answer set. The final answer is

{

t 1 ,

t 3 ,

t 4 }

The three steps for PT- k query evaluation are as follows.

Step 1: Bounding We use Corollary 5.4 to determine whether the top- k probability

of t lies in b i .

Corollary 5.4 (Bounding top- k probabilities). Let T be a set of uncertain tuples

indexed by PRist with granularity parameter h. For a tuple t

∈

T and a positive

U i , then i − h <

integer k, if b i ( 1

≤

h) is the prob-interval such that t

L i <

Pr k

h .

Proof. According to the definition of PRist ,wehave Pr t . L i

(

) ≤

−

and Pr t . L i + 1

(

) ≤

(

) >

i − 1

L i , Pr k

Pr t . L i + 1

i − 1

. On the other hand, Pr t . U i

. Since k

(

) ≥

(

) >

(

) >

h and

Pr t . U i − 1

U i ,wehave Pr k

Pr t . U i − 1

(

) ≤

h . Since k

(

) ≤

(

) ≤

h .

Step2: Pruning and Validating A tuple may be pruned or validated by checking

its lower rank L i or upper rank U i in the prob-interval containing the probability

threshold, as stated in Theorem 5.10.

Theorem 5.10 (Answering PT- k queries). Let T be a set of uncertain tuples in-

dexed by PRist with granularity parameter h. For a tuple t

T and a PT-k query Q k f

∈

−

h ]

with probability threshold p

∈

b i =(

k, then Pr k

1. Pruning :ift

L i >

(

) <

k, then Pr k

Proof. According to the definition of PRist ,wehave Pr t . L i

2. Validating :ift

U i ≤

(

) >

−

and Pr t . L i + 1

(

) ≤

(

) >

i − 1

k , Pr k

Pr t . L i

i − 1

. Therefore, if t

L i >

(

) ≤

(

) ≤

p . Moreover, according to

the definition of PRist , Pr t . U i

h and Pr t . U i − 1

h . Thus, if t

(

) >

(

) ≤

≤

U i

k , then

Pr k

Pr t . U i

h >

(

) ≥

(

) >

p .

Step 3: Evaluating We only need to evaluate the exact top- k probabilities for those

tuples whose top- k probabilities falling into the prob-interval containing the proba-

bility threshold p , and which cannot be validated or pruned by Theorem 5.10.

Ranking Queries on Uncertain Data

Search WWH ::

Custom Search

Home