Probabilistic Ranking Queries on Uncertain Data - Ranking Queries on Uncertain Data - page 109

Database Reference

In-Depth Information

(0.8,1.0]

b5

(0.8,1.0]

b5

L5:

2 (t4)

L5:

2 (t4)

4 (t1)

4 (t2)

4 (t3)

4 (t1,t2,t3)

U4:

L4:

3 (t4)

U4:

L4:

3 (t4)

2 (t3)

4 (t1)

2 (t4)

4 (t2)

4 (t1)

4 (t3)

4 (t2)

(0.6,0.8]

(0.6,0.8]

b4

b4

2 (t3,t4)

4 (t1,t2)

U3:

L3:

3 (t3,t4)

U3:

L3:

U2:

L2:

U1:

3 (t3)

0 (t1)

3 (t4)

1 (t3)

4 (t1)

1 (t4)

4 (t2)

4 (t2)

(0.4,0.6]

(0.4,0.6]

b3

b3

0 (t1)

1 (t3,t4)

4 (t2)

2 (t3,t4)

U2:

L2:

1 (t1)

1 (t1)

0 (t1)

1 (t1)

2 (t3)

0 (t3)

1 (t3)

2 (t4)

1 (t2)

2 (t2)

4 (t2)

1 (t4)

2 (t4)

(0.2,0.4]

(0.2,0.4]

b2

b2

0 (t1,t3)

1 (t2,t4)

U1:

1 (t1,t3)

2 (t2,t4)

(0,0.2]

b1

(0,0.2]

b1

(a) The PRist index.

(b) The compressed PRist index.

Fig. 5.3 A PRist index for the uncertain tuples in Table 2.2.

, we maintain two lists associated with

the interval. The L-list stores a lower bound for each tuple, and the U-list stores

an upper bound for each tuple. Therefore, for t 4 , lower bound 0 is inserted into the

L-list of

To store the bounds in interval

(

0

,

0

.

2

]

.

The lower and upper bounds of ranks for other prob-intervals can be computed

in the same way. Finally, the entry of t 4 in a PRist is shown in Figure 5.2(b).

(

0

,

0

.

2

]

, and upper bound 2 is inserted into the U-list of

(

0

,

0

.

2

]

>

Formally, given a set of uncertain tuples T and a granularity parameter h

0, a

i

−

1

i

h

PRist index for T contains a set of prob-intervals

{

b 1

,...,

b h }

, where b i

=(

,

]

h

(1

≤

h ).

Each prob-interval b i (

i

≤

2

≤

i

≤

h

−

1

)

is associated with two lists: a U -list and an

L -list.

An entry in the U -list of b i corresponds to a tuple t and consists of two items:

tuple id t and an upper rank of t in b i , denoted by t

.

U i , such that one of the following

holds: (1) Pr t . U i

i

h

and Pr t . U i − 1

i

h

when Pr m

i

(

t

) >

≤

(

t

) >

h ; or (2) t

.

U i =

m when

Pr m

i

T has an entry in the U -list. All entries in the U -list are

sorted in ascending order of the upper ranks.

An entry in the L -list of b i corresponds to a tuple t and consists of two items:

tuple id t and a lower rank of t in b i , denoted by t

(

t

) ≤

h . Each tuple t

∈

.

L i , such that one of the following

holds: (1) Pr t . L i

i − 1

h

and Pr t . L i + 1

i − 1

h

when Pr m

i − 1

h

(

t

) ≤

(

t

) >

(

t

) >

;or(2) t

.

L i =

m

when Pr m

i

−

1

h . Each tuple t

T has an entry in the L -list. All entries in the

L -list are sorted in ascending order of the lower ranks.

Prob-interval b 1 is associated with only a U -list but no L -list. Prob-interval b h

is associated with only an L -list but no U -list. For any tuple t

(

t

) ≤

∈

∈

T , t

.

L 1

=

0 and

t

m . The reason is that the lower ranks for entries in b 1 are always 0 and the

upper ranks for entries in b h are always m . Those two lists can be omitted.

.

U h

=

Example 5.5 ( PRist ). For tuples t 1 ,···,

t 4 in Table 2.2, whose top-k probabilities are

plotted in Figure 2.5, the PRist index is shown in Figure 5.3(a).

Next Page

Ranking Queries on Uncertain Data

Search WWH ::

Custom Search

Home