Database Reference
In-Depth Information
To proof Theorem 5.12, We first prove t
.
L i <
MR p (
t
)
. Following the defini-
i
1
h
tion of PRist ,wehave Pr t . L i
(
t
)
. Thus, MR i 1
h
(
t
)
t
.
L i , which follows from
i
1
h <
Lemma 5.3. Moreover, since
p ,wehave MR i 1
h
(
t
)
MR p (
t
)
, which follows
from Lemma 5.2. Thus, t
.
L i <
MR p (
t
)
.
Similarly, MR p (
t
)
t
.
U i can be proved.
Step 2: Pruning and Validating Given a top-
(
,
)
p
l
query, let b i be the prob-interval
U as a pruning condition. Any tuple t
whose lower rank in b i is at least k can be pruned. Moreover, for any tuple t ,ifthe
upper rank of t in b i is not greater than l lower ranks of other tuples in b i , then t can
be validated.
containing p . We use the l -th rank k in list b i
.
Theorem 5.13 (Answering Top-
queries). Let T be a set of uncertain tuples
indexed by PRist with granularity parameter h. For a tuple t
(
p
,
l
)
S, a top-
(
p
,
l
)
query,
and prob-interval b i such that p
b i :
l, then any tuple t such that
1. Pruning : Let T 1 = {
t x |
t x .
U i
t
.
U i
t x
T
}
.If
|
T 1 |≥
t .
U i is not an answer to the query;
2. Validating : Let T 2 = {
L i
t
.
t y |
t y .
L i
t
.
U i
t y
T
}
.If
|
T 2 |≤
l, then t is an answer to
the query.
Proof. To prove the first item, we only need to show that there are at least l tuples
whose p -ranks are smaller than MR p (
t )
. For any tuple t x
T 1 ,wehave MR p (
t x )
U i , which is guaranteed by Theorem 5.12. Similarly, for any tuple t such
t x .
U i
t
.
that t .
t )
t )
L i
t
.
U i ,wehave MR p (
t
.
U i . Therefore, MR p (
t x )
MR p (
. Since
|
l , there are at least l tuples like t x . So the first item holds.
To prove the second item, we only need to show that there are fewer than l tuples
whose p -ranks are smaller than MR p (
T 1 |≥
. For any tuple t
T 2 , since t .
t
)
L i >
t
.
U i ,
t ) >
(
(
)
MR p
MR p
t
. Therefore, only the tuples in T 2 may have smaller p -ranks than
MR p
(
)
|
|≤
t
. Since
T 2
l , the second item holds.
Step 3: Evaluating For any tuple that can neither be validated nor be pruned, we
need to calculate their exact p -ranks. To calculate the p -rank of tuple t , we calculate
the top-1, top-2,
···,
top- i probabilities for each rank i until we find the first rank k
such that Pr k
p .
The complexity is analyzed as follows. Let d 1 be number of tuples that cannot
be pruned in the pruning and validation phase. In the evaluating phase, the p -ranks
of the tuples that cannot be pruned or validated are calculated. For each such tuple,
its p -rank can be m in the worst case, where m is the number of rules in T . Then,
calculating the p -rank for each tuple takes O
(
t
)
m 2
time. Let d 2 be the number of
tuples that cannot be pruned or validated. The evaluating step takes O
(
)
m 2 d 2 )
(
time.
m 2 d 2 )
The overall complexity is O
(
d 1 +
.
 
Search WWH ::




Custom Search