Database Reference
In-Depth Information
e σ 2
1
1
2
z ( O LN ( C , O , σ ))
(
G h
(
o
,
z
)
G h
( o
,
z
))
2 h 2
|
O
|
π
(
,
,
, σ )
(
,
,
, σ )
(
,
,
, σ )
(
,
,
, σ )
Since LT
o
C
O
LT
o
C
O
, LT
o
C
O
LT
o
C
O
0. Thus,
1
h 2
e σ 2
(
,
)
(
,
)
T
o
O
T
o
O
2 h 2
π
Inequality 4.2 is shown.
From Theorem 4.1, we can also derive the minimum neighborhood threshold
value
σ
to satisfy certain approximation quality.
Corollary 4.1 (Choosing neighborhood threshold). Given an uncertain o bject O,
an instance x
2ln 2
S, and a quality requirement
θ
,if
σ
πθ
h
·
h, then
T
(
x
,
O
)
LT
(
x
,
C
,
O
, σ ) < θ
for any subset C
(
x
C
)
.
Proof. From Theorem 4.1, for any instance o
C , subset C
O and neighborhood
threshold
σ
,wehave
1
h 2
e σ 2
T
(
x
,
O
)
LT
(
x
,
C
,
O
, σ ) <
2 h 2
π
e σ 2
1
h 2
θ
θ
In order to meet the quality requirement
, it should hold that
2 h 2
.
π
2ln 2
Therefore,
σ
πθ
h
·
h .
4.2.2 DLTA: Direct Local Typicality Approximation Using
VP-trees
Inequality 4.3 in Theorem 4.1 can be used immediately to approximate simple typ-
icality computation with quality guarantee. Given a neighborhood threshold
σ
, for
each instance x
S , the direct local typicality approximation (DLTA) algorithm
σ
{
}
( {
},
, σ )
computes the
-neighborhood of
x
, i.e., LN
x
O
and the local simple typi-
cality LT
(
x
,{
x
},
O
, σ )
. Then, it returns the k instances with the highest local simple
typicality values as the approximation to the answer of the top- k simple typicality
query.
The quality of the approximation answer is guaranteed by the following theorem.
Theorem 4.2 (Approximation quality). Given an uncertain object O, neighbor-
hood threshold
and integer k, let A be the k instances with the highest simple
typicality values, and A be the k instances with the highest local simple typicality
values. Then,
σ
A T
(
o
,
O
) o A T
(
o
,
O
)
1
h 2
e σ 2
o
<
(4.7)
2 h 2
k
π
 
Search WWH ::




Custom Search