Database Reference
In-Depth Information
And for case (2)
π
r
s
r
s
r
r
J
1 +
1
1
1
π
r
J
1 J
π
r
r
+
r
r
J
1
1
s
J ( s, r )=
s
r
1
s
r
1
π
r
r
J
1
1 J π
π
r
r
+
r
r
J
r
J s
1
1
π
r
s
J
J
.
π
r
π
r
J
s + J
−J
s
J
We denote the new similarity upper bound with
π
r
s
J
J
J =
.
J
s + J
π
r
−J
s
J
π
r
The following holds
xy
Lemma 7.7. The function f ( x, y )=
x + y−xy is monotonically increas-
ing in x and y .
Proof. f ( x, y ) is monotonically increasing in x and y if and only if the
function g ( x, y )=
1
f ( x,y )
is monotonically decreasing in x and y . But
g ( x, y )= 1
x + 1
y
1 ,
which is clearly monotonically decreasing in x and y .
J is monotonically
Hence the new upper bound similarity threshold
π r . Since by construction strings are processed
s and
J
J
increasing in
s , they are also processed in decreasing order
in decreasing order of
J
J . Given a new string r , ready to be inserted in list L ( λ π ), as we
are scanning L ( λ π ) to identify new candidate pairs with respect to r ,
we can stop scanning once we encounter the first string s s.t.
of
J <
J k .
No subsequent string can have a larger similarity upper bound, since,
by construction, they have been inserted in L ( λ π ) in decreasing order
 
Search WWH ::




Custom Search