Decomposition in Transition: Adaptive Matrix Factorization - Realtime Data Mining

Database Reference

In-Depth Information

Proposition 8.7 Let ( V , h , i ) be an inner product space and S V. Then, for

all c , w

V ,

∈

;

i kk min

v ∈ S

v c

k max

v ∈ S

;

i c

;

i þ kk max

v ∈ S

v c

Proof Taking maximums on both sides of the rightmost inequality of the above

Lemma yields

max

v ∈ S

;

i max

v ∈ S

;

i þ kkv c

max

v ∈ S

;

i þ max

v ∈ S

kkv c

¼ c

;

iþkk max

v ∈ S

v c

Similarly, by taking maximums in the left inequality, we obtain

max

v ∈ S

;

i max

v ∈ S

;

i þ kkv c

max

v ∈ S

;

i þ min

v ∈ S

kkv c

¼ c

;

iþkk min

v ∈ S

v c

□

This result equips us with a means to enclose the exact maximum inner product

between a vector in the considered cluster and a given vector, while the only

information that is required is the inner product between the given vector and the

cluster center and the maximum distance from the cluster center. The latter,

emerging as a side product of the foregoing clustering procedure, may safely be

regarded as readily available.

By virtue of our bound, we may retrofit our branching heuristic into a full-blown

branch-and-bound procedure which computes the exact result, though forfeiting a

large degree of efficiency. To balance the tradeoff between fast computation and

accuracy, we introduce a parameter h

[0;1] to relax the bound.

Let T denote the refinement tree obtained from recursive k-means clustering of

the x i , and let children (.) denote the functions that map a node of T to the set of its

children. We define for t

∈

u h ðÞ:¼ c t ;

i þhkk max

i ∈ C t

x i c t

k , l h ðÞ:¼ c t ;

i hkk min

i ∈ C t

x i c t

k ,

where C t , c t denote the cluster and cluster center corresponding to t ,respectively.

Moreover, in the below-described algorithm, select _ node ( ) denotes some function

which judiciously picks the next node to be refined from a list of candidate nodes (one

may, e.g., choose the node t with the greatest upper bound u h ( t ) among all nodes in

the list).

Realtime Data Mining

Search WWH ::

Custom Search

Home