[Figure 4.10: tree rooted at Dining, with children Asian Food (leaves: Thai Food, Chinese Food, Japanese Food) and European Food (leaves: French Food, Italian Food).]
Fig. 4.10 A TAH for relaxation of food styles
similarity between category C and category T. Thus, homophily effects among
friends U and V can be estimated as
\[
\mathrm{Hist}\bigl(W_{CT}\,(R_{UI} - R_{VI})\bigr), \quad I \in I(U) \cap I(V) \tag{4.12}
\]
Equation (4.12) counts U's and V's rating differences on all of their commonly
reviewed items. For each item I that they both reviewed, however, the contribution
of the rating difference on I to the final histogram is multiplied by a factor W_CT,
the similarity between the category of the target item and the category of
item I.
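The weighting described for (4.12) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. It assumes one possible reading of the text, namely that each commonly reviewed item adds W_CT (rather than 1) to the histogram bin of its rating difference. The function name, the dictionary-based inputs, and the `weight` callback are all hypothetical.

```python
from collections import defaultdict


def weighted_difference_histogram(ratings_u, ratings_v, item_category,
                                  target_category, weight):
    """Histogram of rating differences R_UI - R_VI over I(U) ∩ I(V).

    Assumption: each item contributes weight(C, T) to its bin, where C is
    the item's category and T is the target item's category.
    """
    hist = defaultdict(float)
    # Items reviewed by both users: I(U) ∩ I(V).
    common_items = ratings_u.keys() & ratings_v.keys()
    for item in common_items:
        diff = ratings_u[item] - ratings_v[item]
        hist[diff] += weight(item_category[item], target_category)
    return dict(hist)


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    ratings_u = {"pad_thai": 5, "dim_sum": 4, "lasagna": 2, "sushi": 3}
    ratings_v = {"pad_thai": 4, "dim_sum": 4, "lasagna": 3, "ramen": 5}
    item_category = {"pad_thai": "Thai", "dim_sum": "Chinese",
                     "lasagna": "Italian", "sushi": "Japanese",
                     "ramen": "Japanese"}
    # Placeholder weights; a real W_CT would come from the TAH.
    toy_weights = {"Thai": 1.0, "Chinese": 0.5, "Italian": 0.25}
    hist = weighted_difference_histogram(
        ratings_u, ratings_v, item_category, "Thai",
        lambda c, t: toy_weights[c])
    print(hist)
```

Items reviewed by only one of the two users (here `sushi` and `ramen`) are ignored, so their categories never reach the weight function.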
Given two categories C and T, the value of W_CT can be decided based on the
following two observations. First, let us define D(C, T) as the distance from
categories C and T to their lowest common ancestor LCA(C, T) in the TAH.
(Note that C and T are leaf nodes at the same depth.) The smaller the distance,
the closer C and T are in the domain space, and thus the more closely related they are.
Second, categories in a specific domain are more strongly related to one another
than categories in a general domain. We use |LCA(C, T)|, the number of leaf nodes
under LCA(C, T), to measure its generality. The larger |LCA(C, T)|, the more
general the domain space that both C and T belong to. Following these observations,
we propose to measure W_CT as in (4.13).
\[
W_{CT} =
\begin{cases}
1 & \text{if } C = T, \\
\dfrac{1}{D(C,T)\,\log_2\bigl(|\mathrm{LCA}(C,T)| + 1\bigr)} & \text{otherwise.}
\end{cases}
\tag{4.13}
\]
Therefore, the similarity between Thai food and Chinese food is
$1/\log_2 4 = 0.5$, while the similarity between Thai food and Italian food is
$1/(2\log_2 6) \approx 0.19$. Since
0.19 is less than 0.5, it is consistent with our intuition. Note that similar intuitions
have been used to estimate the similarity between two concepts in a TAH [29]. The
difference in our work is that we estimate the similarity between leaf nodes in a
TAH, while [29] has no such restriction. In addition, (4.13) assumes a linear decay
model of W_CT in D(C, T), which is arguable. Future work could select a
model better suited to a specific domain.
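The two worked values above can be checked with a short Python sketch of (4.13). It is an illustration under stated assumptions, not the authors' code: the child-to-parent map encodes the hierarchy as read from Fig. 4.10, and the helper names (`PARENT`, `path_to_root`, `lca_and_distance`, `leaf_count`, `category_similarity`) are hypothetical.

```python
import math

# Hierarchy of Fig. 4.10 as a child -> parent map (assumed structure).
PARENT = {
    "Asian Food": "Dining",
    "European Food": "Dining",
    "Thai Food": "Asian Food",
    "Chinese Food": "Asian Food",
    "Japanese Food": "Asian Food",
    "French Food": "European Food",
    "Italian Food": "European Food",
}


def path_to_root(node):
    """Return [node, parent, grandparent, ..., root]."""
    path = [node]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def lca_and_distance(c, t):
    """Return (LCA(C, T), D(C, T)) for two leaves at the same depth.

    D is counted as the number of edges from C up to the LCA, which equals
    the number of edges from T because both leaves share the same depth.
    """
    ancestors_of_t = path_to_root(t)
    for steps, node in enumerate(path_to_root(c)):
        if node in ancestors_of_t:
            return node, steps
    raise ValueError("categories are not in the same hierarchy")


def leaf_count(node):
    """Number of leaf nodes under `node`, i.e. |LCA(C, T)| in the text."""
    children = [child for child, parent in PARENT.items() if parent == node]
    if not children:
        return 1
    return sum(leaf_count(child) for child in children)


def category_similarity(c, t):
    """W_CT as in (4.13)."""
    if c == t:
        return 1.0
    lca, distance = lca_and_distance(c, t)
    return 1.0 / (distance * math.log2(leaf_count(lca) + 1))


if __name__ == "__main__":
    print(category_similarity("Thai Food", "Chinese Food"))
    print(round(category_similarity("Thai Food", "Italian Food"), 2))
```

For Thai and Chinese food the LCA is Asian Food (distance 1, three leaves); for Thai and Italian food it is Dining (distance 2, five leaves), which matches the terms in the two fractions above.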
Once we obtain W_CT, the homophily of a pair of users can be quantified [as
shown in (4.12)]. By doing so, even though these two users may not have enough