Information Technology Reference
In-Depth Information
• The types of the relations are equals: x
)
• one element of R 1 is related to at most one element of R 2 :
R
y
l
(
x
)=
l
(
y
x
R 1 ,
(
! y
R 2 , such that x
R
y
)
R
• one element of R 2 is related to at most one element of R 1 :
x
y
R 2 ,
(
! x
R 1 , such that x
R
y
)
y
To define the similarity measure regarding the context of two concepts, we use the two sets INTERSEC
and COMPL , defined as follows as follows.
INTERSEC is a set of couples of relations nodes of R 1 and R 2 that are related through the
R
R
relation.
x , y )
Let INTERSEC
= { (
x , y
)
R 1 ×
R 2 |
x
R
y with
(
x , y
)
R 1 ×
R 2 ,
(
R 1 ×
R 2 ,ifx
R
y
x R
y ,x
x
y }
.
COMPL is the set of relations that could not be related through
=
y
=
R
R 1 ×
R 2 |
y
x , y )
x
= { (
)
(
Let COMPL
x , y
R 2 such that
INTERSEC
x , y )
= ∅) }
The similarity of the context of c 1 and c 2 is then defined according to the cardinality of the two sets
INTERSEC and COMPL :
(
(
= ∅
R 1 such that
INTERSEC
x
y
|
|
INTERSEC
(
)=
sim context
c 1 , c 2
|
| + |
|
INTERSEC
COMPL
5.3.1.4 Similarity of concepts
To compare two concepts, now, we use a similarity measure that combines all the measures
described above.
The order of importance of the component of two concepts, when processing their similarity
is the following one:
1. their concept types
2. their referents
3. the context in which they are used
To account for this hierarchy of importance, within the similarity measure sim
gene , we apply
different coefficients to the individual similarity (and dissimilarity) measures: a coefficient
of 4 is applied to the part accounting for the similarity of the concept types, 2 to the part
accounting for the referents and 1 for the contexts. In order to keep a normalized similarity,
the the similarity score processed as described above is divided by 7.
|
2
Definition 5.6. The similarity measure sim
|
gene :
C
[
0, 1
]
, where
C
is a set of concepts defined
on the same vocabulary, is expressed as follows:
(
2 such that c 1 =[
c 1 , c 2 ) ∈C
t 1 : v 1 ]
and c 2 =[
t 2 : v 2 ]
,
• If the most specific common parent of t 1 and t 2 is the root of the type hierarchy, we have
sim
|
(
c 1 , c 2 )=
0 .
gene
• Otherwise, we have
sim
(
diss type (
t 1 , t 2 ))+
sim ref (
v 1 , v 2 )+
sim context (
c 1 , c 2 )
4
1
2
|
(
c 1 , c 2 )=
gene
7
where diss type , sim ref and sim context are the similarity and dissimilarity measures defined above.
 
Search WWH ::




Custom Search