Global Positioning System Reference
In-Depth Information
Table 3. contd.
Pairs of classes
HJ
Lin
Dice
MDSM v
MDSM c
tSim
GSim v
GSim c
(Department, Province)
0.5700
0.5393
0.2857
0.1620
0.1230
0.2500 0.3871 0.4114
(Department, Region)
0.7100
0.6212
0.2857
0.1543
0.1171
0.2500 0.4259 0.4571
(Department, Country)
0.8200
0.6724
0.2500
0.1350
0.1025
0.2500 0.4502 0.4857
(Department, State)
0.7300
0.6718
0.2857
0.1543
0.1171
0.2500 0.4500 0.4854
(Department, County)
0.7200
0.5993
0.5714
0.3240
0.2460
0.5000 0.5471 0.5554
Correlation
1.0000
0.7127
0.2972
0.3774
0.3419
0.3318 0.7364 0.8037
We observe that the highest correlation of similarity measure with
HJ is achieved by GSim c (0.8037), and the correlation of GSim v (0.7364)
with HJ is slightly higher than the correlation of Lin (0.7127). We also
observe a signifi cant difference between similarity scores obtained by the
hierarchical-based approach of Lin and the feature-based approaches of
Dice and MDSM . The scores obtained by the combined approach GSim,
in most cases are greater than the values computed according to Dice and
MDSM , and they are less than the measures calculated according to the
information content approach by Lin . The reason is twofold. Firstly, similar
to Dice and MDSM , the GSim approach considers the structure of classes,
and the heterogeneity of the attributes, in some cases, signifi cantly impacts
on the similarity values that are considerably less than the ones obtained
according to Lin . Secondly, the GSim method captures the informativeness
of geographic classes organized as a hierarchy, while Dice and MDSM are
mainly based on the common and non-common characteristics of classes.
In Table 3, some scores by Dice and MDSM are equal to zero, indicating
that the pairs of considered classes have no common features and they are
completely distinct. Similar results we obtain by tSim component of GSim ,
which is addressed to capture the similarity of classes' structures.
Analysis of parameters
In this section, we analyze the variability and commonality parameters in
the combined GSim method. These parameters change depending on the
variation of the number of occurrences of attributes and parts in formulas
(5), and (6). To this end, we perform two experiments. In the fi rst experiment,
we change attributes occurrences, and fi x the occurrences of parts, while
in the second one we change parts' occurrences and fi x the occurrences of
attributes.
We start with the fi rst experiment, for which we fi x the number of
occurrences of parts, say 19, and change occurrences of attributes. Intuitively,
as the number of occurrences of attributes increases, consequently, the
number of common attributes between classes increases as well. This
Search WWH ::




Custom Search