The Semantics of Search - Social Semantics: The Search for Meaning on the Web


Fig. 6.4 The interface used to judge Semantic Web results for relevancy

likelihood of purely coincidental agreement by the judges. Fleiss' κ both corrects for chance agreement and can be used for more than two judges (Fleiss 1971). The null hypothesis is that the judges cannot distinguish relevant from irrelevant results, and so are judging results randomly. Overall, for relevance judgments over both Semantic Web results and web-page results, κ = 0.5724 (p < 0.05, 95% confidence interval [0.5678, 0.5771]), indicating the rejection of the null hypothesis and 'moderate' agreement. For web-page results only, κ = 0.5216 (p < 0.05, 95% confidence interval [0.5150, 0.5282]), also indicating the rejection of the null hypothesis and 'moderate' agreement. Lastly, for Semantic Web results only, κ = 0.5925 (p < 0.05, 95% confidence interval [0.5859, 0.5991]), again indicating that the null hypothesis is to be rejected and 'moderate' agreement. So, in all cases there is 'moderate' agreement, which is sufficient given the general difficulty of producing perfectly reliable relevancy judgments. Interestingly enough, the difference in κ between the web-page results and Semantic Web results shows that the judges were actually slightly more reliable in their relevancy judgments of information from the Semantic Web than of information from the hypertext Web. This is likely due to the more widely varying nature of the hypertext results, as compared to the more consistent informational nature of Semantic Web results.
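To make the statistic concrete, the following is a minimal sketch of how Fleiss' κ can be computed from a table of judgment counts. It follows the standard textbook definition of the statistic rather than any code used in the study; the function name `fleiss_kappa`, the input layout, and the toy ratings are assumptions made for illustration only.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa for a table of category counts.

    counts[i][j] is the number of judges who assigned result i to
    category j (for example, j = 0 'relevant', j = 1 'not relevant').
    Every result must be rated by the same number of judges.
    """
    if len(counts) == 0:
        raise ValueError("at least one rated item is required")
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2:
        raise ValueError("at least two judges per item are required")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number of judgments")
    n_categories = len(counts[0])

    # Share of all judgments that fall into each category.
    p_j = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Observed agreement among judge pairs, per item.
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_items          # mean observed agreement
    p_e = sum(p * p for p in p_j)       # agreement expected by chance
    if p_e == 1.0:
        raise ValueError("kappa is undefined when only one category is used")
    return (p_bar - p_e) / (1.0 - p_e)


# Hypothetical toy data: 4 results, 3 judges, 2 categories.
toy_ratings = [[3, 0], [0, 3], [2, 1], [1, 2]]
print(round(fleiss_kappa(toy_ratings), 4))  # 0.3333
```

A value of 0 corresponds to agreement no better than chance, which is the situation described by the null hypothesis, and a value of 1 corresponds to perfect agreement. The sketch does not compute the p-values or confidence intervals reported in the text.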

Were judges more reliable with entities or concepts? Recalculating κ for all results based on entity queries gives κ = 0.5989 (p < 0.05, 95% confidence interval [0.5923, 0.6055]), while for all results based on concept queries κ = 0.5447 (p < 0.05, 95% confidence interval [0.5381, 0.5512]). So it appears that judges are slightly more reliable when discovering information about entities than about concepts, backing the claim made by Hayes and Halpin that there is more agreement in general about 'less' abstract things like people and places than about abstract concepts (Hayes and Halpin 2008). However, agreement is still very similar and 'moderate' for information about both entities and concepts. It is perhaps due to the entity-centric and concept-centric definition of relevance that the agreement was not higher.
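The 'moderate' label used throughout is consistent with the widely used Landis and Koch (1977) bands for interpreting κ, although the text does not say which scale was applied, so that attribution is an assumption. Under that assumption, a short sketch of the mapping looks like this; the function name `agreement_label` is hypothetical, and the κ values are read from the text as 0.5989 and 0.5447.

```python
def agreement_label(kappa: float) -> str:
    """Map a kappa value to a Landis and Koch (1977) agreement band.

    Assumption: this is the scale behind the 'moderate' label in the
    text; the source does not name the scale it used.
    """
    if kappa > 1.0:
        raise ValueError("kappa cannot exceed 1")
    if kappa < 0.0:
        return "poor"
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper_bound, label in bands:
        if kappa <= upper_bound:
            return label
    raise ValueError("unreachable for kappa in [0, 1]")


print(agreement_label(0.5989))  # moderate (entity queries)
print(agreement_label(0.5447))  # moderate (concept queries)
```

On this scale the entity-query value sits just below the 0.60 boundary, which fits the observation that the two query types differ only slightly and both remain 'moderate'.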
