Information Technology Reference
In-Depth Information
more aspects. In addition to content-based information, IMDb also contains ratings
and reviews. Most of the IMDb's data are available as RDF triples ensuring the
efficient automatic processing of the information.
Discussion : Semantic knowledge resources are valuable sources for recommender
systems because these resources provide different types of knowledge ranging from
fine-grained descriptions of entities (e.g., movies and actors) to ratings and to user
created reviews. Althoughmost resources are not perfectly tailored to a recommender
scenario, themachine readable representation of knowledgemakes it easy to use these
data in a recommender system. The comprehensive collection of data allows semantic
recommender systems to overcome the cold-start problem and to consider a greater
number of aspects than traditional recommender systems. In addition, the graph-
based representation of knowledge and the separation of structure (nodes and edges)
from natural language labels (“node names” might available in different languages)
simplifies the creation of explanations for computed recommendations.
The use of semantic approaches has many advantages; but there are also several
challenges arising from the complexity and the heterogeneity of semantic datasets.
The use of semantically represented data in recommender systems leads to sev-
eral new research questions. Most traditional recommender systems using semantic
datasets focus on datasets having only two node types avoiding problems with the
heterogeneity of edge semantics.
The aggregation of heterogeneous data into one big dataset requires a deep analy-
sis of the different sub-datasets focusing on the meaning of the respective entities
and relationships. In addition, optimized scaling as well as weighting models and
domain-specific edge algebras should be defined reflecting the semantics of the spe-
cific datasets [ 19 ]. Last but not least, the computational complexity of processing
huge graphs must be taken into account when defining a recommender approach for
semantic datasets.
In the next sections we discuss the challenges in detail and present an approach
for learning a universal semantic recommender.
5.3.3 A Dataset for a Semantic Movie Recommender
In order to implement a semantic movie recommender system, we have to find
semantic data sources providing detailed knowledge about movies and all aspects
potentially relevant for computing recommendations. In addition, training data for
optimizing the recommender models are needed. In this section we describe our
semantic movie dataset and discuss the characteristics of the dataset. Subsequently,
we explain step-by-step how to build a powerful semantic recommender based on
the dataset.
The Internet Movie Database (IMDb) is an online database containing informa-
tion related to movies, actors, directors, and production data. IMDb is one of the
most popular online entertainment websites with over 160 million visits each month
[ 12 , 28 ]. Most of the movie data are freely available as Linked Open Data simplifying
Search WWH ::




Custom Search