Design Considerations for a Social Network-Based Recommendation System (SNRS) - Community-Built Databases: Research and Development

Database Reference

In-Depth Information

4.2 Background

After the pioneering work done in the Grouplens project in 1994 [ 5 ], collaborative

filtering (CF) soon became one of the most popular algorithms in recommender

systems. Many variations of this algorithm have also been proposed such as hybrid

approaches of combining CF with content-based filtering [ 7 , 17 - 19 ] or adopting

different weighting schemes [ 6 , 20 ]. In this chapter, we will use the traditional CF

proposed in the Grouplens project as one of the comparison methods. Therefore, the

remainder of this section will focus on this algorithm.

The assumption of CF is that people who agreed in the past tend to agree again in

the future. Therefore, CF first finds users with tastes similar to those of the target

users. CF will then make recommendations to the target user by predicting the

target user's rating of the target item based on the ratings of his/her top- K similar

users. User ratings are often represented by discrete values within a certain range,

for example, 1-5. Here 1 indicates an extreme dislike of the target item, while

5 shows high praise. Let R UI be the rating of the target user U on the target item I .

Thus, R UI is estimated as the weighted sum of the votes of similar users as follows.

R UI ¼

Z X

R U þ

;

Þð

R VI

R V Þ ;

(4.1)

2 C

where R U and R V represent the average ratings of the target user U and every user

V in U 's neighborhood,

, which consists of the top- K similar users of U . w ( U , V )

is the weight between users U and V , and z

P V wU ; V

is a normalizing constant to

normalize total weight to one. Specifically, w ( U, V ) can be defined using the

Pearson correlation coefficient [ 5 ].

P I ð

R V Þ

P I ð

R UI

R U Þð

R VI

Þ¼

(4.2)

;

P I ð

R UI

R U Þ

R VI

R V Þ

where the summations over I are over the common items for which both user U and

V have voted.

As we can see, the traditional CF models user-to-user relations based purely on

user rating similarities and does not utilize at all the semantic friend relations among

users. However, such semantics are essential to the buying decisions of users. In the

following sections, we are going to present a new paradigm of recommender systems

which improves the performance of recommender systems by using the semantic

information in social networks.

4.3 Yelp.com

For this research, we collect a dataset from a real online social network Yelp.com.

As one of the most popular Web 2.0 websites, Yelp provides users with local

searches for restaurants, shopping, spas, nightlife, etc. Besides maintaining the

Community-Built Databases: Research and Development

Search WWH ::

Custom Search

Home