Introduction - Learning to Rank for Information Retrieval

Information Technology Reference

In-Depth Information

Partly because of the central role of ranker in search engines, great attention

has been paid to the research and development of ranking technologies. Note that

ranking is also the central problem in many other information retrieval applications,

such as collaborative filtering [ 30 ], question answering [ 3 , 71 , 79 , 83 ], multimedia

retrieval [ 86 , 87 ], text summarization [ 53 ], and online advertising [ 16 , 49 ]. In this

topic, we will mainly take document retrieval in search as an example. Note that

even document retrieval is not a narrow task. There are many different ranking sce-

narios of interest for document retrieval. For example, sometimes we need to rank

documents purely according to their relevance with regards to the query. In some

other cases, we need to consider the relationships of similarity [ 73 ], website struc-

ture [ 22 ], and diversity [ 91 ] between documents in the ranking process. This is also

referred to as relational ranking [ 61 ].

To tackle the problem of document retrieval, many heuristic ranking models have

been proposed and used in the literature of information retrieval. Recently, given

the amount of potential training data available, it has become possible to leverage

machine learning technologies to build effective ranking models. Specifically, we

call those methods that learn how to combine predefined features for ranking by

means of discriminative learning “learning-to-rank” methods.

In recent years, learning to rank has become a very hot research direction in

information retrieval. First, a large number of learning-to-rank papers have been

published at top conferences on machine learning and information retrieval. Exam-

ples include [ 4 , 7 - 10 , 15 , 18 , 20 , 21 , 25 , 26 , 31 , 37 , 44 , 47 , 55 , 58 , 62 , 68 , 72 , 76 ,

82 , 88 , 89 ]. There have been multiple sessions in recent SIGIR conferences ded-

icated for the topic on learning to rank. Second, benchmark datasets like LETOR

[ 48 ] have been released to facilitate the research on learning to rank. 5 Many re-

search papers on learning to rank have used these datasets for their experiments,

which make their results easy to compare. Third, several activities regarding learn-

ing to rank have been organized. For example, the workshop series on Learning to

Rank for Information Retrieval (2007-2009), the workshop on Beyond Binary Rel-

evance: Preferences, Diversity, and Set-Level Judgments (2008), and the workshop

on Redundancy, Diversity, and Interdependent Document Relevance (2009) have

been organized at SIGIR. A special issue on learning to rank has been organized at

Information Retrieval Journal (2009). In Table 1.1 , we have listed the major activ-

ities regarding learning to rank in recent years. Active participation of researchers

in these activities has demonstrated the continued interest from the research com-

munity on the topic of learning to rank. Fourth, learning to rank has also become

a key technology in the industry. Several major search engine companies are using

learning-to-rank technologies to train their ranking models. 6

When a research area comes to this stage, several questions as follows naturally

arise.

5 http://research.microsoft.com/~LETOR/ .

6 http://glinden.blogspot.com/2005/06/msn-search-and-learning-to-rank.html ,

http://www.ysearchblog.com/2008/07/09/boss-the-next-step-in-our-open-search-ecosystem/ .

Learning to Rank for Information Retrieval

Search WWH ::

Custom Search

Home