Statistical Ranking Framework - Learning to Rank for Information Retrieval


16.2.1 The Pointwise Approach

We denote all the m documents associated with query q as $\mathbf{x} = \{x_j\}_{j=1}^{m}$, and their relevance degrees as $\mathbf{y} = \{y_j\}_{j=1}^{m}$. Note that the subset ranking framework makes no assumption about how each individual document is sampled. Instead, it is $(\mathbf{x}, \mathbf{y})$ (which is a representation of the corresponding query) that is regarded as a random variable sampled from the space $\mathcal{X}^m \times \mathcal{Y}^m$ according to an unknown probability distribution $P$.

Suppose the pointwise loss function is $L(f; x_j, y_j)$. Then the expected risk can be represented as follows,

$$R(f) = \int_{\mathcal{X}^m \times \mathcal{Y}^m} \frac{1}{m} \sum_{j=1}^{m} L(f; x_j, y_j)\, P(d\mathbf{x}, d\mathbf{y}). \tag{16.9}$$

Intuitively, the expected risk means the average loss that a ranking model f

would make for all the documents associated with a random query q . Since it is

almost impossible to compute the expected risk, in practice, the empirical risk on

the training set is used as an estimate of the expected risk.

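Given the training data $\{(\mathbf{x}^{(i)}, \mathbf{y}^{(i)})\}_{i=1}^{n}$, the empirical risk is

$$\hat{R}(f) = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{m} \sum_{j=1}^{m} L\big(f; x_j^{(i)}, y_j^{(i)}\big). \tag{16.10}$$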
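The empirical risk of the pointwise approach can be illustrated with a minimal Python sketch: it averages the loss over the documents of each query and then over the training queries. The squared-error loss, the function names, and the toy data are assumptions made for illustration only; the text does not fix a particular pointwise loss.

```python
from typing import Callable, Sequence, Tuple

# One training query: (documents, relevance degrees). Each document is
# reduced to a single float feature here purely for illustration.
Query = Tuple[Sequence[float], Sequence[float]]


def squared_loss(score: float, label: float) -> float:
    """Hypothetical pointwise loss L(f; x_j, y_j): squared error."""
    return (score - label) ** 2


def pointwise_empirical_risk(
    f: Callable[[float], float],
    queries: Sequence[Query],
    loss: Callable[[float, float], float] = squared_loss,
) -> float:
    """Average over queries of the per-query average pointwise loss."""
    total = 0.0
    for docs, labels in queries:
        m = len(docs)
        # Inner average over the m documents of this query.
        total += sum(loss(f(x), y) for x, y in zip(docs, labels)) / m
    # Outer average over the training queries.
    return total / len(queries)


# Toy data (assumed): two queries with m = 3 documents each.
queries = [
    ([0.0, 1.0, 2.0], [0.0, 1.0, 2.0]),
    ([1.0, 2.0, 3.0], [0.0, 2.0, 2.0]),
]
risk = pointwise_empirical_risk(lambda x: x, queries)
print(risk)
```

With the identity scoring function the first query contributes zero loss and the second contributes 2/3, so the printed value is their average, 1/3.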

16.2.2 The Pairwise Approach

For the pairwise approach, once again, we denote all the m documents associated with query q as $\mathbf{x} = \{x_j\}_{j=1}^{m}$, and denote the relevance degrees as $\mathbf{y} = \{y_j\}_{j=1}^{m}$. We regard $(\mathbf{x}, \mathbf{y})$ as a random variable sampled from the space $\mathcal{X}^m \times \mathcal{Y}^m$ according to an unknown probability distribution $P$.

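Suppose the pairwise loss function is $L(f; x_u, x_v, y_{u,v})$. For any two different documents $x_u$ and $x_v$, we denote $y_{u,v} = 2 \cdot I_{\{y_u \succ y_v\}} - 1$, so that $y_{u,v} = +1$ when $x_u$ is more relevant than $x_v$ and $-1$ otherwise. Accordingly, the expected risk can be represented as follows,

$$R(f) = \int_{\mathcal{X}^m \times \mathcal{Y}^m} \frac{2}{m(m-1)} \sum_{u=1}^{m-1} \sum_{v=u+1}^{m} L(f; x_u, x_v, y_{u,v})\, P(d\mathbf{x}, d\mathbf{y}). \tag{16.11}$$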
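As a small illustration of the pair labels, the sketch below enumerates the m(m-1)/2 pairs with u < v for one query and assigns each a label in {+1, -1}. It assumes numeric relevance degrees, so the preference relation is taken to be plain `>`; the function name is hypothetical, and indices are 0-based rather than 1-based as in the text.

```python
from typing import Dict, Sequence, Tuple


def pairwise_labels(labels: Sequence[float]) -> Dict[Tuple[int, int], int]:
    """Map each pair (u, v) with u < v to 2 * I{y_u > y_v} - 1."""
    m = len(labels)
    return {
        (u, v): 2 * int(labels[u] > labels[v]) - 1
        for u in range(m - 1)
        for v in range(u + 1, m)
    }


# Toy relevance degrees (assumed) for m = 3 documents.
pairs = pairwise_labels([2, 0, 1])
print(pairs)  # {(0, 1): 1, (0, 2): 1, (1, 2): -1}
```

Under this definition a tied pair (equal relevance degrees) also receives the label -1, since the indicator is zero.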

Intuitively, the expected risk means the average loss that a ranking model f

would make for all the document pairs associated with a random query q . Since it

is almost impossible to compute the expected risk, in practice, the empirical risk on

the training set is used as an estimate of the expected risk. In particular, given the training data $\{(\mathbf{x}^{(i)}, \mathbf{y}^{(i)})\}_{i=1}^{n}$, the empirical risk can be defined as follows,

$$\hat{R}(f) = \frac{1}{n} \sum_{i=1}^{n} \frac{2}{m(m-1)} \sum_{u=1}^{m-1} \sum_{v=u+1}^{m} L\big(f; x_u^{(i)}, x_v^{(i)}, y_{u,v}^{(i)}\big). \tag{16.12}$$
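The pairwise empirical risk can be sketched in Python along the same lines: for each training query, sum the loss over all pairs with u < v, normalize by the number of pairs, and average over queries. The hinge loss on score differences, the function names, and the toy data are assumptions for illustration; the text leaves the pairwise loss unspecified. The sketch also lets m vary per query and requires at least two documents per query.

```python
from typing import Callable, Sequence, Tuple

# One training query: (documents, relevance degrees), one float feature each.
Query = Tuple[Sequence[float], Sequence[float]]


def hinge_pair_loss(s_u: float, s_v: float, y_uv: int) -> float:
    """Hypothetical pairwise loss: hinge on the score difference."""
    return max(0.0, 1.0 - y_uv * (s_u - s_v))


def pairwise_empirical_risk(
    f: Callable[[float], float],
    queries: Sequence[Query],
    loss: Callable[[float, float, int], float] = hinge_pair_loss,
) -> float:
    """Average over queries of the per-query average pairwise loss."""
    total = 0.0
    for docs, labels in queries:
        m = len(docs)
        scores = [f(x) for x in docs]
        pair_sum = 0.0
        for u in range(m - 1):
            for v in range(u + 1, m):
                # Pair label: +1 if document u is more relevant, else -1.
                y_uv = 2 * int(labels[u] > labels[v]) - 1
                pair_sum += loss(scores[u], scores[v], y_uv)
        # There are m(m-1)/2 pairs, hence the factor 2 / (m(m-1)).
        total += 2.0 * pair_sum / (m * (m - 1))
    return total / len(queries)


# Toy data (assumed): one well-ordered query, one mis-ordered query.
queries = [
    ([3.0, 1.0, 2.0], [2, 0, 1]),
    ([0.0, 2.0], [1, 0]),
]
risk = pairwise_empirical_risk(lambda x: x, queries)
print(risk)
```

With the identity scoring function the first query is ranked correctly with a margin of at least 1, so it contributes zero loss; the second query's single pair is inverted and contributes a hinge loss of 3, giving an overall risk of 1.5.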
