Generalization Analysis for Ranking - Learning to Rank for Information Retrieval

Information Technology Reference

In-Depth Information

17.3.2 For Subset Ranking

In [ 14 ], the stability theory [ 6 ] is extended to perform query-level generalization

analysis on the pairwise approach, under the framework of subset ranking. In par-

ticular, the average view is taken in the analysis.

To assist the analysis, the concept of uniform leave-one-query-out pairwise loss

stability (also referred to as query-level stability for short) is proposed in [ 14 ]. Sup-

pose we have learned a ranking model f 1 from a training set with n queries, using

an algorithm

Then we randomly remove a query and all its associated document pairs from the

training data, and learn a ranking model f 2 from the new training set. If the dif-

ference between the losses with respect to f 1 and f 2 on any unseen document pair

(x u ,x v ) is smaller than τ(n) , we say the algorithm

. Suppose L is the loss function that is minimized in algorithm

has uniform leave-one-query-

out pairwise loss stability with coefficient τ with respect to L .

Based on the concept of the query-level stability, a query-level generalization

bound has been derived in [ 14 ], as shown in Theorem 17.8 . The theorem states that if

a pairwise ranking algorithm has query-level stability, then with a large probability,

the expected query-level risk can be bounded by the empirical query-level risk and

a term that depends on the query number and the stability of the algorithm.

be a pairwise ranking algorithm , (q 1 ,S ( 1 ) ),...,(q n ,S (n) ) be

n training queries and their associated labeled documents , and let L be the pairwise

loss function . If

Theorem 17.8 Let

(q 1 ,S ( 1 ) ), . . . , (q n ,S (n) ),

∀

∈ Q

,(x u ,x v ,y u,v )

∈ X

× Y

L(f (q i ,S (i) ) i = 1 ;

x u ,x v ,y u,v )

|≤

B ,

has query-level stability with coefficient τ ,

(q i ,S (i) )

−

{

}

then with probability at least 1

δ( 0 <δ< 1 ) over the samples of

in the product space i = 1 { Q ×

) ∞ }

(

× Y

, the following inequality holds :

log δ

2 n

+ 4 nτ (n)

≤ R(f

R(f

1 )

2 τ(n)

(17.13)

i =

{ (q i ,S (i) ) }

When using this theorem to perform the query-level generalization analysis on

pairwise ranking algorithms, what one needs to do is to compute the query-level

stability coefficient τ(n) of the algorithms.

For example, as shown in [ 14 ], Ranking SVM [ 11 , 12 ] has a query-level stability

with coefficient τ(n)

4 κ 2

m (i)

m (i) is the number of document

λn ×

max

m (i) , where

n i = 1

κ 2 <

pairs associated with query q i , and

. On this basis, one

can have the following discussions regarding the query-level generalization ability

of Ranking SVM.

•

∀

∈ X

,K(x,x)

≤

∞

When the number of training queries approaches infinity, with a large probability

the empirical risk of Ranking SVM will converge to its expected risk, at a rate of

O( 1

√ n ) .

Learning to Rank for Information Retrieval

Search WWH ::

Custom Search

Home