asserts that, on the basis of the values of the empirical risk, one can estimate the minimal possible value of the risk.
8.2.2 Key theorem of learning theory
In 1989 Vapnik and Chervonenkis proposed the key theorem of learning theory as follows (Vapnik and Chervonenkis, 1991).
Theorem 8.1 Let $L(y, w)$, $w \in \Lambda$, be a set of functions that satisfy the condition

$$A \le \int L(y, w)\, dF(y) \le B \qquad (A \le R(w) \le B). \tag{8.6}$$
Then for the ERM principle to be consistent in the following sense:
$$\lim_{l \to \infty} P\Bigl\{ \sup_{w \in \Lambda} \bigl( R(w) - R_{\mathrm{emp}}(w) \bigr) > \varepsilon \Bigr\} = 0 \qquad \forall \varepsilon > 0, \tag{8.7}$$
it is necessary and sufficient that the empirical risk $R_{\mathrm{emp}}(w)$ converge uniformly to the actual risk $R(w)$ over the set of functions $L(y, w)$, $w \in \Lambda$. We call this type of uniform convergence uniform one-sided convergence.
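As a concrete illustration of (8.7), here is a minimal Monte Carlo sketch that estimates $P\{\sup_{w \in \Lambda}(R(w) - R_{\mathrm{emp}}(w)) > \varepsilon\}$ for a small finite class of threshold classifiers. The model, the noise level, and the grid of thresholds are illustrative assumptions, not part of the text; the actual risk is available in closed form only because the toy distribution is fully known.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative toy model (an assumption, not from the text):
# x ~ Uniform(0, 1), true label 1[x > 0.5], flipped with probability NOISE.
# The loss is the 0-1 loss of the threshold classifiers f_w(x) = 1[x > w].
NOISE = 0.1
thresholds = np.linspace(0.0, 1.0, 21)  # a finite stand-in for the set Lambda

def actual_risk(w):
    """R(w) = P(f_w(x) != y), in closed form for this toy distribution."""
    # f_w disagrees with the true rule on a region of length |w - 0.5|.
    return NOISE + (1.0 - 2.0 * NOISE) * abs(w - 0.5)

def empirical_risk(w, x, y):
    """R_emp(w): average 0-1 loss of f_w on the sample (x, y)."""
    return np.mean((x > w).astype(int) != y)

def sup_deviation(l):
    """sup over w in Lambda of (R(w) - R_emp(w)) for one sample of size l."""
    x = rng.uniform(0.0, 1.0, l)
    flip = (rng.uniform(size=l) < NOISE).astype(int)
    y = (x > 0.5).astype(int) ^ flip
    return max(actual_risk(w) - empirical_risk(w, x, y) for w in thresholds)

# P{sup_w (R(w) - R_emp(w)) > eps} shrinks as the sample size l grows.
eps, trials = 0.05, 500
for l in (10, 100, 1000):
    p = np.mean([sup_deviation(l) > eps for _ in range(trials)])
    print(f"l = {l:4d}:  P(sup deviation > {eps}) ~ {p:.3f}")
```

For a finite Λ the supremum is a maximum over the grid; the printed probabilities decrease toward 0 as $l$ grows, which is exactly the consistency statement (8.7).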
8.2.3 VC entropy
Definition 8.2 Let $A \le L(y, w) \le B$, $w \in \Lambda$, be a set of bounded loss functions. Using this set of functions and the training set $z_1, \ldots, z_l$, one can construct the following set of $l$-dimensional vectors:
$$q(w) = \bigl( L(z_1, w), \ldots, L(z_l, w) \bigr), \quad w \in \Lambda. \tag{8.8}$$
This set of vectors belongs to the $l$-dimensional cube and has a finite minimal $\varepsilon$-net in the metric $C$ (or in the metric $L_p$).
Let $N^{\Lambda} = N^{\Lambda}(\varepsilon; z_1, \ldots, z_l)$ be the number of elements of the minimal $\varepsilon$-net of this set of vectors $q(w)$, $w \in \Lambda$. Note that $N^{\Lambda}(\varepsilon; z_1, \ldots, z_l)$ is a random variable, since it is constructed using the random vectors $z_1, \ldots, z_l$.
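To make the construction concrete, the sketch below builds the vectors $q(w)$ of (8.8) for a toy squared-loss model and counts the elements of a greedy $\varepsilon$-net in the metric $C$ (the sup norm). The model and the finite grid standing in for $\Lambda$ are assumptions for illustration, and the greedy cover only upper-bounds the minimal $\varepsilon$-net, which is hard to compute exactly.

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative toy model (an assumption, not from the text):
# z = (x, y) with x ~ Uniform(-1, 1), y = 0.7 x + noise, and the
# bounded squared loss L(z, w) = (y - w x)^2 over a finite grid of w.
l = 50
x = rng.uniform(-1.0, 1.0, l)
y = 0.7 * x + 0.1 * rng.standard_normal(l)
weights = np.linspace(-2.0, 2.0, 401)  # finite stand-in for Lambda

# Equation (8.8): one l-dimensional loss vector q(w) per function.
Q = np.stack([(y - w * x) ** 2 for w in weights])  # shape (|Lambda|, l)

def greedy_eps_net(Q, eps):
    """Greedy cover of the rows of Q in the metric C (sup norm).

    Its size upper-bounds the size of the minimal eps-net of {q(w)}."""
    centers = []
    uncovered = np.ones(len(Q), dtype=bool)
    while uncovered.any():
        i = int(np.flatnonzero(uncovered)[0])    # pick an uncovered vector
        centers.append(i)
        dist = np.max(np.abs(Q - Q[i]), axis=1)  # C-metric distance to q(w_i)
        uncovered &= dist > eps                  # points within eps are covered
    return centers

for eps in (0.5, 0.2, 0.1):
    print(f"eps = {eps}:  greedy eps-net size = {len(greedy_eps_net(Q, eps))}")
```

The printed count plays the role of $N^{\Lambda}(\varepsilon; z_1, \ldots, z_l)$ defined above (up to the greedy approximation): it is random because it depends on the drawn sample $z_1, \ldots, z_l$.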
The logarithm of the random value $N^{\Lambda}(\varepsilon; z_1, \ldots, z_l)$,

$$H^{\Lambda}(\varepsilon; z_1, \ldots, z_l) = \ln N^{\Lambda}(\varepsilon; z_1, \ldots, z_l),$$

is called the random VC entropy of the set of functions $A \le L(y, w) \le B$, $w \in \Lambda$, on the sample $z_1, \ldots, z_l$. The expectation of the random VC entropy,

$$H^{\Lambda}(\varepsilon; l) = E H^{\Lambda}(\varepsilon; z_1, \ldots, z_l),$$

is called the VC entropy of the set of functions on samples of size $l$.
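Continuing the same sketch (it reuses `weights` and `greedy_eps_net` from the block above, so the same illustrative assumptions apply), the expectation $H^{\Lambda}(\varepsilon; l) = E H^{\Lambda}(\varepsilon; z_1, \ldots, z_l)$ can be approximated by averaging the random VC entropy over independent samples of size $l$:

```python
def random_vc_entropy(l, eps, rng):
    """ln N(eps; z_1, ..., z_l) for one random sample, via the greedy net."""
    x = rng.uniform(-1.0, 1.0, l)
    y = 0.7 * x + 0.1 * rng.standard_normal(l)
    Q = np.stack([(y - w * x) ** 2 for w in weights])
    return np.log(len(greedy_eps_net(Q, eps)))

# Plain Monte Carlo stand-in for the expectation E H(eps; z_1, ..., z_l).
eps, trials = 0.2, 100
for l in (10, 50, 250):
    H = np.mean([random_vc_entropy(l, eps, rng) for _ in range(trials)])
    print(f"l = {l:4d}:  H(eps; l) ~ {H:.2f}")
```

In this toy model the averaged entropy grows with $l$ and then saturates, since the sup-norm distances between the loss vectors approach the sup-norm distances between the loss functions themselves as the sample fills the domain.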