Data Visualization via KernelMachines - Data Visualization

Graphics Reference

In-Depth Information

Kernel Machines in the Framework

of an RKHS

10.2

he goal of this section is twofold. First, it serves as an introduction to some ba-

sic RKHS theory that is relevant to kernel machines. Secondly, it provides a unified

framework for kernelizing some classical linear methods, such as PCA, CCA, sup-

port vector clustering (SVC), etc., to allow for nonlinear structure exploration. For

further details, we refer the reader to Aronszajn ( ) for the theory of reproduc-

ing kernels and reproducing kernel Hilbert spaces and Berlinet and homas-Agnan

( )fortheirusageinprobability, statistics andmachinelearning. Listedbeloware

some definitions and basic properties.

Let

R p bethesamplespaceofthedata,whichserveshereasanindexset.

A real symmetric function κ

X⊂

R is said to be positive definite if,for any

positive integer m, any sequence of numbers

a , a ,...,a m

,andpoints

i, j = a i a j κ

AnRKHSisaHilbertspaceofrealvaluedfunctions on

x , x ,...,x m

,wehave

(

x i , x j

)

thatsatisfy theproperty

thatallevaluation functionals areboundedlinearfunctionals. Notethat anRKHS

isaHilbertspaceofpointwise-definedfunctions,wherethe

-normconvergence

implies pointwise convergence.

For every positive definite kernel κ on

there is a corresponding unique

RKHS,denotedby

κ ,ofrealvaluedfunctionson

.Conversely,foreveryRKHS

there is a unique positive definite kernel κ such that

(ċ)

, κ

(

ċ)

(

)

∀

, which is known as the reproducing property. We say that this

RKHS admits the kernel κ. A positive definite kernel is also termed a “reproduc-

ing kernel.”

A reproducing kernel κ that satisfies the condition ∫ XX κ

∀

(

x,u

)

dxdu

has

a countable discrete spectrum given by

(

x,u

λ q ϕ q

(

)

ϕ q

(

)

,orκ

λ q ϕ q

ϕ q for short.

( . )

hemainideabehindkernelmachinesistofirstmapthedataintoanEuclideanspace

X⊂

R p into an infinite-dimensional Hilbertspace. Next, a particular classical statis-

ticalprocedure,suchasPCA,iscarriedoutinthisfeatureHilbertspace.Suchahybrid

model of a classical statistical procedure and a kernel machine is nonparametric in

nature, butwhen fitting the data it uses the underlying parametric procedure(forex-

ample,thePCAfindssomeofthemain linear components). heextra effortinvolved

is the preparation of the kernel data before they are fed into some classical proce-

dures. Below we will introduce two different but isomorphic maps that are used to

embed the underlying Euclidean sample space into a feature Hilbertspace. Consider

the transformation

′

(

λ ϕ

(

)

λ ϕ

(

)

,...,

λ q ϕ q

(

)

,...

)

( . )

Data Visualization

Search WWH ::

Custom Search

Home