Information Geometry of Covariance Matrix: Cartan-Siegel Homogeneous Bounded Domains, Mostow/Berger Fibration and Fréchet Median - Matrix Information Geometry

Digital Signal Processing Reference

In-Depth Information

q i

n i !

P M (

n 1 ,

n 2 ,...,

n M /

q 1 ,...,

q M ) =

with q i

priors

i = 1

n i

and p i

(9.1)

· √ 2

n n

e − n

Stirling formula gives n

!≈

π ·

n when n

→+∞

. We could then

observe that it converges to discrete version of Kullback-Leibler:

p i log p i

q i

Lim

log [ P M ]

(

)

(9.2)

→+∞

Based on variational approach, Donsker and Varadhan gave variational definition of

Kullback divergence:

Sup E p (φ) −

e φ )

(

) =

log E q (

(9.3)

Consider:

log p

(ω)

φ(ω) =

(ω)

log

log p

log E q (

e φ ) =

(ω)

⇒

E p (φ) −

(ω)

−

(ω)

(

) −

log

(

) =

(

)

This proves that the supremum over all

is no smaller than the divergence.

E p log e φ

E q (

log q φ (ω)

log E q (

e φ ) =

E p (φ) −

(ω)

e φ )

(ω)

) − E p (φ) −

log E q (

e φ ) =

e φ(ω)

(ω)

with q φ (ω) =

e φ(θ) ⇒

(

(ω)

(θ)

log p (ω)

q φ (ω)

0 using the divergence inequality.

In the same context, link with “Large Deviation Theory” and Fenchel-Legendre

transform which gives that logarithm of generating function are dual to Kullback

Divergence. This relation is given by:

≥

Matrix Information Geometry

Search WWH ::

Custom Search

Home