Basics of Bayesian Inference - Electromagnetic Brain Imaging

Biomedical Engineering Reference

In-Depth Information

B.6.1

Derivation of the EM Algorithm Using the Free Energy

B.6.1.1

Derivation of Posterior Distribution (E-step)

As a preparation for introducing the variational technique, we derive the EM algo-

rithm in a different manner based on an optimization of a functional called the free

energy. In this section, the hyperparameters are collectively expressed as

. We define

a functional such that,

d x q

F [

(

), ʸ ]=

(

) [

log p

(

| ʸ ) −

log q

(

) ] .

(B.51)

This

F [

(

), ʸ ]

is a function of hyperparameters

and an arbitrary probability distri-

(

)

F [

(

), ʸ ]

bution q

is called the free energy using a terminology in statistical

physics. We show, in the following, that maximizing the free energy

.This

F [

(

), ʸ ]

with

respect to q

results in the E step, and maximizing it with respect to the hyperpa-

rameters results in the M step of the EM algorithm.

When maximizing

(

)

F [

(

), ʸ ]

with respect to q

(

)

, since q

(

)

is a probability

distribution, the constraint ∞

−∞

1 must be imposed. Therefore, this maxi-

mization problem can be formulated such that,

(

)

d x

subject to ∞

−∞

(

) =

F [

(

), ʸ ] ,

(

)

argmax

d x

(B.52)

(

)

Such a constrained optimization problem can be solved by using the method of

Lagrange multipliers, in which defining the Lagrange multiplier as

, the Lagrangian

is defined as

∞

,ʳ ]= F [

, ʸ ]+ ʳ

(

)

d x

−

−∞

∞

d x q

(

) [

log p

(

| ʸ ) −

log q

(

) ]+ ʳ

(

)

d x

−

(B.53)

−∞

The constrained optimization problem in Eq. ( B.52 ) is now rewritten as the uncon-

strained optimization problem in Eq. ( B.53 ). The probability distribution q

(

)

that

,ʳ ]

maximizes the Lagrangian

is the solution of the constrained optimization

problem in Eq. ( B.52 ).

Differentiating

,ʳ ]

with respect to q

(

)

, and setting the derivative to zero, we

have

ʴ L[

(

), ʳ ]

log p

(

| ʸ ) −

log q

(

) −

+ ʳ =

(B.54)

(

)

A brief explanation on the differentiation of a functional, as well as the derivation of

Eq. ( B.54 ), is presented in Sect. C.5 in the Appendix. Differentiating

,ʳ ]

with

respect to

gives

Electromagnetic Brain Imaging

Search WWH ::

Custom Search

Home