8.6.4 Sparse Bayesian Learning (SBL) and Automatic Relevance Determination (ARD)
Sparse Bayesian learning (SBL) uses the same Gaussian likelihood model defined in (8.8) and a hierarchical Bayes formulation similar to that explained for MAP estimation, but instead of integrating out the hyperparameters as in parameter MAP estimation, SBL integrates out the parameters [45, 44, 55, 89, 100, 79, 69, 99, 101, 60]. Thus, instead of finding point estimates at the posterior modes using fixed priors, SBL performs an evidence maximization procedure that learns adaptive hyperparameters from the data itself. SBL assumes an automatic relevance determination (ARD) prior for the current density, defined as
$$p(\mathbf{J} \mid \boldsymbol{\alpha}) = \prod_{i=1}^{d_\alpha} \mathcal{N}\!\left(\mathbf{J}_{i:} \mid \mathbf{0},\, \alpha_i^{-1}\mathbf{I}\right), \qquad (8.27)$$
where $\boldsymbol{\alpha}$ is a vector of hyperparameters or precisions (i.e., inverse source variances), $d_\alpha$ is the number of hyperparameters, and each $\mathbf{J}_{i:}$ has a zero-mean Gaussian prior with covariance $\alpha_i^{-1}\mathbf{I}$. The inverse source and noise variances have Gamma hyperpriors,
$$p(\boldsymbol{\alpha}) = \prod_{i=1}^{d_\alpha} \mathrm{Gamma}(\alpha_i \mid a, b), \qquad (8.28)$$
$$p(\sigma_\Upsilon^{-2}) = \mathrm{Gamma}(\sigma_\Upsilon^{-2} \mid c, d), \qquad (8.29)$$
where $a$, $b$, $c$, and $d$ are the degrees-of-freedom parameters of the Gamma distributions of $\sigma_\Upsilon^{-2}$ and $\boldsymbol{\alpha}$, given by $\mathrm{Gamma}(\alpha \mid a, b) = \Gamma(a)^{-1} b^{a} \alpha^{a-1} e^{-b\alpha}$, with $\Gamma(a) = \int_0^\infty t^{a-1} e^{-t}\, dt$. The Gamma hyperprior results in a Student-t prior for the source parameters. However, to avoid tuning the hyperprior, the Gamma distribution parameters can be set to a small number (e.g., $a = b = c = d = 10^{-4}$) to make these priors noninformative (i.e., flat in log space, as is common for scale parameters), or they can be made exactly zero, in which case we obtain the Jeffreys prior, which results in scale invariance.
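As a numerical illustration of the Student-t claim, the minimal NumPy/SciPy sketch below samples precisions from the Gamma hyperprior of (8.28), draws Gaussian sources as in (8.27), and compares the resulting marginal against the implied Student-t distribution. The values $a = 3$, $b = 2$ are illustrative assumptions, not from the chapter; the noninformative setting $a = b = 10^{-4}$ yields an essentially improper, extremely heavy-tailed marginal that cannot be sampled stably.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Illustrative hyperparameters (assumed for this sketch, not from the chapter).
a, b = 3.0, 2.0
n = 200_000

# (8.28): precisions alpha ~ Gamma(shape=a, rate=b); NumPy parameterizes the
# Gamma by shape and scale, so scale = 1 / rate.
alpha = rng.gamma(shape=a, scale=1.0 / b, size=n)

# (8.27): each source drawn as a zero-mean Gaussian with variance alpha^{-1}.
J = rng.normal(size=n) / np.sqrt(alpha)

# Integrating alpha out analytically yields a Student-t with 2a degrees of
# freedom and scale sqrt(b / a); the KS statistic should be close to zero.
t_marginal = stats.t(df=2 * a, scale=np.sqrt(b / a))
print("KS statistic:", stats.kstest(J, t_marginal.cdf).statistic)
```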
SBL is an important alternative because the posterior mode may not be representative of the full posterior; a better point estimate, the posterior mean, may therefore be obtained by tracking the posterior probability mass. In the case of the Jeffreys prior, this is achieved by finding the maximum likelihood hyperparameters $\boldsymbol{\alpha}^{(ml)}$ and $\sigma_\Upsilon^{2\,(ml)}$ that maximize a tractable Gaussian approximation of the evidence of the hyperparameters, also known as the type-II likelihood or marginal likelihood,

$$\hat{\boldsymbol{\alpha}}^{(ml)}, \hat{\sigma}_\Upsilon^{2\,(ml)} = \operatorname*{arg\,max}_{\boldsymbol{\alpha},\, \sigma_\Upsilon^{2}}\; p(\mathbf{B} \mid \boldsymbol{\alpha}, \sigma_\Upsilon^{2}), \qquad p(\mathbf{B} \mid \boldsymbol{\alpha}, \sigma_\Upsilon^{2}) = \int p(\mathbf{B} \mid \mathbf{J},\, \sigma_\Upsilon^{2})\, p(\mathbf{J} \mid \boldsymbol{\alpha})\, d\mathbf{J} = \mathcal{N}(\mathbf{0}, \boldsymbol{\Sigma}_B), \qquad (8.30)$$
or equivalently by minimizing the negative log marginal likelihood $-\log p(\mathbf{B} \mid \boldsymbol{\alpha}, \sigma_\Upsilon^{2})$.
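To make this objective concrete, the sketch below evaluates the negative log marginal likelihood under the common assumption (not spelled out in this excerpt) that the Gaussian likelihood of (8.8) is linear, $\mathbf{B} = \mathbf{L}\mathbf{J} + \boldsymbol{\Upsilon}$, with lead field $\mathbf{L}$ and i.i.d. noise variance $\sigma_\Upsilon^{2}$, so that integrating out $\mathbf{J}$ as in (8.30) gives $\boldsymbol{\Sigma}_B = \sigma_\Upsilon^{2}\mathbf{I} + \mathbf{L}\,\mathrm{diag}(\boldsymbol{\alpha})^{-1}\mathbf{L}^\top$. All names and shapes are illustrative assumptions, not the chapter's notation.

```python
import numpy as np

def neg_log_marginal_likelihood(B, L, alpha, sigma2):
    """-log p(B | alpha, sigma2) up to an additive constant, cf. (8.30).

    B      : (n_sensors, n_times) data matrix
    L      : (n_sensors, d_alpha) lead field (assumed linear forward model)
    alpha  : (d_alpha,) source precisions (ARD hyperparameters)
    sigma2 : scalar noise variance
    """
    n_sensors, n_times = B.shape
    # Model covariance of each data column after integrating out J:
    # Sigma_B = sigma2 * I + L diag(1/alpha) L^T.
    Sigma_B = sigma2 * np.eye(n_sensors) + (L / alpha) @ L.T
    _, logdet = np.linalg.slogdet(Sigma_B)
    # Quadratic term summed over columns: trace(Sigma_B^{-1} B B^T).
    quad = np.trace(np.linalg.solve(Sigma_B, B @ B.T))
    return 0.5 * (n_times * logdet + quad)
```

A practical SBL implementation would minimize this quantity over $\boldsymbol{\alpha}$ and $\sigma_\Upsilon^{2}$ with EM or fixed-point updates; hyperparameters $\alpha_i$ driven toward infinity correspond to sources pruned to zero variance, which is the ARD mechanism that produces sparse solutions.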