The Optimal Set of Classifiers - Design and Analysis of Learning Classifier Systems

Information Technology Reference

In-Depth Information

The first expression can be evaluated by combining (7.6) and (7.7) to get

E W,τ (ln p ( Y

W , τ , Z ))

z nk

w kj x n ,τ − 1

E W,τ (ln

( y nj |

))

2 ln 2 π +

z nk

2 E τ (ln τ k )

−

z nk

E W,τ τ k ( y nj −

w kj x n ) 2

−

D 2

z nk E τ (ln τ k )

z nk

j E W,τ τ k ( y nj −

w kj x n ) 2 +const. ,

−

(7.59)

where k z nk = 1 was used. Using (7.12) and (7.11), the second expectation

results in

V )) =

E V (ln p ( Z

z nk E V (ln g k ( x n ))

≈

z nk ln g k ( x )

| v k = v k ,

(7.60)

where the expectation of ln g k ( x n ) was approximated by the logarithm of its

maximum a-posteriori estimate, that is, ln g k ( x n ) evaluated at v k = v k .This

approximation was applied as a direct evaluation of the expectation does not

yield a closed-form solution. The same approximation was applied by Waterhouse

et al. [227, 226] for the MoE model.

Combining the above expectations results in the posterior

ln q Z ( Z )=

z nk ln ρ nk +const. ,

(7.61)

with

j E W,τ τ k ( y nj −

w kj x n ) 2 .

| v k = v k + D 2 E τ (ln τ k )

ln ρ nk =ln g k ( x n )

−

(7.62)

Without the logarithm, the posterior becomes q Z ( Z ) ∝ n k ρ z nk

nk , and thus,

under the constraint k z nk =1,weget

q Z ( Z )=

ρ nk

j ρ nj

r z nk

nk ,

with

r nk =

E Z ( z nk ) .

(7.63)

As for all posteriors, the variational posterior for the latent variables has the

same distribution form as its prior (7.12).

Design and Analysis of Learning Classifier Systems

Search WWH ::

Custom Search

Home