chain rule. From Figure 7.8, if we mentally build a signal-flow graph from each weight of interest up to the output, we can obtain
$$
\begin{aligned}
\frac{\partial J_{BP}(n)}{\partial w_{ij}} &= \frac{\partial J_{BP}(n)}{\partial e(n)} \cdot \frac{\partial e(n)}{\partial y(n)} \cdot \frac{\partial y(n)}{\partial u_o(n)} \cdot \frac{\partial u_o(n)}{\partial y_i(n)} \cdot \frac{\partial y_i(n)}{\partial u_i(n)} \cdot \frac{\partial u_i(n)}{\partial w_{ij}} \\
&= 2 \cdot e(n) \cdot (-1) \cdot (1) \cdot w_i \cdot \varphi'[u_i(n)] \cdot x_j(n) \\
&= -2\, e(n)\, w_i\, \varphi'[u_i(n)]\, x_j(n)
\end{aligned}
\qquad (7.33)
$$
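To make the chain-rule factorization in (7.33) concrete, the short sketch below checks the analytic derivative against a finite-difference estimate for a single hidden-layer weight. It is only an illustration of the formula, not the book's Algorithm 7.1; the tanh activation, the network dimensions, and all variable names (`phi_prime`, `W`, `w_o`, and so on) are assumptions made for this example.

```python
import numpy as np

# Assumed activation and its derivative; the text only writes phi, so tanh is a choice.
phi = np.tanh
phi_prime = lambda u: 1.0 - np.tanh(u) ** 2

rng = np.random.default_rng(0)
K, N_neuron = 3, 4                      # assumed input and hidden-layer sizes
W = rng.normal(size=(N_neuron, K))      # hidden-layer weights, row i is w_i
w_o = rng.normal(size=N_neuron + 1)     # output weights [w_0, w_1, ..., w_{N_neuron}]
x = rng.normal(size=K)                  # input vector x(n)
d = 0.7                                 # desired response d(n)

def cost(W_h):
    """Instantaneous cost J_BP(n) = e^2(n) for hidden-layer weights W_h."""
    u = W_h @ x                         # u_i(n)
    y_int = np.concatenate(([1.0], phi(u)))
    y = w_o @ y_int                     # Equation (7.34)
    return (d - y) ** 2

# Analytic gradient of Equation (7.33) for the single weight w_ij
i, j = 1, 2                             # zero-based neuron and input indices
u = W @ x
e = d - w_o @ np.concatenate(([1.0], phi(u)))
grad_analytic = -2.0 * e * w_o[i + 1] * phi_prime(u[i]) * x[j]

# Finite-difference check of the same derivative
eps = 1e-6
W_pert = W.copy()
W_pert[i, j] += eps
grad_numeric = (cost(W_pert) - cost(W)) / eps

print(grad_analytic, grad_numeric)      # the two estimates should agree closely
```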
Notice that we may express the output in a compact form as
$$
y(n) = \mathbf{w}_o^T\, \mathbf{y}_{int}(n)
\qquad (7.34)
$$

where $\mathbf{y}_{int}(n) = [\,1,\; y_1(n),\; \ldots,\; y_{N_{neuron}}(n)\,]^T$ and $\mathbf{w}_o^T = [\,w_0,\; w_1,\; w_2,\; \ldots,\; w_{N_{neuron}}\,]$. Thus, the gradients of $J_{BP}(n)$ related to the weights in the output layer $\mathbf{w}_o$ and to the bias $w_0$ are given by
$$
\frac{\partial J_{BP}(n)}{\partial \mathbf{w}_o} = -2\, e(n)\, \mathbf{y}_{int}(n)
\qquad (7.35)
$$

$$
\frac{\partial J_{BP}(n)}{\partial w_0} = -2\, e(n)
\qquad (7.36)
$$
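As a small numerical illustration of (7.35) and (7.36), the snippet below forms both output-layer gradients from an error sample and the hidden-layer outputs; the concrete values and variable names are invented for the example, not taken from the book.

```python
import numpy as np

# Illustrative values (not from the book): an error sample e(n) and the
# hidden-layer outputs y_1(n), ..., y_{N_neuron}(n) at one time instant.
e_n = 0.25
y_hidden = np.array([0.3, -0.1, 0.8])

# y_int(n) = [1, y_1(n), ..., y_{N_neuron}(n)]^T, with the leading 1 for the bias
y_int = np.concatenate(([1.0], y_hidden))

grad_w_o = -2.0 * e_n * y_int   # Equation (7.35)
grad_w_0 = -2.0 * e_n           # Equation (7.36), i.e. the bias entry grad_w_o[0]
```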
Now, if we define
$$
\mathbf{x}(n) = [\,x_1(n)\;\; x_2(n)\;\; \cdots\;\; x_K(n)\,]^T
\qquad (7.37)
$$
we can express the gradient of $J_{BP}(n)$ with respect to the $i$th set of weights present in the hidden layer by

$$
\frac{\partial J_{BP}(n)}{\partial \mathbf{w}_i} = -2\, e(n)\, w_i\, \varphi'[u_i(n)]\, \mathbf{x}(n), \qquad i = 1, \ldots, N_{neuron}
\qquad (7.38)
$$

being $\mathbf{w}_i = [\,w_{i1},\; w_{i2},\; \ldots,\; w_{iK}\,]^T$.
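Similarly, (7.38) can be evaluated for one hidden neuron as sketched below. Note that the scalar $w_i$ on the right-hand side is the output-layer weight attached to neuron $i$ (the $\partial u_o(n)/\partial y_i(n)$ factor of (7.33)), so it is named `w_i_out` in the code to avoid confusion with the hidden-layer vector $\mathbf{w}_i$; the tanh activation and the numeric values are assumptions of this example.

```python
import numpy as np

# Hidden-layer gradient of Equation (7.38) for one neuron i.
# phi is assumed to be tanh, so phi'(u) = 1 - tanh(u)^2.
phi_prime = lambda u: 1.0 - np.tanh(u) ** 2

e_n = 0.25                          # e(n)
w_i_out = -0.4                      # output-layer weight w_i multiplying y_i(n)
u_i = 0.9                           # pre-activation u_i(n) of neuron i
x_n = np.array([1.2, -0.7, 0.05])   # input vector x(n) from Equation (7.37)

# Gradient with respect to w_i = [w_i1, ..., w_iK]^T: one entry per input x_j(n)
grad_w_i = -2.0 * e_n * w_i_out * phi_prime(u_i) * x_n
```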
Having found the gradient vector with respect to the weights of the hidden layer and of the output layer, we are in a position to update all weights of our network in the spirit of the steepest-descent method. However, we must not forget that we purposely calculated the derivatives with respect to an instantaneous squared error, which means that the resulting BPA will be conceptually similar to the LMS procedure. This approach constitutes the online BPA, which is particularly suitable for real-time applications. It is expressed by Algorithm 7.1.
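Since Algorithm 7.1 is not reproduced in this excerpt, the following sketch only suggests how the per-sample (online) update could look when the gradients (7.35), (7.36), and (7.38) are plugged into a steepest-descent step; the step size `mu`, the tanh activation, and the toy desired response are assumptions of the example, not the book's specification.

```python
import numpy as np

# Minimal online BP sketch: one steepest-descent step per incoming sample,
# using the instantaneous squared error, in the spirit of the LMS procedure.
phi = np.tanh
phi_prime = lambda u: 1.0 - np.tanh(u) ** 2

rng = np.random.default_rng(1)
K, N_neuron, mu = 2, 5, 0.05                 # assumed sizes and step size
W = 0.1 * rng.normal(size=(N_neuron, K))     # hidden-layer weights, row i is w_i
w_o = 0.1 * rng.normal(size=N_neuron + 1)    # output weights [w_0, w_1, ..., w_N]

for n in range(2000):
    x = rng.uniform(-1.0, 1.0, size=K)       # x(n)
    d = np.sin(np.pi * x[0]) * x[1]          # assumed desired response d(n)

    u = W @ x                                # u_i(n)
    y_int = np.concatenate(([1.0], phi(u)))  # y_int(n)
    y = w_o @ y_int                          # Equation (7.34)
    e = d - y                                # e(n)

    grad_w_o = -2.0 * e * y_int                               # (7.35)-(7.36)
    grad_W = -2.0 * e * np.outer(w_o[1:] * phi_prime(u), x)   # (7.38), one row per neuron
    w_o -= mu * grad_w_o                     # steepest-descent updates
    W -= mu * grad_W
```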