Iterative Optimization - A Rapid Introduction to Adaptive Filtering

$$u_i(n) = (1 - \mu\lambda_i)^{n+1}\, u_i(-1) \tag{3.11}$$

Then, each eigenvalue $\lambda_i$ determines the mode of convergence $(1 - \mu\lambda_i)$, which goes along the direction defined by its associated eigenvector. In order for the algorithm to converge as $n \to \infty$, the misalignment vector (and its transformed version) must vanish. Since (3.11) describes an exponential behavior, the necessary and sufficient condition for the stability of the SD algorithm is

$$|1 - \mu\lambda_i| < 1, \quad i = 0, 1, \ldots, L-1. \tag{3.12}$$
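The behavior of a single mode can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the function names `mode_trajectory` and `is_stable`, the eigenvalues, and the step sizes are hypothetical and chosen only for illustration.

```python
import numpy as np

def mode_trajectory(mu, lam, u_init, n_iters):
    """Evaluate u_i(n) = (1 - mu*lam)^(n+1) * u_i(-1) for n = 0..n_iters-1."""
    n = np.arange(n_iters)
    return (1.0 - mu * lam) ** (n + 1) * u_init

def is_stable(mu, eigenvalues):
    """Check the condition |1 - mu*lambda_i| < 1 for every eigenvalue."""
    eigenvalues = np.asarray(eigenvalues, dtype=float)
    return bool(np.all(np.abs(1.0 - mu * eigenvalues) < 1.0))

# Hypothetical eigenvalues of R_x, used only as an example.
eigs = [0.5, 1.0, 4.0]
for mu in (0.1, 0.45, 0.6):
    # Magnitude of each mode after 50 iterations, starting from u_i(-1) = 1.
    final = [abs(mode_trajectory(mu, lam, 1.0, 50)[-1]) for lam in eigs]
    print(f"mu={mu}: stable={is_stable(mu, eigs)}, final magnitudes={final}")
```

With these example values, the mode tied to the largest eigenvalue is the first to violate the condition as the step size grows, while the other modes still decay.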

This shows that the stability of the algorithm depends only on $\mu$ (a design parameter) and $\mathbf{R}_x$ (or more precisely, its eigenvalues). To satisfy the stability condition, the step size should be chosen according to

$$0 < \mu < \frac{2}{\lambda_{\max}}. \tag{3.13}$$
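As an illustration, the bound can be computed from a given autocorrelation matrix and checked by iterating the misalignment vector. This is only a sketch: the 2x2 matrix, the initial misalignment, and the function names are hypothetical, and the recursion $\tilde{\mathbf{w}}(n) = (\mathbf{I} - \mu\mathbf{R}_x)\,\tilde{\mathbf{w}}(n-1)$ is assumed here as the standard SD misalignment update whose diagonalized form gives (3.11).

```python
import numpy as np

def step_size_upper_bound(R_x):
    """Return 2 / lambda_max for a symmetric positive definite matrix R_x."""
    eigenvalues = np.linalg.eigvalsh(R_x)  # real eigenvalues, ascending order
    return 2.0 / eigenvalues[-1]

def misalignment_norm(R_x, mu, w_tilde_init, n_iters):
    """Iterate w_tilde <- (I - mu*R_x) w_tilde and return the final norm."""
    w_tilde = np.asarray(w_tilde_init, dtype=float)
    A = np.eye(R_x.shape[0]) - mu * R_x
    for _ in range(n_iters):
        w_tilde = A @ w_tilde
    return float(np.linalg.norm(w_tilde))

# Hypothetical autocorrelation matrix (eigenvalues 0.5 and 1.5).
R_x = np.array([[1.0, 0.5],
                [0.5, 1.0]])
mu_max = step_size_upper_bound(R_x)

for factor in (0.9, 1.1):
    norm = misalignment_norm(R_x, factor * mu_max, [1.0, -0.5], 200)
    print(f"mu = {factor:.1f} * mu_max -> final misalignment norm {norm:.3e}")
```

A step size slightly below the bound drives the misalignment toward zero, while one slightly above it makes the norm grow without limit.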

Recalling the canonical form of $J_{\mathrm{MSE}}(\mathbf{w})$ introduced in (2.25), we can use (3.11) to write

$$J_{\mathrm{MSE}}(n) = J_{\mathrm{MMSE}} + \xi(n) = J_{\mathrm{MMSE}} + \sum_{i=0}^{L-1} \lambda_i (1 - \mu\lambda_i)^{2(n+1)}\, u_i^2(-1). \tag{3.14}$$
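The sum of exponentials in (3.14) is easy to evaluate directly. The sketch below assumes NumPy; the function name `learning_curve` and all numerical values (minimum MSE, eigenvalues, initial transformed misalignment, step size) are hypothetical.

```python
import numpy as np

def learning_curve(j_mmse, eigenvalues, u_init, mu, n_iters):
    """Evaluate J_MSE(n) = J_MMSE + sum_i lam_i (1 - mu*lam_i)^(2(n+1)) u_i(-1)^2."""
    lam = np.asarray(eigenvalues, dtype=float)
    u0 = np.asarray(u_init, dtype=float)
    n = np.arange(n_iters)[:, None]            # column of iteration indices
    modes = (1.0 - mu * lam) ** (2 * (n + 1))  # shape (n_iters, L)
    emse = modes @ (lam * u0 ** 2)             # excess MSE xi(n), shape (n_iters,)
    return j_mmse + emse

# Hypothetical example values.
curve = learning_curve(j_mmse=0.1, eigenvalues=[0.5, 1.0, 4.0],
                       u_init=[1.0, 1.0, 1.0], mu=0.3, n_iters=100)
print("J_MSE(0)  =", curve[0])
print("J_MSE(99) =", curve[-1])
```

For a step size inside the stable range, the curve decreases at every iteration and approaches the minimum MSE, since each term of the sum shrinks by the factor $(1 - \mu\lambda_i)^2$ per iteration.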

The second term $\xi(n)$ is known as the excess mean square error (EMSE) and measures how far the algorithm is from the minimum. Equation (3.14) shows the evolution through the error surface as a function of the iteration number, and is known as the learning curve or MSE curve. It is the sum of $L$ exponentials associated with the natural modes of the algorithm. Since $\lambda_i (1 - \mu\lambda_i)^2 \geq 0 \;\; \forall i$, when (3.13) is satisfied the convergence is also monotonic (and the EMSE goes to zero in steady state). Clearly, the choice of $\mu$ will not only affect the stability of the algorithm but also its convergence performance (when stable). Actually, among the $L$ modes of convergence $(1 - \mu\lambda_i)$ there will be one with the largest magnitude, which gives the slowest rate of convergence to the associated component of the transformed vector $\mathbf{u}(n)$. Therefore, this will be the mode that determines the overall convergence speed of the SD algorithm. It is then possible to look for a value of $\mu$ ($\mu_{\mathrm{opt}}$) that guarantees the maximum overall rate of convergence by minimizing the magnitude of the slowest mode, i.e.,

$$\mu_{\mathrm{opt}} = \underset{|1 - \mu\lambda_i| < 1}{\arg\min}\; \max_{i = 0, \ldots, L-1} |1 - \mu\lambda_i|. \tag{3.15}$$
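The min-max problem in (3.15) can be approximated by brute force over a grid of step sizes inside the stable range. This is a rough numerical sketch rather than a recommended method; the function name `mu_opt_grid`, the grid resolution, and the eigenvalues are hypothetical.

```python
import numpy as np

def mu_opt_grid(eigenvalues, n_grid=100001):
    """Approximate argmin over mu of max_i |1 - mu*lam_i| on a uniform grid."""
    lam = np.asarray(eigenvalues, dtype=float)
    # Grid over the open interval (0, 2/lam_max); both endpoints are dropped.
    mus = np.linspace(0.0, 2.0 / lam.max(), n_grid)[1:-1]
    # For each candidate mu, the magnitude of the slowest mode.
    worst = np.max(np.abs(1.0 - np.outer(mus, lam)), axis=1)
    k = int(np.argmin(worst))
    return float(mus[k]), float(worst[k])

# Hypothetical eigenvalues of R_x.
mu_best, slowest = mu_opt_grid([0.5, 1.0, 4.0])
print(f"approximate mu_opt = {mu_best:.4f}, slowest mode magnitude = {slowest:.4f}")
```

The second returned value is the magnitude of the slowest mode at the selected step size, which sets the overall convergence speed for that choice of $\mu$.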

In looking for $\mu_{\mathrm{opt}}$ we only need to study the modes associated with $\lambda_{\max}$ and $\lambda_{\min}$ as a function of $\mu$, since all the others will lie in between. For $\mu < \mu_{\mathrm{opt}}$ the mode associated with $\lambda_{\max}$ has smaller magnitude than the one associated with $\lambda_{\min}$. As the reverse holds for $\mu > \mu_{\mathrm{opt}}$, the optimal step size must satisfy the condition
