Digital Signal Processing Reference
In-Depth Information
2 )) in the preceding algorithm is suffi-
cient but not necessary for the algorithm to converge (the upper bound could actually
be 2
2
Remark 1 The condition
μ t
(0
,
2
/
(
|||
H
|||
||| |||
2 ). Other variants of this algorithm have been proposed very recently in the
literature to accelerate the forward-backward algorithm for solving ( P λ ) . These meth-
ods basically propose different strategies to choose the descent step-size sequence (
/ |||
H
|||
μ t ) t
to get faster practical convergence behavior; see, for example, Elad (2006), Vonesch
and Unser (2007), and Wright et al. (2009). These algorithms can all be seen as spe-
cial instances of a more general forward-backward splitting considered by Chen and
Rockafellar (1997) (see also references therein), where
μ t may actually be a matrix.
Accelerated Multistep Algorithm
In the same vein as in Section 7.3.2.3, the convergence rate on the objective (7.26)
can be improved to O (1
t 2 ) using the accelerated multistep Nesterov algorithm
(Nesterov 2007). Indeed, problem ( P λ ) satisfies the necessary requirements on F 1
and F 2 to apply the Nesterov iteration. Unlike Algorithm 22, the Nesterov acceler-
ated version uses explicitly all previous iterates
/
( t ) , hence the
name “multistep algorithm.” Algorithm 23 details the steps of the Nesterov scheme
to minimize ( P λ ). It is formulated using proximal operators, as described by Weiss
et al. (2009).
( k ) , k
α
<
t to compute
α
Algorithm 23 Nesterov Scheme to Solve ( P λ )
(0)
T ,
2
2 )),
(0)
Initialization: Choose some
α
∈ R
μ
(0
,
2
/
(
|||
H
|||
||| |||
θ 0 =
0,
ξ
=
0.
Main iteration:
for t
=
0 to N iter
1 do
1. First proximal computation:
( t )
(0)
( t ) )
υ
=
prox
(
α
ξ
.
(7.29)
θ t
μθ t
( t )
( t )
( t )
= θ t α
+
a t υ
2. Set a t =
μ +
μ
2
+
4
/
2 and
ω
.
θ t +
a t
3. Second proximal computation:
+ 2 H y
( t ) )
( t
+
1)
( t )
α
=
prox
ω
H
(
ω
.
(7.30)
μ/
2
4. Accumulate the gradient directions:
a t H y
1) )
( t
+
1)
( t )
( t
+
ξ
= ξ
H
(
α
.
(7.31)
5. Update the step
θ t + 1 = θ t +
a t .
( N iter ) .
Output: Solution of ( P λ ):
α
It worth noting that Beck and Teboulle (2009), building on another paper by
Nesterov, have proposed an accelerated algorithm to solve ( P λ ) that achieves the
same global rate of convergence as Algorithm 23. Despite some similarities, the
 
Search WWH ::




Custom Search