Digital Signal Processing Reference
In-Depth Information
factored into a sufficient statistics followed by a single-microphone post-filter.
As a straightforward extension of [
1
], if we know the RIRs, optimal estimation
of the speech signal can be achieved using the simple two-step method. How-
ever, it is actually not easy to satisfy the assumption of the known RIRs. In this
chapter, we address a realistic implementation of the sufficient statistics with
unknown RIRs.
If we know the source signal, we can adaptively estimate the RIRs based on an
acoustic echo cancelation scheme [
4
]. Because more correctly beamformed output
is nearer to the original source signal, we might be able to use the beamformed
output as a reference signal to estimate the RIRs [
5
]. In this chapter we propose
using a delay-and-sum beamformer (DSB) to provide the information necessary for
an initial constrained estimate of the RIR, which is then updated iteratively using
a multi-path generalized sidelobe canceller (GSC) based on the evolving RIR
estimate. Good RIR estimation makes the multi-path GSC more accurate, and
this again guarantees better RIR estimation. We demonstrate that, with a reasonable
constraint on the sparsity of the room impulse response, the algorithm converges to
a useful approximate RIR. Even though we may not get perfect RIR identification,
the converged RIR is nevertheless sufficient to compute coefficient vectors for a
multi-path fixed beamformer (FBF) which outperforms the naive DSB. By
leveraging the converged RIR, we are able to mitigate the common practical
problem of multi-path GSC, namely, its tendency to cancel the target signal due
the indistinguishability of signal from reverberation at the beamformer.
To visualize the situation in a tractable way, we first show the convergence of a
simplified version of the proposed scheme. A simple simulation test shows that this
method achieves sufficient blind deconvolution at the output of FBF. We then
evaluate the proposed algorithm using real-world moving-car recordings [
6
].
13.2 Proposed Method
13.2.1 Multi-path GSC
Multi-path GSC can be formulated as an optimization problem as shown in (
13.1
),
which is a generalized version of GSC [
7
] under a known multi-path environment,
represented by the RIR as coded into a constraint matrix C:
n
o
¼ f
w
T
T
subject to
C
T
argmin
~
E
~
~
y
ð
n
Þ~
y
ð
n
Þ
~
w
~
w
;
(13.1)
w
w
T
is an estimated source signal at the current time
n
,
f
where
s
^
ð
n
Þ¼~
~
y
ð
n
Þ
¼
T
.
½
is a noisy signal vector measured by the microphone array,
the array of filter coefficients is
10
0
y
~
ð
n
Þ
T
encoding the estimated
L-tap inverse RIR filters for all of the N recorded signals, and
~
w
¼
½
w
1
w
2
w
NL
Search WWH ::
Custom Search