Digital Signal Processing Reference
In-Depth Information
still providing improvements over standard speech enhancement techniques.
The proposed dialog-based framework provides improved recognition performance
over calibration-only systems; this effect is attributed to the ability to continually
update enhancement parameters according to changes in noise conditions.
Acknowledgments Parts of the work presented here were funded through the Australian Coop-
erative Research Centre for Advanced Automotive Technology (AutoCRC).
References
1. Kleinschmidt T, Mason M, Wong E, Sridharan S (2009) The Australian english speech corpus
for
in-car speech processing.
In: Proceedings of
ICASSP.
IEEE Computer Society,
Washington, DC, pp 4177-4180
2. Seltzer ML, Raj B, Stern RM (2004) Likelihood-maximizing beamforming for robust hands-
free speech recognition. IEEE Trans Speech Audio Process 12(5):489-498
3. BabaAli B, Sameti H, Safayani M (2009) Likelihood-maximizing-based multiband spectral
subtraction for robust speech recognition. EURASIP J Adv Signal Process 878105:1-15
4. Shi G, Aarabi P, Jiang H (2007) Phase-based dual-microphone speech enhancement using a
prior speech model. IEEE Trans Audio Speech Lang Process 15(1):109-118
5. Lockwood P, Boudy J, Blanchet M (1992) Non-linear spectral subtraction (NSS) and hidden
Markov models for robust speech recognition in car noise environments. In: Proceedings of the
ICASSP, San Francisco, pp 265-268
6. Wu K-G, Chen P-C (2001) Efficient speech enhancement using spectral subtraction for car
hands-free applications. In: Proceedings of IEEE international conference on consumer elec-
tronics, Los Angeles, pp 220-221
7. Kleinschmidt T, Boyraz P, Boˇil H, Sridharan S, Hansen JHL (2009) Assessment of speech
dialog systems using multi-model cognitive load analysis and driving performance metrics. In:
Proceedings of IEEE international conference on vehicular electronics & safety, Pune,
pp 167-172
8. Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans
Acoust Speech Signal Process 27(2):113-120
9. Nasersharif B, Akbari A (2006) A framework for MFCC feature extraction using
SNR-dependent compression of enhanced Mel-filter bank energies.
INTERSPEECH,
1632-Mon1A20.3
10. Kleinschmidt T (2010) Robust speech recognition using speech enhancement. PhD thesis,
Queensland University of Technology
11. Lee B, Hasegawa-Johnson M, Goudeseune C, Kamdar S, Borys S, Liu M, Huang T (2004)
AVICAR:
audio-visual
speech corpus
in a
car environment.
In: Proceedings of
INTERSPEECH, Jeju Island, pp 2489-2492
12. Kleinschmidt T, Dean D, Sridharan S, Mason M (2007) A continuous speech recognition
protocol for the AVICAR database. In: Proceedings of ICSPCS, Gold Coast, pp 339-344
13. Young S, Evermann G, Gales M, Hain T, Kershaw D, Liu X, Moore G, Odell J, Ollason D,
Povey D, Valtchev V, Woodland P (2006) The HTK Book. Cambridge University: Engineer-
ing Department, Cambridge
Search WWH ::




Custom Search