Information Technology Reference
In-Depth Information
Räsänen O, Pohjalainen J (2013) Random subset feature selection in automatic recognition of
developmental disorders, affective states, and level of conflict from speech. In: Proceedings of
interspeech, Lyon, Aug 2013, pp 210-214
Salakhutdinov R (2009) Learning deep generative models. Ph.D. thesis, University of Toronto
Schmidhuber J (1992) Learning complex extended sequences using the principle of history
compression. Neural Comput 4(2):234-242
Schuller B (2012)
The computational paralinguistics challenge.
IEEE Signal Process Mag
29(4):97-101
Schuller B, Batliner A (2013) Computational paralinguistics: emotion, affect and personality in
speech and language processing. Wiley, New York
Schuller B, Batliner A, Steidl S, Seppi D (2011) Recognising realistic emotions and affect
in speech: state of the art and lessons learnt from the first challenge. Speech Commun
53(9/10):1062-1087 [Special Issue on Sensing Emotion and Affect - Facing Realism in Speech
Processing]
Schuller B, Steidl S, Batliner A, Nöth E, Vinciarelli A, Burkhardt A, van Son R, Weninger F, Eyben
F, Bocklet T, Mohammadi G, Weiss B (2012) The interspeech 2012 speaker trait challenge. In:
Proceedings of interspeech, Portland, OR
Schuller B, Steidl S, Batliner A, Vinciarelli A, Scherer K, Ringeval F, Chetouani M Weninger
F, Eyben F, Marchi E, Mortillaro M, Salamin H, Polychroniou A, Valente F, Kim S (2013)
The interspeech 2013 computational paralinguistics challenge: social signals, conflict, emotion,
autism. In: Proceedings of interspeech, Lyon, Aug 2013
Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process
45:2673-2681
Stuhlsatz A, Meyer C, Eyben F, Zielke T, Meier G, Schuller B (2011) Deep neural networks for
acoustic emotion recognition: raising the benchmarks. In: Proceedings of ICASSP, Prague, pp
5688-5691
Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features
with denoising autoencoders. In: Proceedings of ICML, New York, NY, 2008, pp 1096-1103
Vinciarelli A, Dielmann A, Favre S, Salamin H (2009) Canal9: a database of political debates
for analysis of social interactions. In: Proceedings of the international conference on affective
computing and intelligent interaction, Sept 2009, pp 1-4
Vinciarelli A, Pantic M, Bourlard H (2009) Social signal processing: survey of an emerging
domain. Image Vis Comput 27(12):1743-1759
Waibel A, Hanazawa T, Hinton G, Shikano K, Lang K (1989) Phoneme recognition using time-
delay neural networks. IEEE Trans Acoust Speech Signal Process 37(3):328-339
Wang N, Melchior J, Wiskott L (2012) An analysis of Gaussian-binary restricted Boltzmann
machines for natural images. In: Proceedings of ESANN, Bruges, Apr 2012, pp 287-292
Wrede B, Shriberg E (2003) Spotting “hot spots” in meetings: human judgments and prosodic
cues. In: Proceedings of Eurospeech, ISCA, Geneva, Sept 2003, pp 2805-2808
Yamamoto K, Asano F, Yamada T, Kitawaki N (2006) Detection of overlapping speech in meetings
using support vector machines and support vector regression. IEICE Trans Fundam Electron
Commun Comput Sci 89-A(8):2158-2165
Zeiler M, Ranzato M, Monga R, Mao M, Yang K, Le QV, Nguyen P, Senior A, Vanhoucke V,
Dean J, Hinton G (2013) On rectified linear units for speech processing. In: ICASSP, IEEE,
Vancouver, May 2013, pp 3517-3521
Zelenák M, Hernando J (2011) The detection of overlapping speech with prosodic features for
speaker diarization. In: Proceedings of interspeech, ISCA, Florence, Aug 2011, pp 1041-1044
Search WWH ::




Custom Search