Be at Odds? Deep and Hierarchical Neural Networks for Classification and Regression of Conflict in Speech - Conflict and Multimodal Communication: Social Research and Machine Intelligence

Information Technology Reference

In-Depth Information

Räsänen O, Pohjalainen J (2013) Random subset feature selection in automatic recognition of

developmental disorders, affective states, and level of conflict from speech. In: Proceedings of

interspeech, Lyon, Aug 2013, pp 210-214

Salakhutdinov R (2009) Learning deep generative models. Ph.D. thesis, University of Toronto

Schmidhuber J (1992) Learning complex extended sequences using the principle of history

compression. Neural Comput 4(2):234-242

Schuller B (2012)

The computational paralinguistics challenge.

IEEE Signal Process Mag

29(4):97-101

Schuller B, Batliner A (2013) Computational paralinguistics: emotion, affect and personality in

speech and language processing. Wiley, New York

Schuller B, Batliner A, Steidl S, Seppi D (2011) Recognising realistic emotions and affect

in speech: state of the art and lessons learnt from the first challenge. Speech Commun

53(9/10):1062-1087 [Special Issue on Sensing Emotion and Affect - Facing Realism in Speech

Processing]

Schuller B, Steidl S, Batliner A, Nöth E, Vinciarelli A, Burkhardt A, van Son R, Weninger F, Eyben

F, Bocklet T, Mohammadi G, Weiss B (2012) The interspeech 2012 speaker trait challenge. In:

Proceedings of interspeech, Portland, OR

Schuller B, Steidl S, Batliner A, Vinciarelli A, Scherer K, Ringeval F, Chetouani M Weninger

F, Eyben F, Marchi E, Mortillaro M, Salamin H, Polychroniou A, Valente F, Kim S (2013)

The interspeech 2013 computational paralinguistics challenge: social signals, conflict, emotion,

autism. In: Proceedings of interspeech, Lyon, Aug 2013

Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process

45:2673-2681

Stuhlsatz A, Meyer C, Eyben F, Zielke T, Meier G, Schuller B (2011) Deep neural networks for

acoustic emotion recognition: raising the benchmarks. In: Proceedings of ICASSP, Prague, pp

5688-5691

Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features

with denoising autoencoders. In: Proceedings of ICML, New York, NY, 2008, pp 1096-1103

Vinciarelli A, Dielmann A, Favre S, Salamin H (2009) Canal9: a database of political debates

for analysis of social interactions. In: Proceedings of the international conference on affective

computing and intelligent interaction, Sept 2009, pp 1-4

Vinciarelli A, Pantic M, Bourlard H (2009) Social signal processing: survey of an emerging

domain. Image Vis Comput 27(12):1743-1759

Waibel A, Hanazawa T, Hinton G, Shikano K, Lang K (1989) Phoneme recognition using time-

delay neural networks. IEEE Trans Acoust Speech Signal Process 37(3):328-339

Wang N, Melchior J, Wiskott L (2012) An analysis of Gaussian-binary restricted Boltzmann

machines for natural images. In: Proceedings of ESANN, Bruges, Apr 2012, pp 287-292

Wrede B, Shriberg E (2003) Spotting “hot spots” in meetings: human judgments and prosodic

cues. In: Proceedings of Eurospeech, ISCA, Geneva, Sept 2003, pp 2805-2808

Yamamoto K, Asano F, Yamada T, Kitawaki N (2006) Detection of overlapping speech in meetings

using support vector machines and support vector regression. IEICE Trans Fundam Electron

Commun Comput Sci 89-A(8):2158-2165

Zeiler M, Ranzato M, Monga R, Mao M, Yang K, Le QV, Nguyen P, Senior A, Vanhoucke V,

Dean J, Hinton G (2013) On rectified linear units for speech processing. In: ICASSP, IEEE,

Vancouver, May 2013, pp 3517-3521

Zelenák M, Hernando J (2011) The detection of overlapping speech with prosodic features for

speaker diarization. In: Proceedings of interspeech, ISCA, Florence, Aug 2011, pp 1041-1044

Conflict and Multimodal Communication: Social Research and Machine Intelligence

Search WWH ::

Custom Search

Home