Digital Signal Processing Reference
In-Depth Information
Table 1.2 Continued
￿
The combination of evidences from different features is proved to perform better for many
speech tasks
This may be due to supplementary or complementary evidences provided by different
features. Hence, in this work the following combination of features may be explored to
study emotion recognition performance
Excitation and spectral features
Spectral and prosodic features
Excitation and prosodic features
Excitation, spectral and prosodic features
￿
Multilevel classification systems provide better classification over single level classification
Two level emotion classification system is proposed
At the first level all the emotions are divided into few broad groups, where similar
(confusable) emotions are placed in different groups
At the second level emotions in broad groups are further classified
￿
The ultimate goal of any speech emotion recognition system is to process real life emotions
Combination of different features may be used for real-life emotion recognition
Hindi movie clips may be used to represent real life emotions
￿
Chapter 2 : Robust Emotion Recognition using Pitch Synchronous and Sub-
syllabic Spectral Features provides the details about the extraction of spectral
features from sub-syllabic regions such as consonant, vowel, and consonant-vowel
(CV) transition regions. Extraction of spectral features from pitch synchronous
analysis is also explained. Development of emotion recognition systems using
Gaussian mixture models is discussed.
￿
Chapter 3 : Robust Emotion Recognition using Sentence, Word and Syllable
Level Prosodic Features discusses in detail about the use of global and local
prosodic features for developing emotion recognition systems. Global (static) and
local (dynamic) prosodic features extracted from sentences, words, and syllables
are proposed for classifying the speech emotions. The contribution of prosodic
features from different speech regions (initial, middle, and final) is also analyzed
using local and global features. For capturing emotion-specific prosody from the
proposed features, support vector machine models are used.
￿
Chapter 4 : Robust Emotion Recognition using a Combination of Excitation
Source, Spectral and Prosodic Features discusses the combination of com-
plementary and supplementary evidences provided by the source, system, and
prosodic features for improving the emotion recognition performance. This chapter
provides emotion recognition performance studies for various combinations of
features. Here, evidences from various features are combined using optimal linear
weighted combination scheme.
￿
Chapter 5 : Robust Emotion Recognition using Speaking Rate Features
discusses the development of two stage emotion recognition system based on
speaking rate. In this case initially emotions are classified into broad categories
 
Search WWH ::




Custom Search