EMOTION RECOGNITION FROM SPEECH VIA BOOSTED GAUSSIAN MIXTURE MODELS

被引:0
|
作者
Tang, Hao [1 ]
Chu, Stephen M. [2 ]
Hasegawa-Johnson, Mark [1 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, 1406 W Green St, Urbana, IL 61801 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
Emotion recognition; Gaussian mixture model; Bayesian optimal classifier; EM algorithm; boosting;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set Then, classification is performed to minimize the classification error w.r.t. the estimated class-conditional distributions. We call this method the EM-GMM algorithm. In this paper, we introduce a boosting algorithm for reliably and accurately estimating the class-conditional GMMs. The resulting algorithm is named the Boosted-GMM algorithm. Our speech emotion recognition experiments show that the emotion recognition rates are effectively and significantly "boosted" by the Boosted-GMM algorithm as compared to the EM-GMM algorithm. This is due to the fact that the boosting algorithm can lead to more accurate estimates of the class-conditional GMMs, namely the class-conditional distributions of acoustic features.
引用
收藏
页码:294 / +
页数:2
相关论文
共 50 条
  • [21] Cascaded projection of Gaussian mixture model for emotion recognition in speech and ECG signals
    Huang, Chengwei
    Wu, Di
    Zhang, Xiaojun
    Xiao, Zhongzhe
    Xu, Yishen
    Ji, Jingjing
    Tao, Zhi
    Zhao, Li
    Journal of Southeast University (English Edition), 2015, 31 (03) : 320 - 326
  • [22] Anchor Models for Emotion Recognition from Speech
    Attabi, Yazid
    Dumouchel, Pierre
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (03) : 280 - 290
  • [23] Efficient Gaussian mixture for speech recognition
    Zouari, Leila
    Chollet, Gerard
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 294 - +
  • [24] STRANDED GAUSSIAN MIXTURE HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
    Zhao, Yong
    Juang, Biing-Hwang
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4301 - 4304
  • [25] Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    Axelrod, Scott
    Goel, Vaibhava
    Gopinath, Ramesh
    Olsen, Peder
    Visweswariah, Karthik
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 172 - 189
  • [26] Noise Compensation for Speech Recognition Using Subspace Gaussian Mixture Models
    Bouallegue, Mohamed
    Rouvier, Mickael
    Matrouf, Driss
    Linares, Georges
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 318 - 321
  • [27] A SIMPLIFIED SUBSPACE GAUSSIAN MIXTURE TO COMPACT ACOUSTIC MODELS FOR SPEECH RECOGNITION
    Bouallegue, Mohamed
    Matrouf, Driss
    Linares, Georges
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4896 - 4899
  • [28] i-vector Algorithm with Gaussian Mixture Model for Efficient Speech Emotion Recognition
    Gomes, Joan
    El-Sharkawy, Mohamed
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 476 - 480
  • [29] MULTILINGUAL ACOUSTIC MODELING FOR SPEECH RECOGNITION BASED ON SUBSPACE GAUSSIAN MIXTURE MODELS
    Burget, Lukas
    Schwarz, Petr
    Agarwal, Mohit
    Akyazi, Pinar
    Feng, Kai
    Ghoshal, Arnab
    Glembek, Ondrej
    Goel, Nagendra
    Karafiat, Martin
    Povey, Daniel
    Rastrow, Ariya
    Rose, Richard C.
    Thomas, Samuel
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4334 - 4337
  • [30] Discriminative training of Gaussian mixture models for large vocabulary speech recognition systems
    Bahl, LR
    Padmanabhan, M
    Nahamoo, D
    Gopalakrishnan, PS
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 613 - 616