EMOTION RECOGNITION FROM SPEECH VIA BOOSTED GAUSSIAN MIXTURE MODELS

被引：0

作者：

Tang, Hao ^{[1
]}

Chu, Stephen M. ^{[2
]}

Hasegawa-Johnson, Mark ^{[1
]}

Huang, Thomas S. ^{[1
]}

机构：

[1] Univ Illinois, Dept Elect & Comp Engn, 1406 W Green St, Urbana, IL 61801 USA

[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA

来源：

ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3 | 2009年

关键词：

Emotion recognition; Gaussian mixture model; Bayesian optimal classifier; EM algorithm; boosting;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set Then, classification is performed to minimize the classification error w.r.t. the estimated class-conditional distributions. We call this method the EM-GMM algorithm. In this paper, we introduce a boosting algorithm for reliably and accurately estimating the class-conditional GMMs. The resulting algorithm is named the Boosted-GMM algorithm. Our speech emotion recognition experiments show that the emotion recognition rates are effectively and significantly "boosted" by the Boosted-GMM algorithm as compared to the EM-GMM algorithm. This is due to the fact that the boosting algorithm can lead to more accurate estimates of the class-conditional GMMs, namely the class-conditional distributions of acoustic features.

引用

页码：294 / +

页数：2

共 50 条

[21] Cascaded projection of Gaussian mixture model for emotion recognition in speech and ECG signals
Huang, Chengwei
Wu, Di
Zhang, Xiaojun
Xiao, Zhongzhe
Xu, Yishen
Ji, Jingjing
Tao, Zhi
Zhao, Li
Journal of Southeast University (English Edition), 2015, 31 (03) : 320 - 326
[22] Anchor Models for Emotion Recognition from Speech
Attabi, Yazid
Dumouchel, Pierre
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (03) : 280 - 290
[23] Efficient Gaussian mixture for speech recognition
Zouari, Leila
Chollet, Gerard
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 294 - +
[24] STRANDED GAUSSIAN MIXTURE HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
Zhao, Yong
Juang, Biing-Hwang
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4301 - 4304
[25] Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
Axelrod, Scott
Goel, Vaibhava
Gopinath, Ramesh
Olsen, Peder
Visweswariah, Karthik
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 172 - 189
[26] Noise Compensation for Speech Recognition Using Subspace Gaussian Mixture Models
Bouallegue, Mohamed
Rouvier, Mickael
Matrouf, Driss
Linares, Georges
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 318 - 321
[27] A SIMPLIFIED SUBSPACE GAUSSIAN MIXTURE TO COMPACT ACOUSTIC MODELS FOR SPEECH RECOGNITION
Bouallegue, Mohamed
Matrouf, Driss
Linares, Georges
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4896 - 4899
[28] i-vector Algorithm with Gaussian Mixture Model for Efficient Speech Emotion Recognition
Gomes, Joan
El-Sharkawy, Mohamed
2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 476 - 480
[29] MULTILINGUAL ACOUSTIC MODELING FOR SPEECH RECOGNITION BASED ON SUBSPACE GAUSSIAN MIXTURE MODELS
Burget, Lukas
Schwarz, Petr
Agarwal, Mohit
Akyazi, Pinar
Feng, Kai
Ghoshal, Arnab
Glembek, Ondrej
Goel, Nagendra
Karafiat, Martin
Povey, Daniel
Rastrow, Ariya
Rose, Richard C.
Thomas, Samuel
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4334 - 4337
[30] Discriminative training of Gaussian mixture models for large vocabulary speech recognition systems
Bahl, LR
Padmanabhan, M
Nahamoo, D
Gopalakrishnan, PS
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 613 - 616

← 1 2 3 4 5 →