A Preliminary Study of Emotion Recognition Employing Adaptive Gaussian Mixture Models with the Maximum A Posteriori Principle

被引:0
|
作者
Yang, Jing-Hsiang [1 ]
Hung, Jeih-weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
来源
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 | 2014年
关键词
emotion recognition; MFCC; PLPCC; GMM; MAP adaptation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel processing structure to improve the performance of the automatic speech emotion recognition. In this structure, the Gaussian mixture model (GMM) is first created for each type of emotions with speech features in the training set, which consists of the utterances produced by several speakers. Next, the emotion GMMs are further adapted via a portion of the speaker-specific data in the training set using the maximum a posteriori (MAP) criterion, and thus the resulting new GMMs are expected to be better-suited for the testing utterances produced by the specific speaker in emotion recognition in comparison with the original speaker-independent GMMs. Experimental results show that after MAP adaptation for the GMMs, the emotion recognition accuracy can be improved significantly irrespective of the selected speech feature types being mel-frequency cepstral coefficients (MFCC) or perceptual linear predictive cepstral coefficients (PLPCC).
引用
收藏
页码:1575 / +
页数:2
相关论文
共 50 条
  • [31] Subspace constrained Gaussian mixture models for speech recognition
    Axelrod, S
    Goel, V
    Gopinath, RA
    Olsen, PA
    Visweswariah, K
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06): : 1144 - 1160
  • [32] Regularized Subspace Gaussian Mixture Models for Speech Recognition
    Lu, Liang
    Ghoshal, Arnab
    Renals, Steve
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (07) : 419 - 422
  • [33] Pattern Recognition with Gaussian Mixture Models of Marginal Distributions
    Omachi, Masako
    Omachi, Shinichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (02): : 317 - 324
  • [34] MUSIC EMOTION RECOGNITION WITH ADAPTIVE AGGREGATION OF GAUSSIAN PROCESS REGRESSORS
    Fukayama, Satoru
    Goto, Masataka
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 71 - 75
  • [35] Gaussian Mixture Models Reduction by Variational Maximum Mutual Information
    Bar-Yosef, Yossi
    Bistritz, Yuval
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (06) : 1557 - 1569
  • [36] Recursive Gaussian Mixture Models for Adaptive Process Monitoring
    Zheng, Junhua
    Wen, Qiaojun
    Song, Zhihuan
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (16) : 6551 - 6561
  • [37] Learning mixture models with the regularized latent maximum entropy principle
    Wang, SJ
    Schuurmans, D
    Peng, FC
    Zhao, YX
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (04): : 903 - 916
  • [38] Improved Gaussian Mixture Models for Adaptive Foreground Segmentation
    Katsarakis, Nikolaos
    Pnevmatikakis, Aristodemos
    Tan, Zheng-Hua
    Prasad, Ramjee
    WIRELESS PERSONAL COMMUNICATIONS, 2016, 87 (03) : 629 - 643
  • [39] Noise adaptive training for subspace Gaussian mixture models
    Lu, Liang
    Ghoshal, Arnab
    Renals, Steve
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3459 - 3463
  • [40] Improved Gaussian Mixture Models for Adaptive Foreground Segmentation
    Nikolaos Katsarakis
    Aristodemos Pnevmatikakis
    Zheng-Hua Tan
    Ramjee Prasad
    Wireless Personal Communications, 2016, 87 : 629 - 643