A Preliminary Study of Emotion Recognition Employing Adaptive Gaussian Mixture Models with the Maximum A Posteriori Principle

Cited by: 0
Authors
Yang, Jing-Hsiang [1]
Hung, Jeih-weih [1]
Affiliations
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
Source
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 | 2014
Keywords
emotion recognition; MFCC; PLPCC; GMM; MAP adaptation
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we present a novel processing structure that improves the performance of automatic speech emotion recognition. In this structure, a Gaussian mixture model (GMM) is first trained for each emotion type using speech features from the training set, which consists of utterances produced by several speakers. Next, the emotion GMMs are adapted with a portion of the speaker-specific data in the training set under the maximum a posteriori (MAP) criterion. The resulting GMMs are expected to suit the test utterances of that specific speaker better than the original speaker-independent GMMs. Experimental results show that MAP adaptation of the GMMs significantly improves emotion recognition accuracy, regardless of whether the speech features are mel-frequency cepstral coefficients (MFCC) or perceptual linear predictive cepstral coefficients (PLPCC).
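The adaptation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes diagonal-covariance GMMs, adapts only the component means, and uses the relevance-factor form of MAP adaptation common in speaker recognition. The function names, the `relevance` parameter and its default of 16 are all hypothetical choices made for this example; the record does not say which parameters the paper adapts.

```python
import numpy as np

def _log_components(X, weights, means, variances):
    """Log of weight * diagonal-Gaussian density, per frame and component: shape (T, K)."""
    D = X.shape[1]
    diff = X[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (D * np.log(2.0 * np.pi)
                        + np.sum(np.log(variances), axis=1)[None, :]
                        + np.sum(diff ** 2 / variances[None, :, :], axis=2))
    return np.log(weights)[None, :] + log_gauss

def map_adapt_means(X, weights, means, variances, relevance=16.0):
    """MAP-adapt the GMM means toward speaker data X (assumed variant: means only)."""
    log_p = _log_components(X, weights, means, variances)
    log_p -= log_p.max(axis=1, keepdims=True)          # numerical stability
    gamma = np.exp(log_p)
    gamma /= gamma.sum(axis=1, keepdims=True)          # component posteriors, (T, K)
    n_k = gamma.sum(axis=0)                            # soft frame counts per component
    data_mean = gamma.T @ X / np.maximum(n_k, 1e-10)[:, None]
    alpha = (n_k / (n_k + relevance))[:, None]         # more data -> trust data more
    return alpha * data_mean + (1.0 - alpha) * means

def avg_log_likelihood(X, weights, means, variances):
    """Average per-frame log-likelihood of X under the GMM."""
    log_p = _log_components(X, weights, means, variances)
    m = log_p.max(axis=1, keepdims=True)
    return float(np.mean(m[:, 0] + np.log(np.exp(log_p - m).sum(axis=1))))

def classify(X, models):
    """Pick the emotion whose GMM gives the highest average log-likelihood.

    `models` maps an emotion label to a (weights, means, variances) tuple.
    """
    return max(models, key=lambda e: avg_log_likelihood(X, *models[e]))
```

In this sketch, each speaker-independent emotion GMM would be passed through `map_adapt_means` with that speaker's adaptation frames (MFCC or PLPCC vectors), and `classify` would then score a test utterance against the adapted models. Components that see few adaptation frames stay close to their original means, which is the usual motivation for the relevance factor.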
Pages: 1575 / +
Page count: 2
Related papers
50 in total
  • [1] MAXIMUM A POSTERIORI ADAPTATION OF SUBSPACE GAUSSIAN MIXTURE MODELS FOR CROSS-LINGUAL SPEECH RECOGNITION
    Lu, Liang
    Ghoshal, Arnab
    Renals, Steve
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4877 - 4880
  • [2] Variational Gaussian Mixture Models for Speech Emotion Recognition
    Mishra, Harendra Kumar
    Sekhar, C. Chandra
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 183 - 186
  • [3] COMPARING MAXIMUM A POSTERIORI VECTOR QUANTIZATION AND GAUSSIAN MIXTURE MODELS IN SPEAKER VERIFICATION
    Kinnunen, Tomi
    Saastamoinen, Juhani
    Hautamaki, Ville
    Vinni, Mikko
    Franti, Pasi
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4229 - 4232
  • [4] Comparative evaluation of maximum a Posteriori vector quantization and gaussian mixture models in speaker verification
    Kinnunen, Tomi
    Saastamoinen, Juhani
    Hautamaki, Ville
    Vinni, Mikko
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2009, 30 (04) : 341 - 347
  • [5] Speech emotion recognition using Gaussian mixture vector autoregressive models
    El Ayadi, Moataz M. H.
    Kamel, Mohamed S.
    Karray, Fakhri
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 957 - +
  • [6] EMOTION RECOGNITION FROM SPEECH VIA BOOSTED GAUSSIAN MIXTURE MODELS
    Tang, Hao
    Chu, Stephen M.
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 294 - +
  • [7] An adaptive algorithm for target recognition using Gaussian mixture models
    Xue, Wenling
    Jiang, Ting
    MEASUREMENT, 2018, 124 : 233 - 240
  • [8] Speech Emotion Recognition based on Gaussian Mixture Models and Deep Neural Networks
    Tashev, Ivan J.
    Wang, Zhong-Qiu
    Godin, Keith
    2017 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2017,
  • [9] GAUSSIAN MIXTURE MODELS WITH CLASS-DEPENDENT FEATURES FOR SPEECH EMOTION RECOGNITION
    Iriya, Rafael
    Ramirez, Miguel Arjona
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 480 - 483
  • [10] Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation
    Chehrehsa, Sarang
    Moir, Tom James
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 58 - 71