A Preliminary Study of Emotion Recognition Employing Adaptive Gaussian Mixture Models with the Maximum A Posteriori Principle

被引:0
|
作者
Yang, Jing-Hsiang [1 ]
Hung, Jeih-weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
来源
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 | 2014年
关键词
emotion recognition; MFCC; PLPCC; GMM; MAP adaptation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel processing structure to improve the performance of the automatic speech emotion recognition. In this structure, the Gaussian mixture model (GMM) is first created for each type of emotions with speech features in the training set, which consists of the utterances produced by several speakers. Next, the emotion GMMs are further adapted via a portion of the speaker-specific data in the training set using the maximum a posteriori (MAP) criterion, and thus the resulting new GMMs are expected to be better-suited for the testing utterances produced by the specific speaker in emotion recognition in comparison with the original speaker-independent GMMs. Experimental results show that after MAP adaptation for the GMMs, the emotion recognition accuracy can be improved significantly irrespective of the selected speech feature types being mel-frequency cepstral coefficients (MFCC) or perceptual linear predictive cepstral coefficients (PLPCC).
引用
收藏
页码:1575 / +
页数:2
相关论文
共 50 条
  • [41] Adaptive Background Update Based on Mixture Models of Gaussian
    Wang, Feng
    Dai, Shuguang
    ICIA: 2009 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-3, 2009, : 325 - 328
  • [42] Combining Gaussian mixture models and segmental feature models for speaker recognition
    Milosevic, Milana
    Glavitsch, Ulrike
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2042 - 2043
  • [43] ACCURATE SPEAKER RECOGNITION BASED ON ADAPTIVE GAUSSIAN MIXTURE MODEL
    Wang Yunqi
    Yu Yibiao
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 527 - 531
  • [44] Study on the Relevance Factor of Maximum a Posteriori with GMM for Language Recognition
    You, Chang Huai
    Li, Haizhou
    Lee, Kong Aik
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2904 - 2907
  • [45] Self-Adaptive Multi-Sensor Activity Recognition Systems Based on Gaussian Mixture Models
    Jaenicke, Martin
    Sick, Bernhard
    Tomforde, Sven
    INFORMATICS-BASEL, 2018, 5 (03):
  • [46] Employing adaptive functions and maximum entropy principle for nonlinear blind source deconvolution
    Corinti, E
    Amadio, V
    Tummarello, G
    Piazza, F
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1458 - 1463
  • [47] Emotion Recognition Using Hybrid Gaussian Mixture Model and Deep Neural Network
    Shahin, Ismail
    Nassif, Ali Bou
    Hamsa, Shibani
    IEEE ACCESS, 2019, 7 : 26777 - 26787
  • [48] Cascaded projection of Gaussian mixture model for emotion recognition in speech and ECG signals
    Huang, Chengwei
    Wu, Di
    Zhang, Xiaojun
    Xiao, Zhongzhe
    Xu, Yishen
    Ji, Jingjing
    Tao, Zhi
    Zhao, Li
    Journal of Southeast University (English Edition), 2015, 31 (03) : 320 - 326
  • [49] Gaussian mixture model based estimation of the neutral face shape for emotion recognition
    Ulukaya, Sezer
    Erdem, Cigdem Eroglu
    DIGITAL SIGNAL PROCESSING, 2014, 32 : 11 - 23
  • [50] Emotion Recognition from Speech using Gaussian Mixture Model and Vector Quantization
    Agrawal, Surabhi
    Dongaonkar, Shabda
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,