Relative Entropy Normalized Gaussian Supervector for Speech Emotion Recognition using Kernel Extreme Learning Machine

被引:0
|
作者
Li, Ruru [1 ]
Yang, Dali [1 ]
Li, Xinxing [2 ]
Wang, Renyu [3 ]
Xu, Mingxing [2 ]
Zheng, Thomas Fang [2 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Comp Sci, Beijing, Peoples R China
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Minist Educ, Key Lab Pervas Comp,Dept Comp Sci & Technol, Beijing, Peoples R China
[3] Tsinghua Univ, Res Inst Informat Technol, Ctr Speech & Language Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech emotion recognition is a challenging and significant task. On the one hand, the emotion features need to be robust enough to capture the emotion information, and while on the other, machine learning algorithms need to be insensitive to model the utterance. In this paper, we presented a novel framework of speech emotion recognition to address the two above-mentioned challenges. Relative Entropy based Normalization (REN) was proposed to normalize the supervectors of Gaussian Mixture Model-Universal Background Model (GMM-UBM) as the features to emotions. The Kernel Extreme Learning Machine (KELM) was adopted as the classifier to identify the emotion represented by the normalized supervectors. Experimental results on the EMR 1309 corpus showed the proposed framework outperformed the state-of-the-art i-vector based systems.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Exploration of Complementary Features for Speech Emotion Recognition Based on Kernel Extreme Learning Machine
    Guo, Lili
    Wang, Longbiao
    Dang, Jianwu
    Liu, Zhilei
    Guan, Haotian
    IEEE ACCESS, 2019, 7 : 75798 - 75809
  • [2] Improved Emotion Recognition Using Gaussian Mixture Model and Extreme Learning Machine in Speech and Glottal Signals
    Muthusamy, Hariharan
    Polat, Kemal
    Yaacob, Sazali
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [3] Speech Emotion Recognition Using Deep Neural Network and Extreme Learning Machine
    Han, Kun
    Yu, Dong
    Tashev, Ivan
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 223 - 227
  • [4] Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
    Xu, Xinzhou
    Deng, Jun
    Coutinho, Eduardo
    Wu, Chen
    Zhao, Li
    Schuller, Bjoern W.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (03) : 795 - 808
  • [5] Speech emotion recognition using optimized genetic algorithm-extreme learning machine
    Musatafa Abbas Abbood Albadr
    Sabrina Tiun
    Masri Ayob
    Fahad Taha AL-Dhief
    Khairuddin Omar
    Mhd Khaled Maen
    Multimedia Tools and Applications, 2022, 81 : 23963 - 23989
  • [6] Speech emotion recognition using optimized genetic algorithm-extreme learning machine
    Albadr, Musatafa Abbas Abbood
    Tiun, Sabrina
    Ayob, Masri
    AL-Dhief, Fahad Taha
    Omar, Khairuddin
    Maen, Mhd Khaled
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 23963 - 23989
  • [7] SER: Speech Emotion Recognition Application Based on Extreme Learning Machine
    Ainurrochman
    Febriansyah, Irfanur Ilham
    Yuhana, Umi Laili
    PROCEEDINGS OF 2021 13TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2021, : 179 - 183
  • [8] Emotion recognition in election day tweets using optimised kernel extreme learning machine classifier
    Kavitha, N. S. B. D.
    Reddy, Prasad P. V. G. D.
    Rao, K. Venkata
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (02) : 289 - 307
  • [9] Speech Emotion Recognition Based on Gaussian Kernel nonlinear Proximal Support Vector Machine
    Han, Zhiyan
    Wang, Jian
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 2513 - 2516
  • [10] Emotion Recognition On Speech Signals Using Machine Learning
    Ghai, Mohan
    Lal, Shamit
    Duggal, Shivam
    Manik, Shrey
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND COMPUTATIONAL INTELLIGENCE (ICBDAC), 2017, : 34 - 39