Relative Entropy Normalized Gaussian Supervector for Speech Emotion Recognition using Kernel Extreme Learning Machine

被引:0
|
作者
Li, Ruru [1 ]
Yang, Dali [1 ]
Li, Xinxing [2 ]
Wang, Renyu [3 ]
Xu, Mingxing [2 ]
Zheng, Thomas Fang [2 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Comp Sci, Beijing, Peoples R China
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Minist Educ, Key Lab Pervas Comp,Dept Comp Sci & Technol, Beijing, Peoples R China
[3] Tsinghua Univ, Res Inst Informat Technol, Ctr Speech & Language Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech emotion recognition is a challenging and significant task. On the one hand, the emotion features need to be robust enough to capture the emotion information, and while on the other, machine learning algorithms need to be insensitive to model the utterance. In this paper, we presented a novel framework of speech emotion recognition to address the two above-mentioned challenges. Relative Entropy based Normalization (REN) was proposed to normalize the supervectors of Gaussian Mixture Model-Universal Background Model (GMM-UBM) as the features to emotions. The Kernel Extreme Learning Machine (KELM) was adopted as the classifier to identify the emotion represented by the normalized supervectors. Experimental results on the EMR 1309 corpus showed the proposed framework outperformed the state-of-the-art i-vector based systems.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Speech emotion recognition using multimodal feature fusion with machine learning approach
    Panda, Sandeep Kumar
    Jena, Ajay Kumar
    Panda, Mohit Ranjan
    Panda, Susmita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42763 - 42781
  • [32] Speech emotion recognition using multimodal feature fusion with machine learning approach
    Sandeep Kumar Panda
    Ajay Kumar Jena
    Mohit Ranjan Panda
    Susmita Panda
    Multimedia Tools and Applications, 2023, 82 : 42763 - 42781
  • [33] Machine learning techniques for speech emotion recognition using paralinguistic acoustic features
    Jha T.
    Kavya R.
    Christopher J.
    Arunachalam V.
    International Journal of Speech Technology, 2022, 25 (03): : 707 - 725
  • [34] Reduced Kernel Extreme Learning Machine for Traffic Sign Recognition
    Sanz-Madoz, E.
    Echanobe, J.
    Mata-Carballeira, O.
    del Campo, I.
    Martinez, M. V.
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 4101 - 4106
  • [35] Speech Emotion Recognition Based on Deep Learning and Kernel Nonlinear PSVM
    Han Zhiyan
    Wang Jian
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1426 - 1430
  • [36] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [37] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [38] Speech Emotion Recognition Using Transfer Learning
    Song, Peng
    Jin, Yun
    Zhao, Li
    Xin, Minghai
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09): : 2530 - 2532
  • [39] Cross-person activity recognition using reduced kernel extreme learning machine
    Deng, Wan-Yu
    Zheng, Qing-Hua
    Wang, Zhong-Min
    NEURAL NETWORKS, 2014, 53 : 1 - 7
  • [40] Speech Emotion Recognition Using Support Vector Machine
    Al Zoubi, Rouaa
    Turky, Ayad
    Foufou, Sebti
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 7, 2024, 1003 : 519 - 532