Emotion Recognition in Speech Using MFCC with SVM, DSVM and Auto-encoder

被引:0
|
作者
Aouani, Hadhami [1 ]
Ben Ayed, Yassine [2 ]
机构
[1] ISIMS Univ Sfax, Higher Inst Comp Sci & Multimedia, Sfax, Tunisia
[2] MIRACL Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia
关键词
Emotion recognition; MFCC; SVM; Deep Support Vector Machine; Basic auto-encoder; Stacked Auto-encoder;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotions recognition from speech is one of the most important sub domains in the field of signal processing. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: 39 Mel-frequency Cepstral Coefficient (MFCC) coefficients and 65 MFCC features extracted based on the work of [20]. Secondly, we use the Support Vector Machine (SVM) as the main classifier engine since it is the most common technique in the field of speech recognition. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning, as well as the various types of auto-encoder (the basic auto-encoder and the stacked auto-encoder). A large set of experiments are conducted on the SAVEE audio database. The experimental results show that DSVM method outperforms the standard SVM with a classification rate of 69.84% and 68.25% using 39 MFCC, respectively. Additionally, the auto-encoder method outperforms the standard SVM, yielding a classification rate of 73.01%.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Speech Emotion Recognition using MFCC and Hybrid Neural Networks
    Badr, Youakim
    Mukherjee, Partha
    Thumati, Sindhu
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 366 - 373
  • [22] Script Selection Using Convolutional Auto-encoder for TTS Speech Corpus
    Shamsi, Meysam
    Lolive, Damien
    Barbot, Nelly
    Chevelu, Jonathan
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 423 - 432
  • [23] Speech Emotion Recognition using MFCC features and LSTM network
    Kumbhar, Harshawardhan S.
    Bhandari, Sheetal U.
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [24] Development of Speech Emotion Recognition Algorithm using MFCC and Prosody
    Koo, Hyejin
    Jeong, Soycong
    Yoon, Sungjae
    Kim, Wonjong
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [25] Sound based Human Emotion Recognition using MFCC & Multiple SVM
    Sonawane, Anagha
    Inamdar, M. U.
    Bhangale, Kishor B.
    2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION, INSTRUMENTATION AND CONTROL (ICICIC), 2017,
  • [26] Speech Emotion Recognition Based on Improved MFCC
    Wang, Yan
    Hu, Weiping
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [27] Emotion Recognition and Regulation Based on Stacked Sparse Auto-Encoder Network and Personalized Reconfigurable Music
    Li, Yinsheng
    Zheng, Wei
    MATHEMATICS, 2021, 9 (06) : 1 - 18
  • [28] Automatic detection of arrhythmias from an ECG signal using an auto-encoder and SVM classifier
    Ojha, Manoj Kumar
    Wadhwani, Sulochna
    Wadhwani, Arun Kumar
    Shukla, Anupam
    PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2022, 45 (02) : 665 - 674
  • [29] Speech emotion recognition using MFCC-based entropy feature
    Siba Prasad Mishra
    Pankaj Warule
    Suman Deb
    Signal, Image and Video Processing, 2024, 18 : 153 - 161
  • [30] Emotion Recognition from Speech Using MFCC and DWT for Security System
    Saste, Sonali T.
    Jagdale, S. M.
    2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, 2017, : 701 - 704