Emotion Recognition in Speech Using MFCC with SVM, DSVM and Auto-encoder

被引:0
|
作者
Aouani, Hadhami [1 ]
Ben Ayed, Yassine [2 ]
机构
[1] ISIMS Univ Sfax, Higher Inst Comp Sci & Multimedia, Sfax, Tunisia
[2] MIRACL Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia
关键词
Emotion recognition; MFCC; SVM; Deep Support Vector Machine; Basic auto-encoder; Stacked Auto-encoder;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotions recognition from speech is one of the most important sub domains in the field of signal processing. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: 39 Mel-frequency Cepstral Coefficient (MFCC) coefficients and 65 MFCC features extracted based on the work of [20]. Secondly, we use the Support Vector Machine (SVM) as the main classifier engine since it is the most common technique in the field of speech recognition. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning, as well as the various types of auto-encoder (the basic auto-encoder and the stacked auto-encoder). A large set of experiments are conducted on the SAVEE audio database. The experimental results show that DSVM method outperforms the standard SVM with a classification rate of 69.84% and 68.25% using 39 MFCC, respectively. Additionally, the auto-encoder method outperforms the standard SVM, yielding a classification rate of 73.01%.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Automatic detection of arrhythmias from an ECG signal using an auto-encoder and SVM classifier
    Manoj Kumar Ojha
    Sulochna Wadhwani
    Arun Kumar Wadhwani
    Anupam Shukla
    Physical and Engineering Sciences in Medicine, 2022, 45 : 665 - 674
  • [32] Speech emotion recognition by using complex MFCC and deep sequential model
    Patnaik, Suprava
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 11897 - 11922
  • [33] Speech emotion recognition using MFCC-based entropy feature
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 153 - 161
  • [34] Speech emotion recognition by using complex MFCC and deep sequential model
    Suprava Patnaik
    Multimedia Tools and Applications, 2023, 82 : 11897 - 11922
  • [35] Speech Emotion Recognition using SVM with thresholding fusion
    Gupta, Shilpi
    Mehra, Anu
    Vinay
    2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 570 - 574
  • [36] SAfER: Simplified Auto-encoder for (Anomalous) Event Recognition
    Perera, Yuvin
    Batista, Gustavo
    Hu, Wen
    Kanhere, Salil
    Jha, Sanjay
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 229 - 233
  • [37] Application of Sparse auto-encoder in Handwritten Digit Recognition
    Zhou, Kaihong
    Qiao, Xinxin
    Shi, Jingkai
    ISBDAI '18: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON BIG DATA AND ARTIFICIAL INTELLIGENCE, 2018, : 5 - 8
  • [38] Robust Deep Auto-encoder for Occluded Face Recognition
    Cheng, Lele
    Wang, Jinjun
    Gong, Yihong
    Hou, Qiqi
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1099 - 1102
  • [39] A Weighted Denoising Auto-Encoder Applied to Mel Sub-Bands for Robust Speech Recognition
    Baniardalan, Faezeh
    Akbari, Ahmad
    Nasersharif, Babak
    2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 38 - 42
  • [40] Real Time Speech Recognition based on PWP Thresholding and MFCC using SVM
    Helali, Wafa
    Hajaiej, Zied
    Cherif, Adnen
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (05) : 6204 - 6208