Multi-Taper Spectral Features for Emotion Recognition from Speech

被引:0
|
作者
Chapaneri, Santosh V. [1 ]
Jayaswal, Deepak D. [1 ]
机构
[1] Univ Mumbai, St Francis Inst Technol, Dept Elect & Telecommun Engn, Mumbai, Maharashtra, India
关键词
Emotion; Multi-taper; Pattern recognition; SVM; MFCC;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the performance of multi-taper spectral estimate is investigated relative to conventional single taper estimate for the application of emotion recognition from speech signals. Typically, a single taper/window helps in reducing bias of the estimate, but due to its high variance, the resulting spectral features tend to give poor recognition performance. The weighted averages of the multi-tapered uncorrelated eigenspectra results in more discriminative spectral features, thus increasing the overall performance. We demonstrate that the application of six Multi-peak multi-tapers with support vector machine results in 81 % classification accuracy on seven emotions from Berlin emotion database considering only spectral features, compared to 72% using conventional Hamming window method.
引用
收藏
页码:1044 / 1049
页数:6
相关论文
共 50 条
  • [11] Automatic speech emotion recognition using modulation spectral features
    Wu, Siqing
    Falk, Tiago H.
    Chan, Wai-Yip
    SPEECH COMMUNICATION, 2011, 53 (05) : 768 - 785
  • [12] Speech emotion recognition using multi resolution Hilbert transform based spectral and entropy features
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    APPLIED ACOUSTICS, 2025, 229
  • [13] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [14] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [15] Multi-taper spectral analysis of stimulation artifact and epileptiform seizure entrainment data
    Chernyy, Nick
    Sunderam, S.
    Mason, J.
    Weinstein, S. L.
    Schiff, S. J.
    Gluckman, B. J.
    EPILEPSIA, 2007, 48 : 309 - 309
  • [16] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
    Zhou, Yu
    Li, Junfeng
    Sun, Yanqing
    Zhang, Jianping
    Yan, Yonghong
    Akagi, Masato
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
  • [17] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
    Gaurav, Manish
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
  • [18] GMM supervector based SVM with spectral features for speech emotion recognition
    Hu, Hao
    Xu, Ming-Xing
    Wu, Wei
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
  • [19] Fusion of Global Statistical and Segmental Spectral Features for Speech Emotion Recognition
    Hu, Hao
    Xu, Ming-Xing
    Wu, Wei
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1013 - 1016
  • [20] Deep Learning Algorithms for Speech Emotion Recognition with Hybrid Spectral Features
    Kogila R.
    Sadanandam M.
    Bhukya H.
    SN Computer Science, 5 (1)