Multi-Taper Spectral Features for Emotion Recognition from Speech

Cited by: 0

Authors:

Chapaneri, Santosh V. [1]

Jayaswal, Deepak D. [1]

Affiliations:

[1] Univ Mumbai, St Francis Inst Technol, Dept Elect & Telecommun Engn, Mumbai, Maharashtra, India

Source:

2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC) | 2015

Keywords:

Emotion; Multi-taper; Pattern recognition; SVM; MFCC;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In this paper, the performance of the multi-taper spectral estimate is investigated relative to the conventional single-taper estimate for emotion recognition from speech signals. Typically, a single taper (window) helps reduce the bias of the estimate, but because of its high variance, the resulting spectral features tend to give poor recognition performance. The weighted average of the uncorrelated multi-taper eigenspectra results in more discriminative spectral features, thus increasing the overall performance. We demonstrate that applying six multi-peak tapers with a support vector machine yields 81% classification accuracy on seven emotions from the Berlin emotion database using only spectral features, compared with 72% for the conventional Hamming window method.
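
The variance argument in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it swaps in sine tapers (an orthonormal family that is easy to generate in closed form) for the multi-peak tapers used in the paper, uses uniform weights, and works on synthetic white-noise frames rather than Berlin database speech. All function names, the frame length, the bin index, and the seed are hypothetical choices made for illustration.

```python
import cmath
import math
import random


def sine_tapers(n, k):
    """Return k orthonormal sine tapers, each of length n.

    Assumption: sine tapers stand in for the paper's multi-peak tapers.
    """
    scale = math.sqrt(2.0 / (n + 1))
    return [
        [scale * math.sin(math.pi * (j + 1) * (i + 1) / (n + 1)) for i in range(n)]
        for j in range(k)
    ]


def hamming_taper(n):
    """Return a Hamming window scaled to unit energy."""
    w = [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]
    norm = math.sqrt(sum(v * v for v in w))
    return [v / norm for v in w]


def tapered_power(frame, taper, bin_index):
    """Squared DFT magnitude of the tapered frame at one frequency bin."""
    n = len(frame)
    acc = sum(
        s * t * cmath.exp(-2j * math.pi * bin_index * i / n)
        for i, (s, t) in enumerate(zip(frame, taper))
    )
    return abs(acc) ** 2


def multitaper_power(frame, tapers, bin_index, weights=None):
    """Weighted average of the eigenspectra (uniform weights by default)."""
    if weights is None:
        weights = [1.0 / len(tapers)] * len(tapers)
    return sum(w * tapered_power(frame, t, bin_index) for w, t in zip(weights, tapers))


def sample_variance(values):
    """Unbiased sample variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def compare_variance(num_frames=200, n=64, num_tapers=6, bin_index=16, seed=0):
    """Variance of both estimators at one bin over white-noise frames."""
    rng = random.Random(seed)
    single = hamming_taper(n)
    tapers = sine_tapers(n, num_tapers)
    single_est, multi_est = [], []
    for _ in range(num_frames):
        frame = [rng.gauss(0.0, 1.0) for _ in range(n)]
        single_est.append(tapered_power(frame, single, bin_index))
        multi_est.append(multitaper_power(frame, tapers, bin_index))
    return sample_variance(single_est), sample_variance(multi_est)
```

Calling `compare_variance()` returns the sample variances of the single-taper and multi-taper estimates, in that order. For white noise the multi-taper value is expected to be lower, because averaging K approximately uncorrelated eigenspectra reduces the variance by roughly a factor of K.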


Pages: 1044-1049

Number of pages: 6

50 items in total

[11] Automatic speech emotion recognition using modulation spectral features
Wu, Siqing
Falk, Tiago H.
Chan, Wai-Yip
SPEECH COMMUNICATION, 2011, 53 (05) : 768 - 785
[12] Speech emotion recognition using multi resolution Hilbert transform based spectral and entropy features
Mishra, Siba Prasad
Warule, Pankaj
Deb, Suman
APPLIED ACOUSTICS, 2025, 229
[13] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
Arijul Haque
K. Sreenivasa Rao
Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
[14] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
Haque, Arijul
Rao, K. Sreenivasa
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
[15] Multi-taper spectral analysis of stimulation artifact and epileptiform seizure entrainment data
Chernyy, Nick
Sunderam, S.
Mason, J.
Weinstein, S. L.
Schiff, S. J.
Gluckman, B. J.
EPILEPSIA, 2007, 48 : 309 - 309
[16] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Zhou, Yu
Li, Junfeng
Sun, Yanqing
Zhang, Jianping
Yan, Yonghong
Akagi, Masato
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
[17] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
Gaurav, Manish
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
[18] GMM supervector based SVM with spectral features for speech emotion recognition
Hu, Hao
Xu, Ming-Xing
Wu, Wei
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
[19] Fusion of Global Statistical and Segmental Spectral Features for Speech Emotion Recognition
Hu, Hao
Xu, Ming-Xing
Wu, Wei
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1013 - 1016
[20] Deep Learning Algorithms for Speech Emotion Recognition with Hybrid Spectral Features
Kogila R.
Sadanandam M.
Bhukya H.
SN Computer Science, 5 (1)
