Fusion of Global Statistical and Segmental Spectral Features for Speech Emotion Recognition

被引：0

作者：

Hu, Hao ^{[1
]}

Xu, Ming-Xing ^{[1
]}

Wu, Wei ^{[1
]}

机构：

[1] Tsinghua Univ, Ctr Speech Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

speech emotion recognition; global statistical features; segmental spectral features; decision fusion;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speech emotion recognition is an interesting and challenging speech technology, which can be applied to broad areas. In this paper, we propose to fuse the global statistical and segmental spectral features at the decision level for speech emotion recognition. Each emotional utterance is individually scored by two recognition systems, the global statistics-based and segmental spectrum-based systems, and a weighted linear combination is applied to fuse their scores for final decision. Experimental results on an emotional speech database demonstrate that the global statistical and segmental spectral features are complementary, and the proposed fusion approach further improves the performance of the emotion recognition system.

引用

页码：1013 / 1016

页数：4

共 50 条

[1] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
Gaurav, Manish
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
[2] Statistical Evaluation of Speech Features for Emotion Recognition
Iliou, Theodoros
Anagnostopoulos, Christos-Nikolaos
ICDT: 2009 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL TELECOMMUNICATIONS, 2009, : 121 - 126
[3] Hybrid Spectral Features for Speech Emotion Recognition
Shah, Firoz A.
Anto, Babu P.
2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
[4] Speech Emotion Recognition Using Local and Global Features
Gao, Yuanbo
Li, Baobin
Wang, Ning
Zhu, Tingshao
BRAIN INFORMATICS, BI 2017, 2017, 10654 : 3 - 13
[5] Automatic speech emotion recognition using modulation spectral features
Wu, Siqing
Falk, Tiago H.
Chan, Wai-Yip
SPEECH COMMUNICATION, 2011, 53 (05) : 768 - 785
[6] Fusion of PCA and ICA in Statistical Subset Analysis for Speech Emotion Recognition
Kingeski, Rafael
Henning, Elisa
Paterno, Aleksander S.
SENSORS, 2024, 24 (17)
[7] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Zhou, Yu
Li, Junfeng
Sun, Yanqing
Zhang, Jianping
Yan, Yonghong
Akagi, Masato
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
[8] GMM supervector based SVM with spectral features for speech emotion recognition
Hu, Hao
Xu, Ming-Xing
Wu, Wei
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
[9] Multi-Taper Spectral Features for Emotion Recognition from Speech
Chapaneri, Santosh V.
Jayaswal, Deepak D.
2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), 2015, : 1044 - 1049
[10] Deep Learning Algorithms for Speech Emotion Recognition with Hybrid Spectral Features
Kogila R.
Sadanandam M.
Bhukya H.
SN Computer Science, 5 (1)

← 1 2 3 4 5 →