Prominence features: Effective emotional features for speech emotion recognition

被引：43

作者：

Jing, Shaoling ^{[1
]}

Mao, Xia ^{[1
]}

Chen, Lijiang ^{[1
]}

机构：

[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China

来源：

DIGITAL SIGNAL PROCESSING | 2018年 / 72卷

基金：

中国国家自然科学基金;

关键词：

Prominence features; Speech annotation; Consistency assessment; Speech emotion recognition; FUNDAMENTAL-FREQUENCY; PERCEIVED PROMINENCE; AGREEMENT;

D O I：

10.1016/j.dsp.2017.10.016

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Emotion-related feature extraction is a challenging task in speech emotion recognition. Due to the lack of discriminative acoustic features, classical approaches based on traditional acoustic features could not provide satisfactory performances. This research proposes a novel type of feature related to prominence, which, together with traditional acoustic features, are used to classify seven typical different emotional states. To this end, the author group produces a Chinese Dual-mode Emotional Speech Database (CDESD), which contains additional prominence and paralinguistic annotation information. Then, a consistency assessment algorithm is presented to validate the reliability of the annotation information of this database. The results show that the annotation consistency on prominence reaches more than 60% on average. Subsequently, this research analyzes the correlation of the prominence features with emotional states using a curve fitting method. Prominence is found to be closely related to emotion states, to retain emotional information at the word level to the greatest possible extent and to play an important role in emotional expression. Finally, the proposed prominence features are validated on CDESD through speaker dependent and speaker-independent experiments with four commonly used classifiers. The results show that the average recognition rate achieved using the combined features is improved by 6% in speaker dependent experiments and by 6.2% in speaker-independent experiments compared with that achieved using only acoustic features. (C) 2017 Elsevier Inc. All rights reserved.

引用

页码：216 / 231

页数：16

共 50 条

[41] Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
Kadin, Sudarsana Reddy
Gangamohan, P.
Gangashetty, Suryakanth, V
Alku, Paavo
Yegnanarayana, B.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (09) : 4459 - 4481
[42] Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
Sudarsana Reddy Kadiri
P. Gangamohan
Suryakanth V. Gangashetty
Paavo Alku
B. Yegnanarayana
Circuits, Systems, and Signal Processing, 2020, 39 : 4459 - 4481
[43] A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms
Byun, Sung-Woo
Lee, Seok-Pil
APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 15
[44] NOT ALL FEATURES ARE EQUAL: SELECTION OF ROBUST FEATURES FOR SPEECH EMOTION RECOGNITION IN NOISY ENVIRONMENTS
Leem, Seong-Gyun
Fulford, Daniel
Onnela, Jukka-Pekka
Gard, David
Busso, Carlos
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6447 - 6451
[45] Exploring the benefits of discretization of acoustic features for speech emotion recognition
Vogt, Thurid
Andre, Elisabeth
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 348 - 351
[46] Speech Emotion Recognition Using Auditory Spectrogram and Cepstral Features
Zhao, Shujie
Yang, Yan
Cohen, Israel
Zhang, Lijun
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 136 - 140
[47] Investigating Graph-based Features for Speech Emotion Recognition
Pentari, Anastasia
Kafentzis, George
Tsiknakis, Manolis
2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22), 2022,
[48] Learning Salient Features for Speech Emotion Recognition Using CNN
Liu, Jiamu
Han, Wenjing
Ruan, Huabin
Chen, Xiaomin
Jiang, Dongmei
Li, Haifeng
2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
[49] Speech Emotion Recognition Using Neural Network and Wavelet Features
Roy, Tanmoy
Marwala, Tshilidzi
Chakraverty, S.
RECENT TRENDS IN WAVE MECHANICS AND VIBRATIONS, WMVC 2018, 2020, : 427 - 438
[50] Speech emotion recognition based on prosodic segment level features
Han, Wenjing
Li, Haifeng
Qinghua Daxue Xuebao/Journal of Tsinghua University, 2009, 49 (SUPPL. 1): : 1363 - 1368

← 1 2 3 4 5 →