A Comparative Study on Different Labelling Schemes and Cross-Corpus Experiments in Speech Emotion Recognition

Cited by: 0

Authors:

Baki, Pinar [1]

Erden, Berna [1]

Oncul, Serkan [1]

Affiliation:

[1] Arcel Arastirma Gelistirme Merkezi, Istanbul, Turkey

Source:

29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021) | 2021

Keywords:

speech emotion recognition; cross-corpus training; emotion categories; audio classification

DOI:

10.1109/SIU53274.2021.9477924

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Subject classification codes:

0808; 0809

Abstract:

The performance of speech emotion recognition systems depends on many factors, such as the quality of the speech data, environment, cultural differences, language, and the emotion categorization scheme. In this work, we create a baseline speech emotion recognition model based on convolutional neural networks using the RAVDESS dataset. First, we compare the performance of the model under different labeling schemes. Then, we perform cross-corpus experiments on datasets recorded in different languages. The results show that emotion groups sharing arousal or valence categories are often confused with one another, and that using multiple corpora in training improves the generalization capacity of the model.


Pages: 4

Related items: 50 in total

[21] Deep Transductive Transfer Regression Network for Cross-Corpus Speech Emotion Recognition
Zhao, Yan
Wang, Jincen
Ye, Ru
Zong, Yuan
Zheng, Wenming
Zhao, Li
INTERSPEECH 2022, 2022, : 371 - 375
[22] Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach
Zhao, Yan
Zong, Yuan
Lian, Hailun
Lu, Cheng
Shi, Jingang
Zheng, Wenming
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
[23] Cross-Corpus Speech Emotion Recognition Based on Sparse Subspace Transfer Learning
Zhao, Keke
Song, Peng
Zhang, Wenjing
Zhang, Weijian
Li, Shaokai
Chen, Dongliang
Zheng, Wenming
BIOMETRIC RECOGNITION (CCBR 2021), 2021, 12878 : 466 - 473
[24] Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition
Chen, Xiuzhen
Zhou, Xiaoyan
Lu, Cheng
Zong, Yuan
Zheng, Wenming
Tang, Chuangao
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2632 - 2636
[25] An adaptation framework with unified embedding reconstruction for cross-corpus speech emotion recognition
Zhang, Ruiteng
Wei, Jianguo
Lu, Xugang
Li, Yongwei
Lu, Wenhuan
Zhang, Lin
Xu, Junhai
APPLIED SOFT COMPUTING, 2025, 174
[26] Cross-Corpus Arabic and English Emotion Recognition
Meftah, Ali
Seddiq, Yasser
Alotaibi, Yousef
Selouani, Sid-Ahmed
2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 377 - 381
[27] Cross-corpus speech emotion recognition using subspace learning and domain adaption
Xuan Cao
Maoshen Jia
Jiawei Ru
Tun-wen Pai
EURASIP Journal on Audio, Speech, and Music Processing, 2022
[28] Cross-corpus speech emotion recognition using subspace learning and domain adaption
Cao, Xuan
Jia, Maoshen
Ru, Jiawei
Pai, Tun-wen
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
[29] Progressive distribution adapted neural networks for cross-corpus speech emotion recognition
Zong, Yuan
Lian, Hailun
Zhang, Jiacheng
Feng, Ercui
Lu, Cheng
Chang, Hongli
Tang, Chuangao
FRONTIERS IN NEUROROBOTICS, 2022, 16
[30] Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition
Zhang, Weijian
Song, Peng
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 307 - 318
