Gender-Aware Speech Emotion Recognition in Multiple Languages

被引：0

作者：

Nicolini, Marco ^{[1
]}

Ntalampiras, Stavros ^{[1
]}

机构：

[1] Univ Milan, Dept Comp Sci, Milan, Italy

来源：

PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2023 | 2024年 / 14547卷

关键词：

Audio pattern recognition; Machine learning; Transfer learning; Convolutional neural network; YAMNet; Multilingual speech emotion recognition; CORPUS;

D O I：

10.1007/978-3-031-54726-3_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article presents a solution for Speech Emotion Recognition (SER) in multilingual setting using a hierarchical approach. The approach involves two levels, the first level identifies the gender of the speaker, while the second level predicts their emotional state. We evaluate the performance of three classifiers of increasing complexity: k-NN, transfer learning based on YAMNet, and Bidirectional Long Short-Term Memory neural networks. The models were trained, validated, and tested on a dataset that includes the big-six emotions and was collected from well-known SER datasets representing six different languages. Our results indicate that there are differences in classification accuracy when considering all data versus only female or male data, across all classifiers. Interestingly, prior knowledge of the speaker's gender can improve the overall classification performance.

引用

页码：111 / 123

页数：13

共 50 条

[31] End-to-End Speech Emotion Recognition With Gender Information
Sun, Ting-Wei
IEEE ACCESS, 2020, 8 (08): : 152423 - 152438
[32] Gender Differentiated Convolutional Neural Networks for Speech Emotion Recognition
Mishra, Puneet
Sharma, Ruchir
2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 142 - 148
[33] Speech Emotion Recognition Based on Gender Influence in Emotional Expression
Vasuki, P.
Bharati, Divya R.
INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2019, 15 (04) : 22 - 40
[34] TACST: Time-Aware Transformer for Robust Speech Emotion Recognition
Wei, Wei
Zhang, Bingkun
Wang, Yibing
MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 442 - 453
[35] Sparse temporal aware capsule network for robust speech emotion recognition
Zhang, Huiyun
Huang, Heming
Zhao, Puyang
Yu, Zhenbao
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
[36] Adaptive Domain-Aware Representation Learning for Speech Emotion Recognition
Fan, Weiquan
Xu, Xiangmin
Xing, Xiaofen
Huang, Dongyan
INTERSPEECH 2020, 2020, : 4089 - 4093
[37] A Gender-Aware Gamified Scaffolding of Mathematics for the Middle School Level
Roessler, Sarah
Allison, Mark
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON BIG DATA AND EDUCATION (ICBDE 2018), 2018, : 121 - 126
[38] Gender and social entrepreneurship in turbulent waters: developing a gender-aware conceptual framework
de Magdalene, Persephone
Green, Kai Roland
INTERNATIONAL JOURNAL OF GENDER AND ENTREPRENEURSHIP, 2025, 17 (01) : 37 - 64
[39] Gender-aware Estimation of Depression Severity Level in a Multimodal Setting
Oureshi, Syed Arbaaz
Dias, Gael
Saha, Sriparna
Hasanuzzaman, Mohammed
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[40] A Gender-Aware Framework for the Daytime Detection of Obstructive Sleep Apnea
Samy, Lauren
Macey, Paul M.
Sarrafzadeh, Majid
2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 7683 - 7687

← 1 2 3 4 5 →