Gender-Aware Speech Emotion Recognition in Multiple Languages

被引:0
|
作者
Nicolini, Marco [1 ]
Ntalampiras, Stavros [1 ]
机构
[1] Univ Milan, Dept Comp Sci, Milan, Italy
关键词
Audio pattern recognition; Machine learning; Transfer learning; Convolutional neural network; YAMNet; Multilingual speech emotion recognition; CORPUS;
D O I
10.1007/978-3-031-54726-3_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a solution for Speech Emotion Recognition (SER) in multilingual setting using a hierarchical approach. The approach involves two levels, the first level identifies the gender of the speaker, while the second level predicts their emotional state. We evaluate the performance of three classifiers of increasing complexity: k-NN, transfer learning based on YAMNet, and Bidirectional Long Short-Term Memory neural networks. The models were trained, validated, and tested on a dataset that includes the big-six emotions and was collected from well-known SER datasets representing six different languages. Our results indicate that there are differences in classification accuracy when considering all data versus only female or male data, across all classifiers. Interestingly, prior knowledge of the speaker's gender can improve the overall classification performance.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [31] End-to-End Speech Emotion Recognition With Gender Information
    Sun, Ting-Wei
    IEEE ACCESS, 2020, 8 (08): : 152423 - 152438
  • [32] Gender Differentiated Convolutional Neural Networks for Speech Emotion Recognition
    Mishra, Puneet
    Sharma, Ruchir
    2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 142 - 148
  • [33] Speech Emotion Recognition Based on Gender Influence in Emotional Expression
    Vasuki, P.
    Bharati, Divya R.
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2019, 15 (04) : 22 - 40
  • [34] TACST: Time-Aware Transformer for Robust Speech Emotion Recognition
    Wei, Wei
    Zhang, Bingkun
    Wang, Yibing
    MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 442 - 453
  • [35] Sparse temporal aware capsule network for robust speech emotion recognition
    Zhang, Huiyun
    Huang, Heming
    Zhao, Puyang
    Yu, Zhenbao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [36] Adaptive Domain-Aware Representation Learning for Speech Emotion Recognition
    Fan, Weiquan
    Xu, Xiangmin
    Xing, Xiaofen
    Huang, Dongyan
    INTERSPEECH 2020, 2020, : 4089 - 4093
  • [37] A Gender-Aware Gamified Scaffolding of Mathematics for the Middle School Level
    Roessler, Sarah
    Allison, Mark
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON BIG DATA AND EDUCATION (ICBDE 2018), 2018, : 121 - 126
  • [38] Gender and social entrepreneurship in turbulent waters: developing a gender-aware conceptual framework
    de Magdalene, Persephone
    Green, Kai Roland
    INTERNATIONAL JOURNAL OF GENDER AND ENTREPRENEURSHIP, 2025, 17 (01) : 37 - 64
  • [39] Gender-aware Estimation of Depression Severity Level in a Multimodal Setting
    Oureshi, Syed Arbaaz
    Dias, Gael
    Saha, Sriparna
    Hasanuzzaman, Mohammed
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] A Gender-Aware Framework for the Daytime Detection of Obstructive Sleep Apnea
    Samy, Lauren
    Macey, Paul M.
    Sarrafzadeh, Majid
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 7683 - 7687