Gender-Aware Speech Emotion Recognition in Multiple Languages

被引:0
|
作者
Nicolini, Marco [1 ]
Ntalampiras, Stavros [1 ]
机构
[1] Univ Milan, Dept Comp Sci, Milan, Italy
关键词
Audio pattern recognition; Machine learning; Transfer learning; Convolutional neural network; YAMNet; Multilingual speech emotion recognition; CORPUS;
D O I
10.1007/978-3-031-54726-3_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a solution for Speech Emotion Recognition (SER) in multilingual setting using a hierarchical approach. The approach involves two levels, the first level identifies the gender of the speaker, while the second level predicts their emotional state. We evaluate the performance of three classifiers of increasing complexity: k-NN, transfer learning based on YAMNet, and Bidirectional Long Short-Term Memory neural networks. The models were trained, validated, and tested on a dataset that includes the big-six emotions and was collected from well-known SER datasets representing six different languages. Our results indicate that there are differences in classification accuracy when considering all data versus only female or male data, across all classifiers. Interestingly, prior knowledge of the speaker's gender can improve the overall classification performance.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [1] Gender-Aware CNN-BLSTM for Speech Emotion Recognition
    Zhang, Linjuan
    Wang, Longbiao
    Dang, Jianwu
    Guo, Lili
    Yu, Qiang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 782 - 790
  • [2] Advanced differential evolution for gender-aware English speech emotion recognition
    Yue, Liya
    Hu, Pei
    Zhu, Jiulong
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] A Gender-Aware Deep Neural Network Structure for Speech Recognition
    Zoughi, Toktam
    Homayounpour, Mohammad Mehdi
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2019, 43 (03) : 635 - 644
  • [4] A Gender-Aware Deep Neural Network Structure for Speech Recognition
    Toktam Zoughi
    Mohammad Mehdi Homayounpour
    Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2019, 43 : 635 - 644
  • [5] Towards Emotion, Age- and Gender-Aware VoiceXML Applications
    Schmitt, Alexander
    Heinroth, Tobias
    Bertrand, Gregor
    INTELLIGENT ENVIRONMENTS 2009, 2009, 2 : 34 - 41
  • [6] HGF-MiLaG: Hierarchical Graph Fusion for Emotion Recognition in Conversation with Mid-Late Gender-Aware Strategy
    Wang, Yihan
    Hao, Rongrong
    Li, Ziheng
    Kuang, Xinhe
    Dong, Jiacheng
    Zhang, Qi
    Qian, Fengkui
    Fu, Changzeng
    SENSORS, 2025, 25 (04)
  • [7] Use of Multiple Classifier System for Gender Driven Speech Emotion Recognition
    Ladde, Pravina P.
    Deshmukh, Vaishali S.
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 713 - 717
  • [8] Gender-aware Re-ranking
    Kharitonov, Eugene
    Serdyukov, Pavel
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1081 - 1082
  • [9] Gender-Aware Course Reform in Scientific Computing
    Larsson, Elisabeth
    Palsson, Stefan
    Rantakokko, Jarmo
    von Sydow, Lina
    Thune, Michael
    INTERNATIONAL JOURNAL OF ENGINEERING EDUCATION, 2013, 29 (02) : 403 - 414
  • [10] Understanding tourism processes: A gender-aware framework
    Kinnaird, V
    Hall, D
    TOURISM MANAGEMENT, 1996, 17 (02) : 95 - 102