Gender-Aware Speech Emotion Recognition in Multiple Languages

被引:0
|
作者
Nicolini, Marco [1 ]
Ntalampiras, Stavros [1 ]
机构
[1] Univ Milan, Dept Comp Sci, Milan, Italy
关键词
Audio pattern recognition; Machine learning; Transfer learning; Convolutional neural network; YAMNet; Multilingual speech emotion recognition; CORPUS;
D O I
10.1007/978-3-031-54726-3_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a solution for Speech Emotion Recognition (SER) in multilingual setting using a hierarchical approach. The approach involves two levels, the first level identifies the gender of the speaker, while the second level predicts their emotional state. We evaluate the performance of three classifiers of increasing complexity: k-NN, transfer learning based on YAMNet, and Bidirectional Long Short-Term Memory neural networks. The models were trained, validated, and tested on a dataset that includes the big-six emotions and was collected from well-known SER datasets representing six different languages. Our results indicate that there are differences in classification accuracy when considering all data versus only female or male data, across all classifiers. Interestingly, prior knowledge of the speaker's gender can improve the overall classification performance.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [41] UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS
    Li, Longxin
    Mak, Man-Wai
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5269 - 5273
  • [42] Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages
    Latif, Siddique
    Qayyum, Adnan
    Usman, Muhammad
    Qadir, Junaid
    2018 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2018), 2018, : 88 - 93
  • [43] A Review of the Advancement in Speech Emotion Recognition for Indo-Aryan and Dravidian Languages
    Monisha, Syeda Tamanna Alam
    Sultana, Sadia
    ADVANCES IN HUMAN-COMPUTER INTERACTION, 2022, 2022
  • [44] Relationships Self-Learning Based Gender-Aware Age Estimation
    Qing Tian
    Meng Cao
    Songcan Chen
    Hujun Yin
    Neural Processing Letters, 2019, 50 : 2141 - 2160
  • [45] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [46] MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION
    Nediyanchath, Anish
    Paramasivam, Periyasamy
    Yenigalla, Promod
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7179 - 7183
  • [47] Emotion Prompting for Speech Emotion Recognition
    Zhou, Xingfa
    Li, Min
    Yang, Lan
    Sun, Rui
    Wang, Xin
    Zhan, Huayi
    INTERSPEECH 2023, 2023, : 3108 - 3112
  • [48] Speaker and gender dependencies in within/cross linguistic Speech Emotion Recognition
    Chakhtouna A.
    Sekkate S.
    Adib A.
    International Journal of Speech Technology, 2023, 26 (03) : 609 - 625
  • [49] Spontaneous speech emotion recognition via multiple kernel learning
    Zha, Cheng
    Yang, Ping
    Zhang, Xinran
    Zhao, Li
    PROCEEDINGS 2016 EIGHTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION ICMTMA 2016, 2016, : 621 - 623
  • [50] Speech Emotion Recognition Using Combined Multiple Pairwise Classifiers
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HCI INTERNATIONAL 2021 - LATE BREAKING POSTERS, HCII 2021, PT I, 2021, 1498 : 115 - 118