Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

被引:0
|
作者
Lin, Yi-Cheng [1 ]
Wu, Haibin [1 ]
Chou, Huang-Cheng [2 ]
Lee, Chi-Chun [2 ]
Lee, Hung-yi [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Natl Tsing Hua Univ, Hsinchu, Taiwan
来源
INTERSPEECH 2024 | 2024年
关键词
social bias; self-supervised learning; emotion recognition;
D O I
10.21437/Interspeech.2024-1073
中图分类号
学科分类号
摘要
The rapid growth of Speech Emotion Recognition (SER) has diverse global applications, from improving human-computer interactions to aiding mental health diagnostics. However, SER models might contain social bias toward gender, leading to unfair outcomes. This study analyzes gender bias in SER models trained with Self-Supervised Learning (SSL) at scale, exploring factors influencing it. SSL-based SER models are chosen for their cutting-edge performance. Our research pioneering research gender bias in SER from both upstream model and data perspectives. Our findings reveal that females exhibit slightly higher overall SER performance than males. Modified CPC and XLS-R, two well-known SSL models, notably exhibit significant bias. Moreover, models trained with Mandarin datasets display a pronounced bias toward valence. Lastly, we find that gender-wise emotion distribution differences in training data significantly affect gender bias, while upstream model representation has a limited impact.
引用
收藏
页码:4633 / 4637
页数:5
相关论文
共 50 条
  • [31] Galaxy formation and large-scale bias
    Mon Not Royal Astron Soc, 4 (795):
  • [32] Web-based and mixed-mode cognitive large-scale assessments in higher education: An evaluation of selection bias, measurement bias, and prediction bias
    Sabine Zinn
    Uta Landrock
    Timo Gnambs
    Behavior Research Methods, 2021, 53 : 1202 - 1217
  • [33] Speech emotion recognition for the Urdu languageDataset and evaluation
    Nimra Zaheer
    Obaid Ullah Ahmad
    Mudassir Shabbir
    Agha Ali Raza
    Language Resources and Evaluation, 2023, 57 : 915 - 944
  • [34] Statistical Evaluation of Speech Features for Emotion Recognition
    Iliou, Theodoros
    Anagnostopoulos, Christos-Nikolaos
    ICDT: 2009 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL TELECOMMUNICATIONS, 2009, : 121 - 126
  • [35] Web-based and mixed-mode cognitive large-scale assessments in higher education: An evaluation of selection bias, measurement bias, and prediction bias
    Zinn, Sabine
    Landrock, Uta
    Gnambs, Timo
    BEHAVIOR RESEARCH METHODS, 2021, 53 (03) : 1202 - 1217
  • [36] Rethinking AI: bias in speech-recognition chatbots for ELT
    Jeon, Jaeho
    Lee, Seongyong
    Coronel-Molina, Serafin M.
    ELT JOURNAL, 2024, 78 (04) : 435 - 445
  • [37] Bias in Automatic Speech Recognition: The Case of African American Language
    Martin, Joshua L.
    Wright, Kelly Elizabeth
    APPLIED LINGUISTICS, 2023, 44 (04) : 613 - 630
  • [38] Adaptive transition bias for robust low complexity speech recognition
    Koumpis, K
    Riis, SK
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 277 - 280
  • [39] Emotion-induced modulation of recognition memory decisions in a Go/NoGo task: Response bias or memory bias?
    Windmann, Sabine
    Chmielewski, Adam
    COGNITION & EMOTION, 2008, 22 (05) : 761 - 776
  • [40] A novel frequency warping scale for speech emotion recognition
    Singh, Premjeet
    Saha, Goutam
    INTERSPEECH 2023, 2023, : 3647 - 3651