Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

被引:0
|
作者
Lin, Yi-Cheng [1 ]
Wu, Haibin [1 ]
Chou, Huang-Cheng [2 ]
Lee, Chi-Chun [2 ]
Lee, Hung-yi [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Natl Tsing Hua Univ, Hsinchu, Taiwan
来源
INTERSPEECH 2024 | 2024年
关键词
social bias; self-supervised learning; emotion recognition;
D O I
10.21437/Interspeech.2024-1073
中图分类号
学科分类号
摘要
The rapid growth of Speech Emotion Recognition (SER) has diverse global applications, from improving human-computer interactions to aiding mental health diagnostics. However, SER models might contain social bias toward gender, leading to unfair outcomes. This study analyzes gender bias in SER models trained with Self-Supervised Learning (SSL) at scale, exploring factors influencing it. SSL-based SER models are chosen for their cutting-edge performance. Our research pioneering research gender bias in SER from both upstream model and data perspectives. Our findings reveal that females exhibit slightly higher overall SER performance than males. Modified CPC and XLS-R, two well-known SSL models, notably exhibit significant bias. Moreover, models trained with Mandarin datasets display a pronounced bias toward valence. Lastly, we find that gender-wise emotion distribution differences in training data significantly affect gender bias, while upstream model representation has a limited impact.
引用
收藏
页码:4633 / 4637
页数:5
相关论文
共 50 条
  • [1] Facial emotion recognition in schizotypy: The role of accuracy and social cognitive bias
    Brown, Laura A.
    Cohen, Alex S.
    JOURNAL OF THE INTERNATIONAL NEUROPSYCHOLOGICAL SOCIETY, 2010, 16 (03) : 474 - 483
  • [2] A bias evaluation solution for multiple sensitive attribute speech recognition
    Chen, Zigang
    Zhou, Yuening
    Wang, Zhen
    Liu, Fan
    Leng, Tao
    Zhu, Haihua
    COMPUTER SPEECH AND LANGUAGE, 2025, 93
  • [3] The effect of speech cost bias and probability bias on social anxiety
    Noda, Shota
    Osawa, Kaori
    Shirotsuki, Kentaro
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2016, 51 : 1171 - 1171
  • [4] Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition
    Jalal, Md Asif
    Milner, Rosanna
    Hain, Thomas
    Moore, Roger K.
    INTERSPEECH 2020, 2020, : 4084 - 4088
  • [5] Threat Bias and Emotion Recognition in Victims of IPV
    Clauss, Kate
    Clements, Caroline
    JOURNAL OF INTERPERSONAL VIOLENCE, 2021, 36 (5-6) : NP2464 - NP2481
  • [6] EMO-KNOW: A Large Scale Dataset on Emotion and Emotion-cause
    Nguyen, Mia Huong
    Samaradivakara, Yasith
    Sasikumar, Prasanth
    Nanayakkara, Suranga
    Gupta, Chitralekha
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11043 - 11051
  • [7] LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION
    Fan, Weiquan
    Xu, Xiangmin
    Xing, Xiaofen
    Chen, Weidong
    Huang, Dongyan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 641 - 645
  • [8] Estimation of channel bias for telephone speech recognition
    Chien, JT
    Wang, HC
    Lee, LM
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1840 - 1843
  • [9] Evaluation of speech misattribution bias in schizophrenia
    Stephane, M.
    Kuskowski, M.
    McClannahan, K.
    Surerus, C.
    Nelson, K.
    PSYCHOLOGICAL MEDICINE, 2010, 40 (05) : 741 - 748
  • [10] EVALUATION OF SPEECH MISATTRIBUTION BIAS IN SCHIZOPHRENIA
    Stephane, Massoud
    Kuskowski, M.
    McClannahan, K.
    Surerus, C.
    SCHIZOPHRENIA BULLETIN, 2009, 35 : 16 - 16