Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

Citations: 0

Authors:

Lin, Yi-Cheng [1]

Wu, Haibin [1]

Chou, Huang-Cheng [2]

Lee, Chi-Chun [2]

Lee, Hung-yi [1]

Affiliations:

[1] Natl Taiwan Univ, Taipei, Taiwan

[2] Natl Tsing Hua Univ, Hsinchu, Taiwan

Source:

INTERSPEECH 2024 | 2024

Keywords:

social bias; self-supervised learning; emotion recognition

DOI:

10.21437/Interspeech.2024-1073

Abstract:

Speech Emotion Recognition (SER) is growing rapidly and has diverse global applications, from improving human-computer interaction to aiding mental health diagnostics. However, SER models might contain social bias toward gender, leading to unfair outcomes. This study analyzes gender bias in SER models trained with Self-Supervised Learning (SSL) at scale and explores the factors that influence it. SSL-based SER models are chosen for their cutting-edge performance. Our research pioneers the study of gender bias in SER from both the upstream model and data perspectives. Our findings reveal that females exhibit slightly higher overall SER performance than males. Modified CPC and XLS-R, two well-known SSL models, exhibit notably significant bias. Moreover, models trained with Mandarin datasets display a pronounced bias toward valence. Lastly, we find that gender-wise differences in the emotion distribution of the training data significantly affect gender bias, while the upstream model representation has a limited impact.

Pages: 4633-4637

Page count: 5
