Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis

Citations: 0
Authors
Goldfarb-Tarrant, Seraphina [1, 2]
Ross, Bjorn [1 ]
Lopez, Adam [1 ]
Affiliations
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Cohere, Toronto, ON, Canada
DOI: not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis (SA) systems are widely deployed in many of the world's languages, and there is well-documented evidence of demographic bias in these systems. In languages beyond English, scarcer training data is often supplemented with transfer learning using pre-trained models, including multilingual models trained on other languages. In some cases, even supervision data comes from other languages. Does cross-lingual transfer also import new biases? To answer this question, we use counterfactual evaluation to test whether gender or racial biases are imported when using cross-lingual transfer, compared to a monolingual transfer setting. Across five languages, we find that systems using cross-lingual transfer usually become more biased than their monolingual counterparts. We also find racial biases to be much more prevalent than gender biases. To spur further research on this topic, we release the sentiment models we used for this study, and the intermediate checkpoints throughout training, yielding 1,525 distinct models; we also release our evaluation code.
Pages: 5691-5704
Page count: 14
Related papers
50 in total
  • [21] Cross-lingual sentiment classification with stacked autoencoders
    Guangyou Zhou
    Zhiyuan Zhu
    Tingting He
    Xiaohua Tony Hu
    Knowledge and Information Systems, 2016, 47 : 27 - 44
  • [22] An Approach to Cross-lingual Sentiment Lexicon Construction
    Chang, Chia-Hsuan
    Wu, Ming-Lun
    Hwang, San-Yih
    2019 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS 2019), 2019, : 129 - 131
  • [23] Cross-lingual sentiment classification with stacked autoencoders
    Zhou, Guangyou
    Zhu, Zhiyuan
    He, Tingting
    Hu, Xiaohua Tony
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (01) : 27 - 44
  • [24] Active Learning for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Wang, Rong
    Liu, Huanhuan
    Huang, Chu-Ren
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 236 - 246
  • [25] A multimodal approach to cross-lingual sentiment analysis with ensemble of transformer and LLM
    Miah, Md Saef Ullah
    Kabir, Md Mohsin
    Bin Sarwar, Talha
    Safran, Mejdl
    Alfarhood, Sultan
    Mridha, M. F.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [26] A Systematic Review of Cross-Lingual Sentiment Analysis: Tasks, Strategies, and Prospects
    Zhao, Chuanjun
    Wu, Meiling
    Yang, Xinyi
    Zhang, Wenyue
    Zhang, Shaoxia
    Wang, Suge
    Li, Deyu
    ACM COMPUTING SURVEYS, 2024, 56 (07)
  • [27] Cross-Lingual Transfer Learning Framework for Program Analysis
    Li, Zhiming
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1074 - 1078
  • [28] Data Quality Controlling for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Xue, Yunxia
    Wang, Zhongqing
    Lee, Sophia Yat Mei
    Huang, Chu-Ren
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 125 - 128
  • [29] A Cross-Lingual Approach for Building Multilingual Sentiment Lexicons
    Naderalvojoud, Behzad
    Qasemizadeh, Behrang
    Kallmeyer, Laura
    Sezer, Ebru Akcapinar
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 259 - 266
  • [30] Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques
    Reusens, Manon
    Borchert, Philipp
    Mieskes, Margot
    De Weerdt, Jochen
    Baesens, Bart
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2887 - 2896