Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis

被引:0
|
作者
Goldfarb-Tarrant, Seraphina [1 ,2 ]
Ross, Bjorn [1 ]
Lopez, Adam [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Cohere, Toronto, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis (SA) systems are widely deployed in many of the world's languages, and there is well-documented evidence of demographic bias in these systems. In languages beyond English, scarcer training data is often supplemented with transfer learning using pre-trained models, including multilingual models trained on other languages. In some cases, even supervision data comes from other languages. Does cross-lingual transfer also import new biases? To answer this question, we use counterfactual evaluation to test whether gender or racial biases are imported when using cross-lingual transfer, compared to a monolingual transfer setting. Across five languages, we find that systems using cross-lingual transfer usually become more biased than their monolingual counterparts. We also find racial biases to be much more prevalent than gender biases. To spur further research on this topic, we release the sentiment models we used for this study, and the intermediate checkpoints throughout training, yielding 1,525 distinct models; we also release our evaluation code.(1)
引用
收藏
页码:5691 / 5704
页数:14
相关论文
共 50 条
  • [1] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [2] Cross-lingual sentiment transfer with limited resources
    Rasooli, Mohammad Sadegh
    Farra, Noura
    Radeva, Axinia
    Yu, Tao
    McKeown, Kathleen
    MACHINE TRANSLATION, 2018, 32 (1-2) : 143 - 165
  • [3] Cross-Lingual Sentiment Analysis: A Survey
    Xu Y.
    Cao H.
    Wang W.
    Du W.
    Xu C.
    Data Analysis and Knowledge Discovery, 2023, 7 (01) : 1 - 21
  • [4] Cross-Lingual Propagation for Deep Sentiment Analysis
    Dong, Xin
    de Melo, Gerard
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5771 - 5778
  • [5] A comparative study of cross-lingual sentiment analysis
    Priban, Pavel
    Smid, Jakub
    Steinberger, Josef
    Mistera, Adam
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [6] Linear Transformations for Cross-lingual Sentiment Analysis
    Priban, Pavel
    Smid, Jakub
    Mistera, Adam
    Kral, Pavel
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 125 - 137
  • [7] Cross-Lingual Sentiment Quantification
    Esuli, Andrea
    Moreo, Alejandro
    Sebastiani, Fabrizio
    IEEE INTELLIGENT SYSTEMS, 2020, 35 (03) : 106 - 113
  • [8] Semi-supervised Learning on Cross-Lingual Sentiment Analysis with Space Transfer
    He, Xiaonan
    Zhang, Hui
    Chao, Wenhan
    Wang, Daqing
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 371 - 377
  • [9] Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
    Zhao, Jieyu
    Mukherjee, Subhabrata
    Hosseini, Saghar
    Chang, Kai-Wei
    Awadallah, Ahmed Hassan
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2896 - 2907
  • [10] Cross-Lingual Sentiment Analysis for Indian Regional Languages
    Impana, P.
    Kallimani, Jagadish S.
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 867 - 872