Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis

被引:0
|
作者
Goldfarb-Tarrant, Seraphina [1 ,2 ]
Ross, Bjorn [1 ]
Lopez, Adam [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Cohere, Toronto, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis (SA) systems are widely deployed in many of the world's languages, and there is well-documented evidence of demographic bias in these systems. In languages beyond English, scarcer training data is often supplemented with transfer learning using pre-trained models, including multilingual models trained on other languages. In some cases, even supervision data comes from other languages. Does cross-lingual transfer also import new biases? To answer this question, we use counterfactual evaluation to test whether gender or racial biases are imported when using cross-lingual transfer, compared to a monolingual transfer setting. Across five languages, we find that systems using cross-lingual transfer usually become more biased than their monolingual counterparts. We also find racial biases to be much more prevalent than gender biases. To spur further research on this topic, we release the sentiment models we used for this study, and the intermediate checkpoints throughout training, yielding 1,525 distinct models; we also release our evaluation code.(1)
引用
收藏
页码:5691 / 5704
页数:14
相关论文
共 50 条
  • [31] Cross-lingual Transfer of Monolingual Models
    Gogoulou, Evangelia
    Ekgren, Ariel
    Isbister, Tim
    Sahlgren, Magnus
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 948 - 955
  • [32] Model Selection for Cross-Lingual Transfer
    Chen, Yang
    Ritter, Alan
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5675 - 5687
  • [33] Cross-Lingual Sentiment Analysis in Deep Learning: A Comparative Study of Multilingual Approaches
    Kumar, Rishabh
    Kumar, Rajat
    Singh, Ritik
    Katarya, Rahul
    2023 14th International Conference on Computing Communication and Networking Technologies, ICCCNT 2023, 2023,
  • [34] A Knowledge-Enhanced Adversarial Model for Cross-lingual Structured Sentiment Analysis
    Zhang, Qi
    Zhou, Jie
    Chen, Qin
    Bai, Qingchun
    Xiao, Jun
    He, Liang
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] Exploring the Cross-Lingual Similarity of Valmiki Ramayana Using Semantic and Sentiment Analysis
    Kulkarni, Pooja
    Birajdar, Gajanan K.
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2025,
  • [36] Improving Cross-lingual Aspect-based Sentiment Analysis with Sememe Bridge
    Liu, Yijiang
    Li, Fei
    Ji, Donghong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (12)
  • [37] Can Cross-Domain Term Extraction Benefit from Cross-lingual Transfer?
    Tran, Hanh Thi Hong
    Martinc, Matej
    Doucet, Antoine
    Pollak, Senja
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 363 - 378
  • [38] Cross-lingual Opinion Analysis via Negative Transfer Detection
    Gui, Lin
    Xu, Ruifeng
    Lu, Qin
    Xu, Jun
    Xu, Jian
    Liu, Bin
    Wang, Xiaolong
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 860 - 865
  • [39] Latent Sentiment Model for Weakly-Supervised Cross-Lingual Sentiment Classification
    He, Yulan
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 214 - 225
  • [40] Coarse Alignment of Topic and Sentiment: A Unified Model for Cross-Lingual Sentiment Classification
    Wang, Deqing
    Jing, Baoyu
    Lu, Chenwei
    Wu, Junjie
    Liu, Guannan
    Du, Chenguang
    Zhuang, Fuzhen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 736 - 747