Language Fairness in Multilingual Information Retrieval

被引:0
|
作者
Yang, Eugene [1 ]
Janich, Thomas [2 ]
Mayfield, James [1 ]
Lawrie, Dawn [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
language fairness; multilingual retrieval; statistical testing;
D O I
10.1145/3626772.3657943
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilingual information retrieval (MLIR) considers the problem of ranking documents in several languages for a query expressed in a language that may differ from any of those languages. Recent work has observed that approaches such as combining ranked lists representing a single document language each or using multilingual pretrained language models demonstrate a preference for one language over others. This results in systematic unfair treatment of documents in different languages. This work proposes a language fairness metric to evaluate whether documents across different languages are fairly ranked through statistical equivalence testing using the Kruskal-Wallis test. In contrast to most prior work in group fairness, we do not consider any language to be an unprotected group. Thus our proposed measure, PEER (Probability of Equal Expected Rank), is the first fairness metric specifically designed to capture the language fairness of MLIR systems. We demonstrate the behavior of PEER on artificial ranked lists. We also evaluate real MLIR systems on two publicly available benchmarks and show that the PEER scores align with prior analytical findings on MLIR fairness. Our implementation is compatible with ir-measures and is available at http://github.com/hltcoe/peer_measure.
引用
收藏
页码:2487 / 2491
页数:5
相关论文
共 50 条
  • [1] Multilingual information retrieval in the language modeling framework
    Rahimi, Razieh
    Shakery, Azadeh
    King, Irwin
    INFORMATION RETRIEVAL JOURNAL, 2015, 18 (03): : 246 - 281
  • [2] Multilingual information retrieval in the language modeling framework
    Razieh Rahimi
    Azadeh Shakery
    Irwin King
    Information Retrieval Journal, 2015, 18 : 246 - 281
  • [3] Language Agnostic Multilingual Information Retrieval with Contrastive Learning
    Hu, Xiyang
    Chen, Xinchi
    Qi, Peng
    Kong, Deguang
    Liu, Kunlun
    Wang, William Yang
    Huang, Zhiheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9133 - 9146
  • [4] Cross-language information retrieval in a multilingual legal domain
    Sheridan, P
    Braschler, M
    Schauble, P
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 1997, 1324 : 253 - 268
  • [5] A multilingual approach to multilingual information retrieval
    Nie, JY
    Jin, F
    ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 101 - 110
  • [6] Multilingual information access system using cross-language information retrieval
    Hayashi, Yoshihiko
    Matsuo, Yoshihiro
    Nagata, Masaaki
    Furuse, Osamu
    2003, Nippon Telegraph and Telephone Corp. (52):
  • [7] Fairness in Information Retrieval
    Lipani, Aldo
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1171 - 1171
  • [8] Distillation for Multilingual Information Retrieval
    Yang, Eugene
    Lawrie, Dawn
    Mayfield, James
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2368 - 2373
  • [9] Multilingual information retrieval system
    Hong, Z
    Syin, C
    Lia, KF
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS, 1996, 2916 : 33 - 44
  • [10] Information Retrieval in Multilingual Environment
    Chaware, S. M.
    Rao, Srikantha
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 198 - +