Language Fairness in Multilingual Information Retrieval

被引:0
|
作者
Yang, Eugene [1 ]
Janich, Thomas [2 ]
Mayfield, James [1 ]
Lawrie, Dawn [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
language fairness; multilingual retrieval; statistical testing;
D O I
10.1145/3626772.3657943
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilingual information retrieval (MLIR) considers the problem of ranking documents in several languages for a query expressed in a language that may differ from any of those languages. Recent work has observed that approaches such as combining ranked lists representing a single document language each or using multilingual pretrained language models demonstrate a preference for one language over others. This results in systematic unfair treatment of documents in different languages. This work proposes a language fairness metric to evaluate whether documents across different languages are fairly ranked through statistical equivalence testing using the Kruskal-Wallis test. In contrast to most prior work in group fairness, we do not consider any language to be an unprotected group. Thus our proposed measure, PEER (Probability of Equal Expected Rank), is the first fairness metric specifically designed to capture the language fairness of MLIR systems. We demonstrate the behavior of PEER on artificial ranked lists. We also evaluate real MLIR systems on two publicly available benchmarks and show that the PEER scores align with prior analytical findings on MLIR fairness. Our implementation is compatible with ir-measures and is available at http://github.com/hltcoe/peer_measure.
引用
收藏
页码:2487 / 2491
页数:5
相关论文
共 50 条
  • [31] Automatic processing of multilingual medical terminology:: applications to thesaurus enrichment and cross-language information retrieval
    Déjean, H
    Gaussier, E
    Renders, JM
    Sadat, F
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) : 111 - 124
  • [32] Cross-Lingual Information Retrieval from Multilingual Construction Documents Using Pretrained Language Models
    Kim, Jungyeon
    Chung, Sehwan
    Chi, Seokho
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2024, 150 (06)
  • [33] Using Web resources to construct multilingual medical thesaurus for cross-language medical information retrieval
    Lu, Wen-Hsiang
    Lin, Ray S.
    Chan, Yi-Che
    Chen, Kuan-Hsi
    DECISION SUPPORT SYSTEMS, 2008, 45 (03) : 585 - 595
  • [34] Multilingual single document keyword extraction for information retrieval
    Bracewell, DB
    Ren, FJ
    Kuriowa, S
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 517 - 522
  • [35] Multilingual information retrieval using English and Chinese queries
    Chen, AT
    EVLAUATION OF CROSS-LANGUAGE INFORMATION RETRIEVAL SYSTEMS, 2002, 2406 : 44 - 58
  • [36] Multilingual Information Retrieval in Thoracic Radiology: Feasibility Study
    Castilla, Andre Coutinho
    Furuie, Sergio Shiguemi
    Mendonca, Eneida A.
    MEDINFO 2007: PROCEEDINGS OF THE 12TH WORLD CONGRESS ON HEALTH (MEDICAL) INFORMATICS, PTS 1 AND 2: BUILDING SUSTAINABLE HEALTH SYSTEMS, 2007, 129 : 387 - +
  • [37] An Information Retrieval Based Approach for Multilingual Ontology Matching
    Rexha, Andi
    Dragoni, Mauro
    Kern, Roman
    Kroell, Mark
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2016, 2016, 9612 : 433 - 439
  • [38] An Exploration of Users' Needs for Multilingual Information Retrieval and Access
    Vassilakaki, Evgenia
    Garoufallou, Emmanouel
    Johnson, Frances
    Hartley, R. J.
    METADATA AND SEMANTICS RESEARCH, MTSR 2015, 2015, 544 : 249 - 258
  • [39] Multilingual and multimedia Information Retrieval from Web documents
    Gatius, M
    Bertran, M
    Rodriguez, H
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 20 - 24
  • [40] SyDoM: A multilingual information retrieval system for digital libraries
    Roussey, C
    Calabretto, S
    Pinon, JM
    ELECTRONIC PUBLISHING '01, CONFERENCE PROCEEDINGS: 2001 IN THE DIGITAL PUBLISHING ODYSSEY, 2001, : 150 - 164