Evaluating Factuality in Cross-lingual Summarization

被引:0
|
作者
Gao, Mingqi [1 ,2 ,3 ]
Wang, Wenqing [4 ]
Wan, Xiaojun [1 ,2 ,3 ]
Xu, Yuemei [4 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Ctr Data Sci, Beijing, Peoples R China
[3] Peking Univ, MOE Key Lab Computat Linguist, Beijing, Peoples R China
[4] Beijing Foreign Studies Univ, Sch Informat Sci & Technol, Beijing, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Cross-lingual summarization aims to help people efficiently grasp the core idea of the document written in a foreign language. Modern text summarization models generate highly fluent but often factually inconsistent outputs, which has received heightened attention in recent research. However, the factual consistency of cross-lingual summarization has not been investigated yet. In this paper, we propose a cross-lingual factuality dataset by collecting human annotations of reference summaries as well as generated summaries from models at both summary level and sentence level. Furthermore, we perform the fine-grained analysis and observe that over 50% of generated summaries and over 27% of reference summaries contain factual errors with characteristics different from mono-lingual summarization. Existing evaluation metrics for monolingual summarization require translation to evaluate the factuality of cross-lingual summarization and perform differently at different tasks and levels. Finally, we adapt the monolingual factuality metrics as an initial step towards the automatic evaluation of summarization factuality in cross-lingual settings. Our dataset and code are available at https: //github.com/kite99520/Fact_CLS.
引用
收藏
页码:12415 / 12431
页数:17
相关论文
共 50 条
  • [41] Cross-lingual training of summarization systems using annotated corpora in a foreign language
    Marina Litvak
    Mark Last
    Information Retrieval, 2013, 16 : 629 - 656
  • [42] Cross-lingual training of summarization systems using annotated corpora in a foreign language
    Litvak, Marina
    Last, Mark
    INFORMATION RETRIEVAL, 2013, 16 (05): : 629 - 656
  • [43] Towards Making the Most of Knowledge Across Languages for Multimodal Cross-Lingual Summarization
    Shi, Xiaorui
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 424 - 438
  • [44] Multi-path Based Self-adaptive Cross-lingual Summarization
    Bao, Zhongtian
    Wang, Jun
    Yang, Zhenglu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2023, 2023, 14119 : 282 - 294
  • [45] Evaluating cross-lingual textual similarity on dictionary alignment problem
    Sever, Yigit
    Ercan, Gonenc
    LANGUAGE RESOURCES AND EVALUATION, 2020, 54 (04) : 1059 - 1078
  • [46] Evaluating cross-lingual textual similarity on dictionary alignment problem
    Yiğit Sever
    Gönenç Ercan
    Language Resources and Evaluation, 2020, 54 : 1059 - 1078
  • [47] XTREME-S: Evaluating Cross-lingual Speech Representations
    Conneau, Alexis
    Bapna, Ankur
    Zhang, Yu
    Ma, Min
    von Platen, Patrick
    Lozhkov, Anton
    Cherry, Colin
    Jia, Ye
    Rivera, Clara
    Kale, Mihir
    Van Esch, Daan
    Axelrod, Vera
    Khanuja, Simran
    Clark, Jonathan H.
    Firat, Orhan
    Auli, Michael
    Ruder, Sebastian
    Riesa, Jason
    Johnson, Melvin
    INTERSPEECH 2022, 2022, : 3248 - 3252
  • [48] Evaluating Sub-word embeddings in cross-lingual models
    Parizi, Ali Hakimi
    Cook, Paul
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2712 - 2719
  • [49] Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification
    Wu, Hanqian
    Wang, Zhike
    Qing, Feng
    Li, Shoushan
    ELECTRONICS, 2021, 10 (03) : 1 - 14
  • [50] FACTGRAPH: Evaluating Factuality in Summarization with Semantic Graph Representations
    Ribeiro, Leonardo F. R.
    Liu, Mengwen
    Gurevych, Iryna
    Dreyer, Markus
    Bansal, Mohit
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3238 - 3253