Evaluating Factuality in Cross-lingual Summarization

被引:0
|
作者
Gao, Mingqi [1 ,2 ,3 ]
Wang, Wenqing [4 ]
Wan, Xiaojun [1 ,2 ,3 ]
Xu, Yuemei [4 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Ctr Data Sci, Beijing, Peoples R China
[3] Peking Univ, MOE Key Lab Computat Linguist, Beijing, Peoples R China
[4] Beijing Foreign Studies Univ, Sch Informat Sci & Technol, Beijing, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Cross-lingual summarization aims to help people efficiently grasp the core idea of the document written in a foreign language. Modern text summarization models generate highly fluent but often factually inconsistent outputs, which has received heightened attention in recent research. However, the factual consistency of cross-lingual summarization has not been investigated yet. In this paper, we propose a cross-lingual factuality dataset by collecting human annotations of reference summaries as well as generated summaries from models at both summary level and sentence level. Furthermore, we perform the fine-grained analysis and observe that over 50% of generated summaries and over 27% of reference summaries contain factual errors with characteristics different from mono-lingual summarization. Existing evaluation metrics for monolingual summarization require translation to evaluate the factuality of cross-lingual summarization and perform differently at different tasks and levels. Finally, we adapt the monolingual factuality metrics as an initial step towards the automatic evaluation of summarization factuality in cross-lingual settings. Our dataset and code are available at https: //github.com/kite99520/Fact_CLS.
引用
收藏
页码:12415 / 12431
页数:17
相关论文
共 50 条
  • [31] X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents
    Takeshita, Sotaro
    Green, Tommaso
    Friedrich, Niklas
    Eckert, Kai
    Ponzetto, Simone Paolo
    2022 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2022,
  • [32] X-SCITLDR: Cross-lingual extreme summarization of scholarly documents
    Takeshita, Sotaro
    Green, Tommaso
    Friedrich, Niklas
    Eckert, Kai
    Der Medien, Hochschule
    Ponzetto, Simone Paolo
    Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2022,
  • [33] Evaluating and Modeling Attribution for Cross-Lingual Question Answering
    Muller, Benjamin
    Wieting, John
    Clark, Jonathan H.
    Kwiatkowski, Tom
    Ruder, Sebastian
    Soares, Livio Baldini
    Aharoni, Roee
    Herzig, Jonathan
    Wang, Xinyi
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 144 - 157
  • [34] RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
    Rettinger, Achim
    Zhang, Lei
    Berovic, Dasa
    Merkler, Danijela
    Srebacic, Matea
    Tadic, Marko
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4000 - 4003
  • [35] XCMRC: Evaluating Cross-Lingual Machine Reading Comprehension
    Liu, Pengyuan
    Deng, Yuning
    Zhu, Chenghao
    Hu, Han
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 552 - 564
  • [36] Evaluating Cross-lingual Semantic Annotation for Medical Forms
    Lin, Ying-Chi
    Christen, Victor
    Gross, Anika
    Kirsten, Toralf
    Cardoso, Silvio Domingos
    Pruski, Cedric
    Da Silveira, Marcos
    Rahm, Erhard
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 145 - 155
  • [37] CAR-Transformer: Cross-Attention Reinforcement Transformer for Cross-Lingual Summarization
    Cai, Yuang
    Yuan, Yuyu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17718 - 17726
  • [38] Dataset construction method of cross-lingual summarization based on filtering and text augmentation
    Pan H.
    Xi Y.
    Wang L.
    Nan Y.
    Su Z.
    Cao R.
    PeerJ Computer Science, 2023, 9
  • [39] Dataset construction method of cross-lingual summarization based on filtering and text augmentation
    Pan, Hangyu
    Xi, Yaoyi
    Wang, Ling
    Nan, Yu
    Su, Zhizhong
    Cao, Rong
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [40] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67