CAKES: Cross-lingual Wikipedia Knowledge Enrichment and Summarization

Authors
Fionda, Valeria [1]
Pirro, Giuseppe [1]
Affiliations
[1] Free Univ Bolzano Bozen, Bolzano, Italy
DOI
10.3233/978-1-61499-098-7-901
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Wikipedia is a large source of multilingual knowledge curated by human contributors. Wikipedia articles are written independently in each language and may offer different perspectives on a given subject. This paper aims to exploit Wikipedia's multilingual information for knowledge enrichment and summarization. Comparing the link structure of an article in a source language with that of articles on the same subject in other languages gives insight into the body of knowledge shared among languages. The comparison also helps identify knowledge perspectives that are covered in other languages but not in the source language. We implemented these ideas in CAKES, which: i) exploits Wikipedia information on the fly, without requiring any data preprocessing; ii) lets the user specify the set of languages to be considered; and iii) ranks subjects of interest for a given article on the basis of their popularity among languages.
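The comparison and ranking described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it assumes that the outgoing links of each language edition have already been fetched and mapped to shared, language-independent subject identifiers, and that "popularity among languages" means the number of editions linking a subject. The function name `rank_subjects` and the toy data are hypothetical.

```python
from collections import Counter


def rank_subjects(links_by_lang, source_lang):
    """Compare link sets across languages (illustrative sketch only).

    links_by_lang: dict mapping a language code to the set of subjects
        linked from that edition of the article, already normalized to
        shared identifiers (an assumption of this sketch).
    source_lang: language code of the source article.

    Returns (shared, ranked_missing, popularity).
    """
    source_links = links_by_lang[source_lang]

    # Popularity = number of language editions that link to the subject.
    popularity = Counter()
    for links in links_by_lang.values():
        popularity.update(set(links))

    # Subjects linked from every considered edition.
    n_langs = len(links_by_lang)
    shared = {s for s, count in popularity.items() if count == n_langs}

    # Subjects absent from the source edition, most popular first;
    # ties are broken alphabetically to keep the order deterministic.
    missing = [s for s in popularity if s not in source_links]
    ranked_missing = sorted(missing, key=lambda s: (-popularity[s], s))

    return shared, ranked_missing, popularity


# Toy data, invented purely for illustration.
links = {
    "en": {"Rome", "Pasta", "Renaissance"},
    "it": {"Rome", "Pasta", "Dante", "Opera"},
    "de": {"Rome", "Dante", "Renaissance"},
}
shared, ranked_missing, popularity = rank_subjects(links, "en")
print(shared)          # subjects common to all three editions
print(ranked_missing)  # enrichment candidates for the English article
```

Restricting `links_by_lang` to a subset of languages corresponds to point ii) of the abstract, where the set of languages to consider is chosen by the user.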
Pages: 901-902
Number of pages: 2