CAKES: Cross-lingual Wikipedia Knowledge Enrichment and Summarization

Authors
Fionda, Valeria [1]
Pirro, Giuseppe [1]
Affiliations
[1] Free Univ Bolzano Bozen, Bolzano, Italy
DOI
10.3233/978-1-61499-098-7-901
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Wikipedia is a huge source of multilingual knowledge curated by human contributors. Wiki articles are written independently in the various languages and may cover different perspectives on a given subject. The aim of this paper is to exploit Wikipedia's multilingual information for knowledge enrichment and summarization. Investigating the link structure of a Wiki article in a source language and comparing it with the structure of articles about the same subject written in other languages gives insight into the body of knowledge shared among languages. This investigation is also useful for identifying knowledge perspectives that are not covered in the source language but are covered in other languages. We implemented these ideas in CAKES, which: i) exploits Wikipedia information on the fly without requiring any data preprocessing; ii) lets users specify the set of languages to be considered; and iii) ranks subjects of interest for a given article on the basis of their popularity among languages.
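The comparison and ranking idea in the abstract can be illustrated with a minimal sketch. This is not the CAKES implementation. It assumes the outgoing links of each language edition have already been fetched and mapped to language-independent subject names, and it reads "popularity among languages" as the number of other language editions that link to a subject. The function names and the sample data are hypothetical.

```python
from collections import Counter


def shared_subjects(links_by_language):
    """Return the subjects linked from the article in every language edition."""
    link_sets = list(links_by_language.values())
    return set.intersection(*link_sets) if link_sets else set()


def rank_missing_subjects(links_by_language, source_language):
    """Rank subjects absent from the source edition by how many other
    language editions link to them (assumed popularity measure)."""
    source_links = links_by_language[source_language]
    counts = Counter()
    for language, links in links_by_language.items():
        if language == source_language:
            continue
        # Count each subject once per language edition that links to it.
        counts.update(links - source_links)
    # Most widely covered subjects first; ties broken alphabetically.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical link sets for one article in three language editions.
links = {
    "en": {"Rome", "Pasta", "Renaissance"},
    "it": {"Rome", "Pasta", "Opera", "Dante"},
    "de": {"Rome", "Opera", "Renaissance"},
}

print(shared_subjects(links))
print(rank_missing_subjects(links, "en"))
```

In this toy data, "Rome" is the only subject shared by all three editions, and "Opera" ranks above "Dante" as a candidate for enriching the English article because two other editions link to it rather than one.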
Pages: 901-902
Number of pages: 2