SOAP: Soaking Capacity Optimization for Multi-Document Summarization

被引:0
|
作者
Wang, Kexiang [1 ]
Chang, Baobao [1 ]
Sui, Zhifang [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Computat Linguist, Minist Educ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
soaking capacity optimization; multi-document summarization;
D O I
10.1145/3340531.3411909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-document summarization (MDS) aims at giving a brief summary for a cluster of related documents. In this paper, we consider the MDS task as an optimization problem with a novel measure named soaking capacity being the objective function. The origin of our method is the classic hypothesis: the summary components are the sinks of information diffusion. We point out that the hypothesis only gives the role of summary but does not cover how well a summary acts as this role. To fill in the gap, soaking capacity is formally defined to quantify the ability of summary to soak up information. We explicitly demonstrate its fitness as an indicator for both the saliency and the diversity goal of MDS. For solving the optimization problem, we propose a greedy algorithm named SOAP by adopting a surrogate of soaking capacity to accelerate the computation. Experiments on MDS datasets across various domains show the great potential of SOAP as compared with the state-of-the-art MDS systems.
引用
收藏
页码:1525 / 1534
页数:10
相关论文
共 50 条
  • [21] An Optimization Algorithm for Extractive Multi-document Summarization Based on Association of Sentences
    Chen, Chun-Hao
    Yang, Yi-Chen
    Lin, Jerry Chun-Wei
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 460 - 469
  • [22] Multi-document summarization based on lexical chains
    Chen, YM
    Wang, XL
    Liu, BQ
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1937 - 1942
  • [23] Unsupervised Multi-document Summarization with Holistic Inference
    Zhang, Haopeng
    Cho, Sangwoo
    Song, Kaiqiang
    Wang, Xiaoyang
    Wang, Hongwei
    Zhang, Jiawei
    Yu, Dong
    13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 123 - 133
  • [24] Automatic multi-document summarization for digital libraries
    Ou Shiyan
    Khoo, Christopher S. G.
    Goh, Dion H.
    PROCEEDINGS OF THE ASIA-PACIFIC CONFERENCE ON LIBRARY & INFORMATION EDUCATION & PRACTICE 2006: PREPARING INFORMATION PROFESSIONALS FOR LEADERSHIP IN THE NEW AGE, 2006, : 72 - +
  • [25] Multi-document summarization for terrorism information extraction
    Wang, Fu Lee
    Yang, Christopher C.
    Shi, Xiaodong
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 602 - 608
  • [26] Multi-document summarization using closed patterns
    Qiang, Ji-Peng
    Chen, Ping
    Ding, Wei
    Xie, Fei
    Wu, Xindong
    KNOWLEDGE-BASED SYSTEMS, 2016, 99 : 28 - 38
  • [27] Enhancing multi-document summarization using concepts
    Rao, Pattabhi R. K.
    Devi, S. Lalitha
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2018, 43 (02):
  • [28] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172
  • [29] Disentangling Specificity for Abstractive Multi-document Summarization
    Ma, Congbo (congbo.ma@mq.edu.au), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [30] Genetic algorithm based multi-document summarization
    Liu, Dexi
    He, Yanxiang
    Ji, Donghong
    Yang, Hua
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1140 - 1144