SOAP: Soaking Capacity Optimization for Multi-Document Summarization

被引:0
|
作者
Wang, Kexiang [1 ]
Chang, Baobao [1 ]
Sui, Zhifang [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Computat Linguist, Minist Educ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
soaking capacity optimization; multi-document summarization;
D O I
10.1145/3340531.3411909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-document summarization (MDS) aims at giving a brief summary for a cluster of related documents. In this paper, we consider the MDS task as an optimization problem with a novel measure named soaking capacity being the objective function. The origin of our method is the classic hypothesis: the summary components are the sinks of information diffusion. We point out that the hypothesis only gives the role of summary but does not cover how well a summary acts as this role. To fill in the gap, soaking capacity is formally defined to quantify the ability of summary to soak up information. We explicitly demonstrate its fitness as an indicator for both the saliency and the diversity goal of MDS. For solving the optimization problem, we propose a greedy algorithm named SOAP by adopting a surrogate of soaking capacity to accelerate the computation. Experiments on MDS datasets across various domains show the great potential of SOAP as compared with the state-of-the-art MDS systems.
引用
收藏
页码:1525 / 1534
页数:10
相关论文
共 50 条
  • [31] A Game Theory Approach for Multi-document Summarization
    Ahmad, Amreen
    Ahmad, Tanvir
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (04) : 3655 - 3667
  • [32] Multi-document summarization based on unsupervised clustering
    Ji, Paul
    INFORMATION RETRIEVAL TECHNOLOLGY, PROCEEDINGS, 2006, 4182 : 560 - 566
  • [33] Geodesic Distance based Multi-document Summarization
    Ma, Huifang
    He, Qing
    Shi, Zhongzhi
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 54 - 59
  • [34] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1089 - 1094
  • [35] MRS for multi-document summarization by sentence extraction
    Yong-Dong Xu
    Xiao-Dong Zhang
    Guang-Ri Quan
    Ya-Dong Wang
    Telecommunication Systems, 2013, 53 : 91 - 98
  • [36] Personalized Multi-Document Summarization in information retrieval
    Yang, Xiao-Peng
    Liu, Xiao-Rong
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 4108 - +
  • [37] Multi-document Summarization using Tensor Decomposition
    Litvak, Marina
    Vanetik, Natalia
    COMPUTACION Y SISTEMAS, 2014, 18 (03): : 581 - 589
  • [38] Multi-document Summarization for E-Learning
    Wang, Fu Lee
    Kwan, Reggie
    Hung, Sheung Lun
    HYBRID LEARNING AND EDUCATION, PROCEEDINGS, 2009, 5685 : 353 - +
  • [39] Enhancing multi-document summarization using concepts
    Pattabhi R K Rao
    S Lalitha Devi
    Sādhanā, 2018, 43
  • [40] A New Approach for Multi-Document Update Summarization
    Chong Long
    Min-Lie Huang
    Xiao-Yan Zhu
    Ming Li
    Journal of Computer Science and Technology, 2010, 25 : 739 - 749