SOAP: Soaking Capacity Optimization for Multi-Document Summarization

被引:0
|
作者
Wang, Kexiang [1 ]
Chang, Baobao [1 ]
Sui, Zhifang [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Computat Linguist, Minist Educ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
soaking capacity optimization; multi-document summarization;
D O I
10.1145/3340531.3411909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-document summarization (MDS) aims at giving a brief summary for a cluster of related documents. In this paper, we consider the MDS task as an optimization problem with a novel measure named soaking capacity being the objective function. The origin of our method is the classic hypothesis: the summary components are the sinks of information diffusion. We point out that the hypothesis only gives the role of summary but does not cover how well a summary acts as this role. To fill in the gap, soaking capacity is formally defined to quantify the ability of summary to soak up information. We explicitly demonstrate its fitness as an indicator for both the saliency and the diversity goal of MDS. For solving the optimization problem, we propose a greedy algorithm named SOAP by adopting a surrogate of soaking capacity to accelerate the computation. Experiments on MDS datasets across various domains show the great potential of SOAP as compared with the state-of-the-art MDS systems.
引用
收藏
页码:1525 / 1534
页数:10
相关论文
共 50 条
  • [41] Identification of Event and Topic for Multi-document Summarization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    Matsuyoshi, Suguru
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2016, 9561 : 304 - 316
  • [42] Text Summarization as a Multi-objective Optimization Task: Applying Harmony Search to Extractive Multi-Document Summarization
    Bidoki, M.
    Fakhrahmad, M.
    Moosavi, M. R.
    COMPUTER JOURNAL, 2022, 65 (05): : 1053 - 1072
  • [43] A New Approach for Multi-Document Update Summarization
    龙翀
    黄民烈
    朱小燕
    李明
    JournalofComputerScience&Technology, 2010, 25 (04) : 739 - 749
  • [44] Multi-document summarization based on the Yago ontology
    Baralis, Elena
    Cagliero, Luca
    Jabeen, Saima
    Fiori, Alessandro
    Shah, Sajid
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (17) : 6976 - 6984
  • [45] A Hybrid Hierarchical Model for Multi-Document Summarization
    Celikyilmaz, Asli
    Hakkani-Tur, Dilek
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 815 - 824
  • [46] SUBTOPIC-BASED MULTI-DOCUMENT SUMMARIZATION
    Dai, Lin
    Tang, Ji-Liang
    Xia, Yun-Qing
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 3505 - +
  • [47] Multi-document summarization based on concept space
    Tang, STK
    Yen, J
    Yang, CC
    ITRE2003: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, 2003, : 385 - 389
  • [48] Multi-document summarization as applied in information retrieval
    Zhou, Dan
    Li, Lei
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 203 - +
  • [49] Multi-document Summarization Based on Sentence Clustering
    Zheng, Hai-Tao
    Gong, Shu-Qin
    Chen, Hao
    Jiang, Yong
    Xia, Shu-Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 429 - 436
  • [50] Multi-document summarization based on cohesion with disambiguation
    Chen, Yanmin
    Lou, Xizhong
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 232 - +