CLUSTERING TECHNIQUES AND DISCRETE PARTICLE SWARM OPTIMIZATION ALGORITHM FOR MULTI-DOCUMENT SUMMARIZATION

被引:31
|
作者
Aliguliyev, Ramiz M. [1 ]
机构
[1] Natl Acad Sci, Inst Informat Technol, Dept 13, AZ-1141 Baku, Azerbaijan
关键词
text mining; sentence clustering; generic multi-document summarization; sentence extractive technique; discrete Particle Swarm Optimization algorithm; TEXT; SENTENCES; LEXRANK;
D O I
10.1111/j.1467-8640.2010.00365.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-document summarization is a process of automatic creation of a compressed version of a given collection of documents that provides useful information to users. In this article we propose a generic multi-document summarization method based on sentence clustering. We introduce five clustering methods, which optimize various aspects of intra-cluster similarity, inter-cluster dissimilarity and their combinations. To solve the clustering problem a modification of discrete particle swarm optimization algorithm has been proposed. The experimental results on open benchmark data sets from DUC2005 and DUC2007 show that our method significantly outperforms the baseline methods for multi-document summarization.
引用
收藏
页码:420 / 448
页数:29
相关论文
共 50 条
  • [21] Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization
    Cai, Xiaoyan
    Li, Wenjie
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1424 - 1433
  • [22] Topic-Sensitive Multi-document Summarization Algorithm
    Liu Na
    Tang Xiao-jun
    Lu Ying
    Li Ming-xia
    Wang Hai-wen
    Xiao Peng
    2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 69 - 74
  • [23] Co-clustering Sentences and Terms for Multi-document Summarization
    Xia, Yunqing
    Zhang, Yonggang
    Yao, Jianmin
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 339 - +
  • [24] SOAP: Soaking Capacity Optimization for Multi-Document Summarization
    Wang, Kexiang
    Chang, Baobao
    Sui, Zhifang
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1525 - 1534
  • [25] MULTI-DOCUMENT VIDEO SUMMARIZATION
    Wang, Feng
    Merialdo, Bernard
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1326 - 1329
  • [26] On redundancy in multi-document summarization
    Calvo, Hiram
    Carrillo-Mendoza, Pabel
    Gelbukh, Alexander
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3245 - 3255
  • [27] Abstractive Multi-Document Summarization
    Ranjitha, N. S.
    Kallimani, Jagadish S.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1690 - 1693
  • [28] Text document clustering using Spectral Clustering algorithm with Particle Swarm Optimization
    Janani, R.
    Vijayarani, S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 134 : 192 - 200
  • [29] Multi-document Summarization via Deep Learning Techniques: A Survey
    Ma, Congbo
    Zhang, Wei Emma
    Guo, Mingyu
    Wang, Hu
    Sheng, Quan Z.
    ACM COMPUTING SURVEYS, 2023, 55 (05)
  • [30] Research on sentence optimum selection algorithm for multi-document summarization
    Zhang, Shu
    Zhao, Tie-Jun
    Yao, Chao
    Zheng, De-Quan
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2008, 30 (12): : 2921 - 2925