Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization

Cited by: 7
Authors
Cai, Xiaoyan [1]
Li, Wenjie [2]
Zhang, Renxian [3]
Affiliations
[1] Northwest Agr & Forestry Univ, Coll Informat Engn, Yangling, Shaanxi, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Document summarization; Information diversity; Information coverage; Subspace clustering
DOI
10.1016/j.ins.2014.04.028
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches seldom target both the diversity and the coverage of summaries, which are believed to be the two key factors that determine summary quality. This work explores a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Because each topic can normally be described by a set of keywords, and the choice of keywords is topic-dependent, we take advantage of recently emerged subspace clustering techniques to allow flexible keyword selection and to improve the quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization, to pursue these targets. Experimental results on the DUC datasets demonstrate the effectiveness and robustness of the proposed approach. © 2014 Elsevier Inc. All rights reserved.
Pages: 764 - 775
Page count: 12
Related papers
50 in total
  • [41] Clustering-Based Federated Learning for Enhancing Data Privacy in Internet of Vehicles
    Jin, Zilong
    Wang, Jin
    Zhang, Lejun
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (06) : 1462 - 1477
  • [42] Clustering-based feature selection
    School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
    Tien Tzu Hsueh Pao, 2008, SUPPL. : 157 - 160
  • [43] A clustering-based fuzzy classifier
    Drummond, Isabela
    Sandri, Sandra
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2005, 131 : 247 - 254
  • [44] Exploration of Compiler Optimization Sequences Using Clustering-Based Selection
    Martins, Luiz G. A.
    Nobre, Ricardo
    Delbem, Alexandre C. B.
    Marques, Eduardo
    Cardoso, Joao M. P.
    ACM SIGPLAN NOTICES, 2014, 49 (05) : 63 - 72
  • [45] Clustering-Based Particle Swarm Optimization for Electrical Impedance Imaging
    Hu, Gang
    Chen, Min-you
    He, Wei
    Zhai, Jin-qian
    ADVANCES IN SWARM INTELLIGENCE, PT I, 2011, 6728 : 165 - 171
  • [46] Spectral Clustering-based Classification
    Owhadi-Kareshk, Moein
    Akbarzadeh-T, Mohammad-R
    2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2015 : 222 - 227
  • [47] Clustering-Based Selection for Evolutionary Many-Objective Optimization
    Denysiuk, Roman
    Costa, Lino
    Santo, Isabel Espirito
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XIII, 2014, 8672 : 538 - 547
  • [48] On Cloud Storage Optimization of Blockchain With a Clustering-Based Genetic Algorithm
    Xu, Mengtian
    Feng, Guorui
    Ren, Yanli
    Zhang, Xinpeng
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (09) : 8547 - 8558
  • [49] Fuzzy Clustering-Based Filter
    Coletta, Luiz F. S.
    Hruschka, Eduardo R.
    Covoes, Thiago F.
    Campello, Ricardo J. G. B.
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND METHODS, PT 1, 2010, 80 : 406 - 415
  • [50] Clustering-based microcode compression
    Borin, Edson
    Breternitz, Mauricio, Jr.
    Wu, Youfeng
    Araujo, Guido
    PROCEEDINGS 2006 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2007 : 189 - +