Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization

被引:7
|
作者
Cai, Xiaoyan [1 ]
Li, Wenjie [2 ]
Zhang, Renxian [3 ]
机构
[1] Northwest Agr & Forestry Univ, Coll Informat Engn, Yangling, Shaanxi, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
基金
中国国家自然科学基金;
关键词
Document summarization; Information diversity; Information coverage; Subspace clustering;
D O I
10.1016/j.ins.2014.04.028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches are seldom targeted for both diversity and coverage of summaries, which are believed to be the two key issues to determine the quality of summaries. The focus of this work is to explore a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Given the fact that normally each topic can be described by a set of keywords and the choice of the keywords among the topics is topic-dependent, we take the advantage of the newly emerged subspace clustering to enable the flexibility of keyword selection and the improved quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization to pursue our targets. Experimental results on the DUC datasets demonstrate effectiveness and robustness of the proposed approach. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:764 / 775
页数:12
相关论文
共 50 条
  • [31] A Clustering-Based Coverage Path Planning Method for Autonomous Heterogeneous UAVs
    Chen, Jinchao
    Du, Chenglie
    Zhang, Ying
    Han, Pengcheng
    Wei, Wei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25546 - 25556
  • [32] Enhancing Document Clustering through Heuristics and Summary-Based Pre-processing
    Allamraju, Sri Harsha
    Chun, Robert
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION AND INTERACTION, PT II, 2009, 5618 : 105 - 113
  • [33] Anchor-Based Multiview Subspace Clustering With Diversity Regularization
    Ou, Qiyuan
    Wang, Siwei
    Zhou, Sihang
    Li, Miaomiao
    Guo, Xifeng
    Zhu, En
    IEEE MULTIMEDIA, 2020, 27 (04) : 91 - 101
  • [34] Enhancing clustering of trajectories through optimization of geometric features
    Shivanasab, Pooya
    Abbaspour, Rahim Ali
    Chehreghan, Alireza
    EARTH SCIENCE INFORMATICS, 2025, 18 (03)
  • [35] Statistical semantics for enhancing document clustering
    Ahmed K. Farahat
    Mohamed S. Kamel
    Knowledge and Information Systems, 2011, 28 : 365 - 393
  • [36] Statistical semantics for enhancing document clustering
    Farahat, Ahmed K.
    Kamel, Mohamed S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (02) : 365 - 393
  • [37] Proximal Optimization for Fuzzy Subspace Clustering
    Guillon, Arthur
    Lesot, Marie-Jeanne
    Marsala, Christophe
    Pal, Nikhil R.
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, IPMU 2016, PT I, 2016, 610 : 675 - 686
  • [38] A Clustering-based Recommendation System
    Wu, Shaofei
    PROCEEDINGS OF 2008 INTERNATIONAL PRE-OLYMPIC CONGRESS ON COMPUTER SCIENCE, VOL I: COMPUTER SCIENCE AND ENGINEERING, 2008, : 328 - 330
  • [39] Density clustering-based optimization model for trajectory data publication
    Zhang, Qian
    Zhang, Xing
    Chu, Zhiguang
    Li, Xiang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [40] Analysis and Clustering-Based Improvement of Particle Filter Optimization Algorithms
    Kenyeres, Eva
    Abonyi, Janos
    IEEE ACCESS, 2024, 12 : 55600 - 55619