Multi-document summarization using probabilistic topic-based network models

被引:0
|
作者
机构
[1] Yang, Cheng-Zen
[2] Fan, Jhih-Shang
[3] Liu, Yu-Fan
来源
| 1613年 / Institute of Information Science卷 / 32期
关键词
Integrated approach - Multi-document summarization - Network models - Performance evaluation - Probabilistic topic models - Research domains - Text summarization - Topic model;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-document summarization has obtained much attention in the research domain of text summarization. In the past, probabilistic topic models and network models have been leveraged to generate summaries. However, previous studies do not investigate different combinations of various topic models and network models. This paper describes an integrated approach considering both probabilistic topic models and network models. Two probabilistic topic models and four network models are investigated. We have conducted experiments to evaluate the effectiveness of the proposed approach with the DUC 2004-2007 datasets and make a systematic comparison between two representative topic models, PLSA and LDA. The results show that the PLSA-based network approach outperforms the TF-IDF baseline on all datasets. Moreover, PLSA has better ROUGE performance than LDA for multi-document summarization. © 2016, Institute of Information Science. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [1] Multi-document Summarization using Probabilistic Topic-based Network Models
    Yang, Cheng-Zen
    Fan, Jhih-Shang
    Liu, Yu-Fan
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (06) : 1613 - 1634
  • [2] Using Topic Themes for Multi-Document Summarization
    Harabagiu, Sanda
    Lacatusu, Finley
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (03)
  • [3] Multi-document summarization using discourse models
    Cardoso, Paula C. F.
    Pardo, Thiago A. S.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2016, (56): : 57 - 64
  • [4] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172
  • [5] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1089 - 1094
  • [6] Identification of Event and Topic for Multi-document Summarization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    Matsuyoshi, Suguru
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2016, 9561 : 304 - 316
  • [7] Tiered sentence based topic model for multi-document summarization
    Akhtar, Nadeem
    Beg, M. M. Sufyan
    Javed, Hira
    Hussain, Md Muzakkir
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (08): : 2131 - 2141
  • [8] Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures
    Bairi, Ramakrishna B.
    Iyer, Rishabh
    Ramakrishnan, Ganesh
    Bilmes, Jeff
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 553 - 563
  • [9] Research On Multi-document Summarization Based On LDA Topic Model
    Bian, Jinqiang
    Jiang, Zengru
    Chen, Qian
    2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 2, 2014, : 113 - 116
  • [10] A New Automatic Multi-document Text Summarization using Topic Modeling
    Roul, Rajendra Kumar
    Mehrotra, Samarth
    Pungaliya, Yash
    Sahoo, Jajati Keshari
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2019, 2019, 11319 : 212 - 221