Identification of Event and Topic for Multi-document Summarization

被引:1
|
作者
Fukumoto, Fumiyo [1 ]
Suzuki, Yoshimi [1 ]
Takasu, Atsuhiro [2 ]
Matsuyoshi, Suguru [3 ]
机构
[1] Univ Yamanashi, Grad Fac Interdisciplinary Res, Kofu, Yamanashi 4008510, Japan
[2] Natl Inst Informat, Tokyo, Japan
[3] Univ Yamanashi, Interdisciplinary Grad Sch Med & Engn, Kofu, Yamanashi, Japan
关键词
Latent Dirichlet Allocation; Moving Average Convergence/Divergence; Multi-document summarization;
D O I
10.1007/978-3-319-43808-5_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on continuous news documents and presents a method for extractive multi-document summarization. Our hypothesis about salient, key sentences in news documents is that they include words related to the target event and topic of a document. Here, an event and a topic are the same as Topic Detection and Tracking (TDT) project: an event is something that occurs at a specific place and time along with all necessary preconditions and unavoidable consequences, and a topic is defined to be "a seminal event or activity along with all directly related events and activities." The difficulty for finding topics is that they have various word distributions. In addition to the TF-IDF term weighting method to extract event words, we identified topics by using two models, i. e., Moving Average Convergence Divergence (MACD) for words with high frequencies, and Latent Dirichlet Allocation (LDA) for low frequency words. The method was tested on two datasets, NTCIR-3 Japanese news documents and DUC data, and the results showed the effectiveness of the method.
引用
收藏
页码:304 / 316
页数:13
相关论文
共 50 条
  • [21] Two-phase Multi-document Event Summarization on Core Event Graphs
    Chen, Zengjian
    Xu, Jin
    Liao, Meng
    Xue, Tong
    He, Kun
    Journal of Artificial Intelligence Research, 2022, 74 : 1037 - 1057
  • [22] Two-phase Multi-document Event Summarization on Core Event Graphs
    Chen, Zengjian
    Xu, Jin
    Liao, Meng
    Xue, Tong
    He, Kun
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1037 - 1057
  • [23] A Novel Contextual Topic Model for Query-focused Multi-document Summarization
    Yang, Guangbing
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 576 - 583
  • [24] Multi-document summarization using probabilistic topic-based network models
    1613, Institute of Information Science (32):
  • [25] Using cross-document random walks for topic-focused multi-document summarization
    Wan, Xiaojun
    Yang, Jianwu
    Xiao, Jianguo
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 1012 - +
  • [26] Topic modeling combined with classification technique for extractive multi-document text summarization
    Rajendra Kumar Roul
    Soft Computing, 2021, 25 : 1113 - 1127
  • [27] Multi-document Summarization using Probabilistic Topic-based Network Models
    Yang, Cheng-Zen
    Fan, Jhih-Shang
    Liu, Yu-Fan
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (06) : 1613 - 1634
  • [28] Topic modeling combined with classification technique for extractive multi-document text summarization
    Roul, Rajendra Kumar
    SOFT COMPUTING, 2021, 25 (02) : 1113 - 1127
  • [29] Extracting main content of a topic on online social network by multi-document summarization
    Liu, Chunyan
    Zhu, Conghui
    Zhao, Tiejun
    Zheng, Dequan
    PROCEEDINGS OF THE 2012 EIGHTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2012), 2012, : 52 - 55
  • [30] Topic Oriented Multi-document Summarization Using LSA, Syntactic and Semantic Features
    Anjaneyulu, M.
    Sarma, S. S. V. N.
    Reddy, P. Vijaya Pal
    Chander, K. Prem
    Nagaprasad, S.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 487 - 502