Identification of Event and Topic for Multi-document Summarization

Cited by: 1
Authors
Fukumoto, Fumiyo [1]
Suzuki, Yoshimi [1]
Takasu, Atsuhiro [2]
Matsuyoshi, Suguru [3]
Affiliations
[1] Univ Yamanashi, Grad Fac Interdisciplinary Res, Kofu, Yamanashi 4008510, Japan
[2] Natl Inst Informat, Tokyo, Japan
[3] Univ Yamanashi, Interdisciplinary Grad Sch Med & Engn, Kofu, Yamanashi, Japan
Keywords
Latent Dirichlet Allocation; Moving Average Convergence/Divergence; Multi-document summarization;
DOI
10.1007/978-3-319-43808-5_23
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper focuses on continuous news documents and presents a method for extractive multi-document summarization. Our hypothesis about salient, key sentences in news documents is that they include words related to the target event and topic of a document. Here, an event and a topic are defined as in the Topic Detection and Tracking (TDT) project: an event is something that occurs at a specific place and time along with all necessary preconditions and unavoidable consequences, and a topic is defined to be "a seminal event or activity along with all directly related events and activities." The difficulty in finding topics is that their word distributions vary. In addition to using the TF-IDF term weighting method to extract event words, we identified topics by using two models, i.e., Moving Average Convergence Divergence (MACD) for high-frequency words and Latent Dirichlet Allocation (LDA) for low-frequency words. The method was tested on two datasets, NTCIR-3 Japanese news documents and DUC data, and the results showed the effectiveness of the method.
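The abstract names MACD as the model applied to high-frequency words but gives no formula or parameters. As a rough illustration only, the sketch below computes the standard MACD histogram (short-span EMA minus long-span EMA, minus a signal-line EMA of that difference) over a word's per-period counts. The function names, the span values (4, 8, 5), and the `burst_score` aggregation are assumptions made for this example and are not taken from the paper.

```python
from typing import List, Sequence


def ema(values: Sequence[float], span: int) -> List[float]:
    """Exponential moving average with smoothing factor 2 / (span + 1)."""
    alpha = 2.0 / (span + 1)
    out: List[float] = []
    for v in values:
        # Seed with the first observation, then smooth recursively.
        out.append(float(v) if not out else alpha * v + (1 - alpha) * out[-1])
    return out


def macd_histogram(
    counts: Sequence[float],
    short: int = 4,   # assumed short span, not from the paper
    long: int = 8,    # assumed long span, not from the paper
    signal: int = 5,  # assumed signal span, not from the paper
) -> List[float]:
    """Standard MACD histogram: (short EMA - long EMA) minus its signal EMA."""
    short_ema = ema(counts, short)
    long_ema = ema(counts, long)
    macd = [s - l for s, l in zip(short_ema, long_ema)]
    signal_line = ema(macd, signal)
    return [m - s for m, s in zip(macd, signal_line)]


def burst_score(counts: Sequence[float]) -> float:
    """Hypothetical aggregate: sum of the positive histogram values."""
    return sum(h for h in macd_histogram(counts) if h > 0)


# Per-period counts of one word across a stream of news documents (toy data).
bursty = [0, 0, 0, 0, 5, 9, 4, 0, 0, 0]
flat = [3.0] * 10
print(burst_score(bursty), burst_score(flat))
```

A word whose counts rise sharply yields positive histogram values, while a word with a constant count yields a histogram of zeros. How the paper turns such a signal into topic words is not stated in the abstract.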
Pages: 304-316
Number of pages: 13
Related papers
50 in total
  • [41] Causal Maps for Multi-Document Summarization
    Strelnikoff, Sasha
    Jammalamadaka, Aruna
    Warmsley, Dana
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4437 - 4445
  • [42] Aspect Based Multi-Document Summarization
    Sahoo, Deepak
    Balabantaray, Rakesh
    Phukon, Mridumoni
    Saikia, Saibali
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 873 - 877
  • [43] MULTI-DOCUMENT SUMMARIZATION OF EVALUATIVE TEXT
    Carenini, Giuseppe
    Cheung, Jackie Chi Kit
    Pauls, Adam
    COMPUTATIONAL INTELLIGENCE, 2013, 29 (04) : 545 - 576
  • [44] Hierarchical Summarization: Scaling Up Multi-Document Summarization
    Christensen, Janara
    Soderland, Stephen
    Bansal, Gagan
    Mausam
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 902 - 912
  • [45] A novel approach to multi-document summarization
    Qiu, Li-Qing
    Pang, Bin
    Lin, Sai-Qun
    Chen, Peng
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 187 - +
  • [46] Hierarchical Transformers for Multi-Document Summarization
    Liu, Yang
    Lapata, Mirella
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5070 - 5081
  • [47] Application of anaphora resolution in event semantic relation based multi-document summarization
    Liu, Maofu
    Jin, Kejia
    Li, Shujun
    Zhang, Xiaolong
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 318 - 323
  • [48] Multi-document summarization of news articles using an event-based framework
    Ou, Shiyan
    Khoo, Christopher S. G.
    Goh, Dion H.
    ASLIB PROCEEDINGS, 2006, 58 (03): : 197 - 217
  • [49] Chain-of-event prompting for multi-document summarization by large language models
    Bao, Songlin
    Li, Tiantian
    Cao, Bin
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2024, 20 (03) : 229 - 247
  • [50] Graph-Based Multi-Modality Learning for Topic-Focused Multi-Document Summarization
    Wan, Xiaojun
    Xiao, Jianguo
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1586 - 1591