Extractive multi-document text summarization based on graph independent sets

被引:38
|
作者
Uckan, Taner [1 ]
Karci, Ali [2 ]
机构
[1] Van Yuzuncu Yil Univ, Comp Programming Dept, TR-65000 Van, Turkey
[2] Inonu Univ, Dept Comp Engn, TR-44000 Malatya, Turkey
关键词
Graph independent set; Graph-based document summarization; Generic document summarization; Extractive text summarization; Multi document text summarization;
D O I
10.1016/j.eij.2019.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel methodology for extractive, generic summarization of text documents. The Maximum Independent Set, which has not been used previously in any summarization study, has been utilized within the context of this study. In addition, a text processing tool, which we named KUSH, is suggested in order to preserve the semantic cohesion between sentences in the representation stage of introductory texts. Our anticipation was that the set of sentences corresponding to the nodes in the independent set should be excluded from the summary. Based on this anticipation, the nodes forming the Independent Set on the graphs are identified and removed from the graph. Thus, prior to quantification of the effect of the nodes on the global graph, a limitation is applied on the documents to be summarized. This limitation prevents repetition of word groups to be included in the summary. Performance of the proposed approach on the Document Understanding Conference (DUC-2002 and DUC-2004) datasets was calculated using ROUGE evaluation metrics. The developed model achieved a 0.38072 ROUGE performance value for 100-word summaries, 0.51954 for 200-word summaries, and 0.59208 for 400-word summaries. The values reported throughout the experimental processes of the study reveal the contribution of this innovative method. (C) 2019 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.
引用
收藏
页码:145 / 157
页数:13
相关论文
共 50 条
  • [21] A Graph Based Query Focused Multi-Document Summarization
    Balaji, J.
    Geetha, T.
    Parthasarathi, Ranjani
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2014, 10 (01) : 16 - 41
  • [22] Text Summarization as a Multi-objective Optimization Task: Applying Harmony Search to Extractive Multi-Document Summarization
    Bidoki, M.
    Fakhrahmad, M.
    Moosavi, M. R.
    COMPUTER JOURNAL, 2022, 65 (05): : 1053 - 1072
  • [23] Graph-based extractive text summarization based on single document
    Avaneesh Kumar Yadav
    Rama Shankar Ranvijay
    Ashish Kumar Yadav
    Multimedia Tools and Applications, 2024, 83 : 18987 - 19013
  • [24] Decomposition-based multi-objective differential evolution for extractive multi-document automatic text summarization
    Wahab, Muhammad Hafizul Hazmi
    Hamid, Nor Asilah Wati Abdul
    Subramaniam, Shamala
    Latip, Rohaya
    Othman, Mohamed
    APPLIED SOFT COMPUTING, 2024, 151
  • [25] Graph-based extractive text summarization based on single document
    Yadav, Avaneesh Kumar
    Ranvijay, Rama Shankar
    Yadav, Rama Shankar
    Maurya, Ashish Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 18987 - 19013
  • [26] An Indicator-based Multi-Objective Optimization Approach Applied to Extractive Multi-Document Text Summarization
    Sanchez-Gomez, J.
    Vega-Rodriguez, M.
    Perez, C.
    IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1291 - 1299
  • [27] Extractive multi-document summarization using multilayer networks
    Tohalino, Jorge V.
    Amancio, Diego R.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 503 : 526 - 539
  • [28] Unsupervised Framework for Comment-based Multi-document Extractive Summarization
    Roha, Vishal Singh
    Saini, Naveen
    Saha, Sriparna
    Moreno, Jose G.
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 574 - 582
  • [29] An Optimization Algorithm for Extractive Multi-document Summarization Based on Association of Sentences
    Chen, Chun-Hao
    Yang, Yi-Chen
    Lin, Jerry Chun-Wei
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 460 - 469
  • [30] An Extractive Multi-Document Summarization Technique Based on Fuzzy Logic approach
    Tsoumou, Evrard Stency Larys
    Yang, Shichong
    Lai, Linjing
    Varus, Mbembo Loundou
    2016 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2016, : 346 - 351