Extractive multi-document text summarization based on graph independent sets

被引:38
|
作者
Uckan, Taner [1 ]
Karci, Ali [2 ]
机构
[1] Van Yuzuncu Yil Univ, Comp Programming Dept, TR-65000 Van, Turkey
[2] Inonu Univ, Dept Comp Engn, TR-44000 Malatya, Turkey
关键词
Graph independent set; Graph-based document summarization; Generic document summarization; Extractive text summarization; Multi document text summarization;
D O I
10.1016/j.eij.2019.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel methodology for extractive, generic summarization of text documents. The Maximum Independent Set, which has not been used previously in any summarization study, has been utilized within the context of this study. In addition, a text processing tool, which we named KUSH, is suggested in order to preserve the semantic cohesion between sentences in the representation stage of introductory texts. Our anticipation was that the set of sentences corresponding to the nodes in the independent set should be excluded from the summary. Based on this anticipation, the nodes forming the Independent Set on the graphs are identified and removed from the graph. Thus, prior to quantification of the effect of the nodes on the global graph, a limitation is applied on the documents to be summarized. This limitation prevents repetition of word groups to be included in the summary. Performance of the proposed approach on the Document Understanding Conference (DUC-2002 and DUC-2004) datasets was calculated using ROUGE evaluation metrics. The developed model achieved a 0.38072 ROUGE performance value for 100-word summaries, 0.51954 for 200-word summaries, and 0.59208 for 400-word summaries. The values reported throughout the experimental processes of the study reveal the contribution of this innovative method. (C) 2019 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.
引用
收藏
页码:145 / 157
页数:13
相关论文
共 50 条
  • [1] Multi-document extractive text summarization based on firefly algorithm
    Tomer, Minakshi
    Kumar, Manoj
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6057 - 6065
  • [2] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [3] Grapharizer: A Graph-Based Technique for Extractive Multi-Document Summarization
    Jalil, Zakia
    Nasir, Muhammad
    Alazab, Moutaz
    Nasir, Jamal
    Amjad, Tehmina
    Alqammaz, Abdullah
    ELECTRONICS, 2023, 12 (08)
  • [4] Survey on Extractive Text Summarization Methods with Multi-Document Datasets
    Varalakshmi, P. N. K.
    Kallimani, Jagadish S.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2113 - 2119
  • [5] Multi-document extractive text summarization: A comparative assessment on features
    Mutlu, Begum
    Sezer, Ebru A.
    Akcayol, M. Ali
    KNOWLEDGE-BASED SYSTEMS, 2019, 183
  • [6] Survey on Graph and Cluster Based Approaches in Multi-document Text Summarization
    Meena, Yogesh Kumar
    Jain, Ashish
    Gopalani, Dinesh
    2014 RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2014,
  • [7] Unsupervised extractive multi-document text summarization using a Genetic Algorithm
    Neri-Mendoza, Veronica
    Ledeneva, Yulia
    Garcia-Hernandez, Rene Arnulfo
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2397 - 2408
  • [8] Multi-Document Extractive Text Summarization via Deep Learning Approach
    Rezaei, Afsaneh
    Dami, Sina
    Daneshjoo, Parisa
    2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 680 - 685
  • [9] Experimental analysis of multiple criteria for extractive multi-document text summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [10] Extractive Multi-Document Text Summarization by Using Binary Particle Swarm Optimization
    Potnurwar, Archana
    Pimpalshende, Anjusha
    Aote, Shailendra S.
    Bongirwar, Vrusbali
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 32 - 34