Evolutionary Automatic Text Summarization using Cluster Validation Indexes

被引:2
|
作者
Hernandez Castaneda, Nestor [1 ]
Garcia Hernandez, Rene Arnulfo [1 ]
Ledeneva, Yulia [1 ]
Hernandez Castaneda, Angel [1 ,2 ]
机构
[1] Autonomous Univ State Mexico, Toluca, Mexico
[2] Catedras CONACYT, Mexico City, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2020年 / 24卷 / 02期
关键词
Automatic text summarization; cluster validation indexes; evolutionary method; extractive summaries;
D O I
10.13053/CyS-24-2-3392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main problem for generating an extractive automatic text summary (EATS) is to detect the key themes of a text. For this task, unsupervised approaches cluster the sentences of the original text to find the key sentences that take part in an automatic summary. The quality of an automatic summary is evaluated using similarity metrics with human-made summaries. However, the relationship between the quality of the human-made summaries and the internal quality of the clustering is unclear. First, this paper proposes a comparison of the correlation of the quality of a human-made summary to the internal quality of the clustering validation index for finding the best correlation with a clustering validation index. Second, in this paper, an evolutionary method based on the best above internal clustering validation index for an automatic text summarization task is proposed. Our proposed unsupervised method for EATS has the advantage of not requiring information regarding the specific classes or themes of a text, and is therefore domain- and language-independent. The high results obtained by our method, using the most-competitive standard collection for EATS, prove that our method maintains a high correlation with human-made summaries, meeting the specific features of the groups, for example, compaction, separation, distribution, and density.
引用
收藏
页码:583 / 595
页数:13
相关论文
共 50 条
  • [1] Evolutionary Algorithms for Extractive Automatic Text Summarization
    Meena, Yogesh Kumar
    Gopalani, Dinesh
    INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONVERGENCE (ICCC 2015), 2015, 48 : 244 - 249
  • [2] Automatic Text Summarization
    Soumya, S.
    Kumar, Geethu S.
    Naseem, Rasia
    Mohan, Saumya
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 787 - 789
  • [3] Automatic Text Summarization Using Fuzzy Inference
    Jafari, Mehdi
    Shahabi, Amir Shahab
    Wang, Jing
    Qin, Yongrui
    Tao, Xiaohui
    Gheisari, Mehdi
    2016 22ND INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2016, : 256 - 260
  • [4] An Approach to Automatic Text Summarization using WordNet
    Pal, Alok Ranjan
    Saha, Diganta
    SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 1169 - 1173
  • [5] Automatic Text Summarization
    Fattah, Mohamed Abdel
    Ren, Fuji
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 192 - +
  • [6] Automatic Text Summarization using Word Embeddings
    Easwar, Arjun
    Uthra, Annie
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1065 - 1079
  • [7] Automatic Arabic Text Summarization Using Analogical Proportions
    Bilel Elayeb
    Amina Chouigui
    Myriam Bounhas
    Oussama Ben Khiroun
    Cognitive Computation, 2020, 12 : 1043 - 1069
  • [8] Improving the Performance of Text Categorization using Automatic Summarization
    Jiang Xiao-Yu
    Fan Xiao-Zhong
    Wang Zhi-Fei
    Jia Ke-Liang
    2009 INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, PROCEEDINGS, 2009, : 347 - +
  • [9] AUTOMATIC TEXT SUMMARIZATION USING SUPPORT VECTOR MACHINE
    Begum, Nadira
    Fattah, Mohamed Abdel
    Ren, Fuji
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (07): : 1987 - 1996
  • [10] Automatic text summarization using latent semantic analysis
    I. V. Mashechkin
    M. I. Petrovskiy
    D. S. Popov
    D. V. Tsarev
    Programming and Computer Software, 2011, 37 : 299 - 305