Evolutionary Automatic Text Summarization using Cluster Validation Indexes

被引:2
|
作者
Hernandez Castaneda, Nestor [1 ]
Garcia Hernandez, Rene Arnulfo [1 ]
Ledeneva, Yulia [1 ]
Hernandez Castaneda, Angel [1 ,2 ]
机构
[1] Autonomous Univ State Mexico, Toluca, Mexico
[2] Catedras CONACYT, Mexico City, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2020年 / 24卷 / 02期
关键词
Automatic text summarization; cluster validation indexes; evolutionary method; extractive summaries;
D O I
10.13053/CyS-24-2-3392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main problem for generating an extractive automatic text summary (EATS) is to detect the key themes of a text. For this task, unsupervised approaches cluster the sentences of the original text to find the key sentences that take part in an automatic summary. The quality of an automatic summary is evaluated using similarity metrics with human-made summaries. However, the relationship between the quality of the human-made summaries and the internal quality of the clustering is unclear. First, this paper proposes a comparison of the correlation of the quality of a human-made summary to the internal quality of the clustering validation index for finding the best correlation with a clustering validation index. Second, in this paper, an evolutionary method based on the best above internal clustering validation index for an automatic text summarization task is proposed. Our proposed unsupervised method for EATS has the advantage of not requiring information regarding the specific classes or themes of a text, and is therefore domain- and language-independent. The high results obtained by our method, using the most-competitive standard collection for EATS, prove that our method maintains a high correlation with human-made summaries, meeting the specific features of the groups, for example, compaction, separation, distribution, and density.
引用
收藏
页码:583 / 595
页数:13
相关论文
共 50 条
  • [31] Using Clustering and a Modified Classification algorithm for automatic text summarization
    Aries, Abdelkrime
    Oufaida, Houda
    Nouali, Omar
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [32] AUTOMATIC TEXT SUMMARIZATION USING PAGE RANK AND GENETIC ALGORITHM
    Gupta, Shashank
    Jagrawal, Anushree
    Mathur, Neha
    JOURNAL OF RAJASTHAN ACADEMY OF PHYSICAL SCIENCES, 2014, 13 (02): : 171 - 179
  • [33] Bilingual Automatic Text Summarization Using Unsupervised Deep Learning
    Singh, Shashi Pal
    Kumar, Ajai
    Mangal, Abhilasha
    Singhal, Shikha
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 1195 - 1200
  • [34] An Automatic Thai Text Summarization Using Topic Sensitive PageRank
    Chongsuntornsri, Aekkasit
    Sornil, Ohm
    2006 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES,VOLS 1-3, 2006, : 597 - +
  • [35] Text summarization system using automatic paragraphing in Korean document
    Kim, KS
    Lee, HJ
    Noh, TG
    Lee, SJ
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 1420 - 1424
  • [36] Automatic Text Summarization Using Deep Reinforcement Learning and Beyond
    Sun, Gang
    Wang, Zhongxin
    Zhao, Jia
    INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (03): : 458 - 469
  • [37] Improving Automatic Image Captioning Using Text Summarization Techniques
    Plaza, Laura
    Lloret, Elena
    Aker, Ahmet
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 165 - +
  • [38] Word Concept Extraction Using HOSVD for Automatic Text Summarization
    Biyabangard, Atiyeh
    Abadeh, Mohammad Saniee
    2015 AI & ROBOTICS (IRANOPEN), 2015,
  • [39] Automatic Arabic Text Summarization Using Clustering and Keyphrase Extraction
    Fejer, Hamzah Noori
    Omar, Nazlia
    PROCEEDINGS OF THE 2014 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MULTIMEDIA (ICIM), 2014, : 293 - 298
  • [40] Practical approach to automatic text summarization
    Hynek, J
    Jezek, K
    FROM INFORMATION TO KNOWLEDGE, 2003, : 378 - 388