ATSSC: Development of an approach based on soft computing for text summarization

被引:22
|
作者
Tayal, Madhuri A. [1 ]
Raghuwanshi, Mukesh M. [2 ]
Malik, Latesh G. [1 ]
机构
[1] GH Raisoni Coll Engn, Dept Comp Sci & Engn, Nagpur, Maharashtra, India
[2] Yeshwantrao Chavhan Coll Engn, Dept Comp Sci & Engn, Nagpur, Maharashtra, India
来源
关键词
Text document; Summarization; Semantic representation; Clustering; Reference resolution; Evaluation;
D O I
10.1016/j.csl.2016.07.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) is a field of computer science and linguistics concerned with the unique conversation between computers and human languages. It processes data through Lexical analysis, Syntax analysis, Semantic analysis, Discourse processing and Pragmatic analysis. An intelligent text summarization is one of the most challenging tasks in Natural language processing. It can be further used for applications like storytelling and question answering. This paper presents an automatic text summarizer for text documents using soft computing approach, consisting of SVO (Subject, Verb, and Object) Rules and Tag based training. This approach processes data through POS Tagger, NLP Parser, ambiguity removal, Semantic Representation, Sentence Reduction and Sentence Combination. At first, this paper defines the theme (title) of the document. After this operation, it preprocesses text document to perform pronominal reference resolution and text clustering. After these preprocessing operations, it identifies and removes ambiguity from the language using parser. And then, it calculates the score for the sentences using the title of the document, Semantic Sentence Similarity utility and n-gram Co-Occurrence relations of the words in a particular sentence. At last, sentences are combined with the SVO Rules after providing tag based training for simple and complex sentences. The summarizer was tested on the standard DUC 2007 dataset as well as a corpus of hundred text documents of different domains created by us. DUC 2007 Update Task produced accuracy F-scores of 0.13523 (ROUGE-2) and 0.112561 (ROUGE-SU4) for DUC 2007 documents and 0.4036 (ROUGE-2) and 0.3129 (ROUGE-SU4) for our corpus. Subjective evaluation was carried out by five language experts and twenty random individuals for system generated sample summaries. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:214 / 235
页数:22
相关论文
共 50 条
  • [41] Enhaneement of Keyphrase-Based Approach of Automatie Bangla Text Summarization
    Haque, Md. Majharul
    Pervin, Suraiya
    Begum, Zerina
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 42 - 46
  • [42] Summarization based approach for Old Sinhala Text Archival Search and Preservation
    Rathnasena, K. A. M. P.
    Kumarasinghe, K. M. S. J.
    Paranavitharana, D. T. P.
    Dayarathne, D. V. A. U.
    Ranathunga, L.
    2018 18TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) CONFERENCE PROCEEDINGS, 2018, : 182 - 188
  • [43] The soft computing approach to program development time estimation
    Bhattacherjee, Vandana
    ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2006, : 291 - 292
  • [44] An approach to concept-obtained text summarization
    Wang, M
    Wang, XR
    Xu, C
    INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1290 - 1293
  • [45] An Approach to Automatic Text Summarization using WordNet
    Pal, Alok Ranjan
    Saha, Diganta
    SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 1169 - 1173
  • [46] A Heuristic Approach of Text Summarization for Bengali Documentation
    Abujar, Sheikh
    Hasan, Mahmudul
    Shahin, M. S. I.
    Hossain, Syed Akhter
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [47] A Novel Approach for Semantic Extractive Text Summarization
    Waseemullah
    Fatima, Zainab
    Zardari, Shehnila
    Fahim, Muhammad
    Andleeb Siddiqui, Maria
    Ibrahim, Ag. Asri Ag.
    Nisar, Kashif
    Naz, Laviza Falak
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [48] A New Approach for Single Text Document Summarization
    Yadav, Chandra Shekhar
    Sharan, Aditi
    Kumar, Rakesh
    Biswas, Payal
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 2, 2016, 380 : 401 - 411
  • [49] A Variable Dimension Optimization Approach for Text Summarization
    Verma, Pradeepika
    Om, Hari
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741 : 687 - 696
  • [50] TEXT SUMMARIZATION BY FORMAL CONCEPT ANALYSIS APPROACH
    Tatar, Doina
    Lupea, Mihaiela
    Marian, Zsuzsanna
    KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 37 - 48