ATSSC: Development of an approach based on soft computing for text summarization

被引:22
|
作者
Tayal, Madhuri A. [1 ]
Raghuwanshi, Mukesh M. [2 ]
Malik, Latesh G. [1 ]
机构
[1] GH Raisoni Coll Engn, Dept Comp Sci & Engn, Nagpur, Maharashtra, India
[2] Yeshwantrao Chavhan Coll Engn, Dept Comp Sci & Engn, Nagpur, Maharashtra, India
来源
关键词
Text document; Summarization; Semantic representation; Clustering; Reference resolution; Evaluation;
D O I
10.1016/j.csl.2016.07.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) is a field of computer science and linguistics concerned with the unique conversation between computers and human languages. It processes data through Lexical analysis, Syntax analysis, Semantic analysis, Discourse processing and Pragmatic analysis. An intelligent text summarization is one of the most challenging tasks in Natural language processing. It can be further used for applications like storytelling and question answering. This paper presents an automatic text summarizer for text documents using soft computing approach, consisting of SVO (Subject, Verb, and Object) Rules and Tag based training. This approach processes data through POS Tagger, NLP Parser, ambiguity removal, Semantic Representation, Sentence Reduction and Sentence Combination. At first, this paper defines the theme (title) of the document. After this operation, it preprocesses text document to perform pronominal reference resolution and text clustering. After these preprocessing operations, it identifies and removes ambiguity from the language using parser. And then, it calculates the score for the sentences using the title of the document, Semantic Sentence Similarity utility and n-gram Co-Occurrence relations of the words in a particular sentence. At last, sentences are combined with the SVO Rules after providing tag based training for simple and complex sentences. The summarizer was tested on the standard DUC 2007 dataset as well as a corpus of hundred text documents of different domains created by us. DUC 2007 Update Task produced accuracy F-scores of 0.13523 (ROUGE-2) and 0.112561 (ROUGE-SU4) for DUC 2007 documents and 0.4036 (ROUGE-2) and 0.3129 (ROUGE-SU4) for our corpus. Subjective evaluation was carried out by five language experts and twenty random individuals for system generated sample summaries. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:214 / 235
页数:22
相关论文
共 50 条
  • [21] Extractive Odia Text Summarization System: An OCR Based Approach
    Pattnaik, Priyanka
    Mallick, Debasish Kumar
    Parida, Shantipriya
    Dash, Satya Ranjan
    BIOLOGICALLY INSPIRED TECHNIQUES IN MANY-CRITERIA DECISION MAKING, 2020, 10 : 136 - 143
  • [22] A Semantic QA-Based Approach for Text Summarization Evaluation
    Chen, Ping
    Wu, Fei
    Wang, Tong
    Ding, Wei
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4800 - 4807
  • [23] A hybrid approach to automatic text summarization
    Chang, Te-Min
    Hsiao, Wen-Feng
    2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 65 - +
  • [24] Practical approach to automatic text summarization
    Hynek, J
    Jezek, K
    FROM INFORMATION TO KNOWLEDGE, 2003, : 378 - 388
  • [25] The method of multidimensional approach to text summarization
    Janaszkiewicz, Piotr
    Rozewski, Przemyslaw
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 2189 - 2196
  • [26] Malayalam Text Summarization: An Extractive Approach
    Krishnaprasad, P.
    Sooryanarayanan, A.
    Ramanujan, Ajeesh
    2016 INTERNATIONAL CONFERENCE ON NEXT GENERATION INTELLIGENT SYSTEMS (ICNGIS), 2016, : 40 - 43
  • [27] A complex network approach to text summarization
    Antiqueira, Lucas
    Oliveira, Osvaldo N., Jr.
    Costa, Luciano da Fontoura
    Volpe Nunes, Maria das Gracas
    INFORMATION SCIENCES, 2009, 179 (05) : 584 - 599
  • [28] Swarm Based Text Summarization
    Binwahlan, Mohammed Salem
    Salim, Naomie
    Suanmali, Ladda
    IACSIT-SC 2009: INTERNATIONAL ASSOCIATION OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY - SPRING CONFERENCE, 2009, : 145 - +
  • [29] Development of Technology for Summarization of Kazakh Text
    Zhabayev, Talgat
    Tukeyev, Ualsher
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (09) : 111 - 116
  • [30] A Framework for Extractive Text Summarization using Semantic Graph Based Approach
    Ullah, Shofi
    Al Islam, A. B. M. Alim
    2019 6TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS 2019), 2019, : 48 - 55