Topic Oriented Multi-document Summarization Using LSA, Syntactic and Semantic Features

被引:0
|
作者
Anjaneyulu, M. [1 ]
Sarma, S. S. V. N. [2 ]
Reddy, P. Vijaya Pal [3 ]
Chander, K. Prem [1 ]
Nagaprasad, S. [4 ]
机构
[1] Dravidian Univ, Dept Comp Sci, Kuppam, India
[2] Vaagdevi Engn Coll, CSE Dept, Warangal, Andhra Pradesh, India
[3] Matrusri Engn Coll, CSE Dept, Hyderabad, India
[4] SRR Govt Arts & Sci Coll, Comp Sci Dept, Karimnagar, India
关键词
Multi-document Summarization; Topic identification; Latent Semantic Analysis; Semantic features; Syntactic features;
D O I
10.1007/978-981-13-2354-6_50
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multi-document Summarization (MDS) is a process obtaining precise and concise information from a set of documents described on the same topic. The generated summary makes the user to understand the important information that is present in the documents. In general, a set of interrelated documents discusses about a subject. The subject contains a set of topics. The description of various topics that belongs to the main subject are presented in the documents. The user wish to consolidate the subject in terms of topics that are covered in the various documents. The existing approaches suffer with identification of the topics within a document and also lack the establishment of semantic and syntactic relationship among the words within a sentence. In this paper, a novel unsupervised model is proposed to generate extractive multi-document summaries by identifying the topics that are present in the documents using Latent Semantic Analysis (LSA) and eliminating the redundant sentences that are describing the same topic that are present in themultiple documents using semantic and syntactic information embedded in the sentences. Empirical evaluations are carried out using LSA, lexical, syntactic and semantic features on DUC2006 dataset. The experimental results on DUC2006 demonstrate that the performance of proposed summarization system is comparable with the existing summarization systems in terms of F-measure, recall, and precision values.
引用
收藏
页码:487 / 502
页数:16
相关论文
共 50 条
  • [1] Sentence Similarity Using Syntactic and Semantic Features for Multi-document Summarization
    Anjaneyulu, M.
    Sarma, S. S. V. N.
    Reddy, P. Vijaya Pal
    Chander, K. Prem
    Nagaprasad, S.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 471 - 485
  • [2] Using Topic Themes for Multi-Document Summarization
    Harabagiu, Sanda
    Lacatusu, Finley
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (03)
  • [3] TOMDS (Topic-Oriented Multi-Document Summarization): Enabling Personalized Customization of Multi-Document Summaries
    Zhang, Xin
    Wei, Qiyi
    Song, Qing
    Zhang, Pengzhou
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [4] Research on multi-document summarization merging the sentential semantic features
    Luo S.-L.
    Bai J.-M.
    Pan L.-M.
    Han L.
    Meng Q.
    2016, Beijing Institute of Technology (36): : 1059 - 1064
  • [5] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [6] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172
  • [7] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1089 - 1094
  • [8] Identification of Event and Topic for Multi-document Summarization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    Matsuyoshi, Suguru
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2016, 9561 : 304 - 316
  • [9] Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures
    Bairi, Ramakrishna B.
    Iyer, Rishabh
    Ramakrishnan, Ganesh
    Bilmes, Jeff
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 553 - 563
  • [10] A New Automatic Multi-document Text Summarization using Topic Modeling
    Roul, Rajendra Kumar
    Mehrotra, Samarth
    Pungaliya, Yash
    Sahoo, Jajati Keshari
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2019, 2019, 11319 : 212 - 221