Aspect Based Multi-Document Summarization

Cited by: 0
Authors
Sahoo, Deepak [1]
Balabantaray, Rakesh [1]
Phukon, Mridumoni [2]
Saikia, Saibali [2]
Affiliations
[1] IIIT Bhubaneswar, Dept Comp Sci & Engn, Bhubaneswar, Odisha, India
[2] Gauhati Univ, GUIST, Dept IT, Gauhati, Assam, India
Keywords
Summarization; Clustering; Term Weight; Positional Weight; Chronological Weight; Aspect Weight
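DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract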
Multi-document summarization is useful when a user deals with a group of heterogeneous documents and wants to compile the important information present in the collection, or when a group of homogeneous documents has been retrieved from a large corpus as the result of a query. We present an approach to automatic multi-document summarization based on clustering and sentence extraction. The user provides a query, and the documents relevant to that query are extracted from a corpus containing documents from various domains. An n x n matrix of sentence-level similarities is then built over the sentences of all extracted documents, and clusters of similar sentences are formed using the Markov clustering algorithm. Within each cluster, every sentence is assigned five different weights: (1) chronological weight (document level); (2) position weight (position of the sentence in the document); (3) sentence weight (based on term weight); (4) aspect-based weight (for sentences containing aspect words); and (5) synonymy and hyponymy weight. The top-ranked sentences, those with the highest combined weight, are then extracted from each cluster and presented to the user.
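The pipeline in the abstract (similarity matrix, Markov clustering, per-sentence weights, one top sentence per cluster) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the cosine similarity over term counts, the inflation value, the reciprocal forms of the chronological and position weights, the unweighted sum of the five weights, and every function and parameter name below are assumptions made for illustration. The synonymy and hyponymy weight is reduced to a lookup in a caller-supplied set of related words rather than a lexical database.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase alphanumeric tokens; a stand-in for real preprocessing."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity between two term-count Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def similarity_matrix(sentences):
    """n x n matrix of pairwise sentence similarities."""
    bags = [Counter(tokenize(s)) for s in sentences]
    return [[cosine(x, y) for y in bags] for x in bags]


def normalize_columns(m):
    """Scale every column to sum to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)] for i in range(n)]


def markov_cluster(sim, inflation=2.0, iterations=30):
    """Basic Markov clustering: alternate expansion and inflation."""
    n = len(sim)
    m = normalize_columns(sim)
    for _ in range(iterations):
        # Expansion: square the matrix (two-step random walks).
        m = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)] for i in range(n)]
        # Inflation: element-wise power, then renormalize the columns.
        m = normalize_columns([[v ** inflation for v in row] for row in m])
    clusters = []
    for i in range(n):
        # A row that keeps mass is an attractor; its non-zero columns are the members.
        members = sorted(j for j in range(n) if m[i][j] > 1e-6)
        if members and members not in clusters:
            clusters.append(members)
    return clusters


def sentence_score(text, doc_rank, position, corpus_tf, total, aspects, related):
    """Unweighted sum of five illustrative weights (all formulas are assumptions)."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    chronological = 1.0 / (1 + doc_rank)  # assumption: doc_rank 0 is the preferred document
    positional = 1.0 / (1 + position)     # assumption: earlier sentences weigh more
    term = sum(corpus_tf[t] for t in tokens) / (len(tokens) * total)
    aspect = sum(1 for t in tokens if t in aspects)
    lexical = sum(1 for t in tokens if t in related)  # stand-in for synonym/hyponym matching
    return chronological + positional + term + aspect + lexical


def summarize(records, aspects=frozenset(), related=frozenset()):
    """records: list of (text, doc_rank, position). Returns one sentence per cluster."""
    texts = [r[0] for r in records]
    corpus_tf = Counter(t for s in texts for t in tokenize(s))
    total = sum(corpus_tf.values())
    summary = []
    for members in markov_cluster(similarity_matrix(texts)):
        best = max(members, key=lambda i: sentence_score(
            records[i][0], records[i][1], records[i][2], corpus_tf, total, aspects, related))
        summary.append(texts[best])
    return summary


if __name__ == "__main__":
    demo = [
        ("The flood damaged the city bridge.", 0, 0),
        ("The city bridge was damaged by the flood.", 1, 3),
        ("Officials announced new relief funds.", 1, 2),
        ("Officials announced new relief funds today.", 0, 0),
    ]
    for line in summarize(demo, aspects={"bridge"}):
        print(line)
```

The pure-Python matrix product keeps the sketch dependency-free but costs O(n^3) per iteration, so a numerical library would be the natural choice for anything beyond a handful of sentences.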
Pages: 873-877
Page count: 5
Related papers
50 items in total
  • [21] MSSF: A Multi-Document Summarization Framework based on Submodularity
    Li, Jingxuan
    Li, Lei
    Li, Tao
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1247 - 1248
  • [22] Multi-document Summarization Algorithm based on Significance Sentences
    Liu Na
    Lu Ying
    Tang Xiao-Jun
    Wang Hai-Wen
    Xiao Peng
    Li Ming-Xia
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3847 - 3852
  • [23] Multi-Document Summarization Based on Locally Relevant Sentences
    Villatoro-Tello, Esau
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    Pinto-Avendano, David
    2009 EIGHTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 87 - +
  • [24] Cover Coefficient-Based Multi-document Summarization
    Ercan, Gonenc
    Can, Fazli
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 670 - 674
  • [25] Multi-document summarization based on BE-vector clustering
    Liu, DX
    He, YX
    Ji, DH
    Yang, H
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 470 - 479
  • [26] CONCEPT-BASED CLASSIFICATION FOR MULTI-DOCUMENT SUMMARIZATION
    Celikyilmaz, Asli
    Hakkani-Tuer, Dilek
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5540 - 5543
  • [27] Chinese Multi-document Summarization Based on Opinion Similarity
    Liu, Rui
    An, Yi
    Song, Lang
    CEIS 2011, 2011, 15
  • [28] Multi-Document Summarization with Centroid-Based Pretraining
    Puduppully, Ratish
    Jain, Parag
    Chen, Nancy F.
    Steedman, Mark
    61ST CONFERENCE OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 128 - 138
  • [29] Weighted consensus multi-document summarization
    Wang, Dingding
    Li, Tao
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) : 513 - 523
  • [30] MULTI-DOCUMENT SUMMARIZATION SYSTEMS COMPARISON
    Li, Lei
    Heng, Wei
    Liu, Ping'an
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1409 - 1413