Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

被引:0
|
作者
Shen, Chen [1 ]
Lin, Hongfei [1 ]
Hao, Huihui [1 ]
Yang, Zhihao [1 ,2 ]
Wang, Jian [1 ]
Zhang, Shaowu [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Univ New South Wales Canberra, Sch Engn & Informat Technol, Canberra, ACT, Australia
基金
中国国家自然科学基金;
关键词
Intelligent; text summarization; graph-based ranking; similarity calculation;
D O I
10.3233/JIFS-179315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
引用
收藏
页码:4797 / 4802
页数:6
相关论文
共 50 条
  • [31] Improvements in Multi-Document Abstractive Summarization using Multi Sentence Compression with Word Graph and Node Alignment
    Agarwal, Raksha
    Chatterjee, Niladri
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
  • [32] An unsupervised method for extractive multi-document summarization based on centroid approach and sentence embeddings
    Lamsiyah, Salima
    El Mahdaouy, Abdelkader
    Espinasse, Bernard
    Ouatik, Said El Alaoui
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 167
  • [33] Literature Study on Multi-document Text Summarization Techniques
    Shah, Chintan
    Jivani, Anjali
    SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 442 - 451
  • [34] Hindi Multi-document Word Cloud based Summarization through Unsupervised Learning
    Bafna, Prafulla B.
    Saini, Jatinderkumar R.
    2019 9TH INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY: SIGNAL AND INFORMATION PROCESSING (ICETET-SIP-19), 2019,
  • [35] Experimental study on fuzzy word memberships for multi-document summarization
    Tjhi, William-Chandra
    Chen, Lihui
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1629 - 1633
  • [36] Multi-document summarization based on lexical chains
    Chen, YM
    Wang, XL
    Liu, BQ
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1937 - 1942
  • [37] Genetic algorithm based multi-document summarization
    Liu, Dexi
    He, Yanxiang
    Ji, Donghong
    Yang, Hua
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1140 - 1144
  • [38] Boosting multi-document summarization with hierarchical graph convolutional networks
    Song, Yingjie
    Yang, Li
    Luo, Wenming
    Xiao, Xiong
    Tang, Zhuo
    NEUROCOMPUTING, 2025, 614
  • [39] Genetic Semantic Graph Approach for Multi-document Abstractive Summarization
    Khan, Atif
    Salim, Naomie
    Kumar, Yogan Jaya
    2015 FIFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC), 2015, : 173 - 181
  • [40] Co-HITS-Ranking Based Query-Focused Multi-document Summarization
    Hu, Po
    Ji, Donghong
    Teng, Chong
    INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 121 - 130