Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

被引:0
|
作者
Shen, Chen [1 ]
Lin, Hongfei [1 ]
Hao, Huihui [1 ]
Yang, Zhihao [1 ,2 ]
Wang, Jian [1 ]
Zhang, Shaowu [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Univ New South Wales Canberra, Sch Engn & Informat Technol, Canberra, ACT, Australia
基金
中国国家自然科学基金;
关键词
Intelligent; text summarization; graph-based ranking; similarity calculation;
D O I
10.3233/JIFS-179315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
引用
收藏
页码:4797 / 4802
页数:6
相关论文
共 50 条
  • [1] Unsupervised Graph-Based Tibetan Multi-Document Summarization
    Yan, Xiaodong
    Wang, Yiqin
    Wei Song
    Zhao, Xiaobing
    Run, A.
    Yang Yanxing
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1769 - 1781
  • [2] An Approach for Combining Multiple Weighting Schemes and Ranking Methods in Graph-Based Multi-Document Summarization
    Alzuhair, Abeer
    Al-Dhelaan, Mohammed
    IEEE ACCESS, 2019, 7 : 120375 - 120386
  • [3] Grapharizer: A Graph-Based Technique for Extractive Multi-Document Summarization
    Jalil, Zakia
    Nasir, Muhammad
    Alazab, Moutaz
    Nasir, Jamal
    Amjad, Tehmina
    Alqammaz, Abdullah
    ELECTRONICS, 2023, 12 (08)
  • [4] Enhancing Multi-Document Summarization with Cross-Document Graph-based Information Extraction
    Zhang, Zixuan
    Elfardy, Heba
    Dreyer, Markus
    Small, Kevin
    Ji, Heng
    Bansal, Mohit
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1696 - 1707
  • [5] A Query-Sensitive Graph-Based Sentence Ranking Algorithm for Query-Oriented Multi-Document Summarization .
    Wei, Furu
    He, Yanxiang
    Li, Wenjie
    Lu, Qin
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 9 - +
  • [6] GuideRank: A Guided Ranking Graph Model for Multilingual Multi-document Summarization
    Li, Haoran
    Zhang, Jiajun
    Zhou, Yu
    Zong, Chengqing
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 608 - 620
  • [7] Graph-Based Query-Focused Multi-document Summarization Using Improved Affinity Graph
    Hu, Po
    He, Jiacong
    Zhang, Yong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 336 - 347
  • [8] Graph-Based Multi-Modality Learning for Topic-Focused Multi-Document Summarization
    Wan, Xiaojun
    Xiao, Jianguo
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1586 - 1591
  • [9] A Graph Based Query Focused Multi-Document Summarization
    Balaji, J.
    Geetha, T.
    Parthasarathi, Ranjani
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2014, 10 (01) : 16 - 41
  • [10] Summarization of biomedical articles using domain-specific word embeddings and graph ranking
    Moradi, Milad
    Dashti, Maedeh
    Samwald, Matthias
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 107