Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

被引:0
|
作者
Shen, Chen [1 ]
Lin, Hongfei [1 ]
Hao, Huihui [1 ]
Yang, Zhihao [1 ,2 ]
Wang, Jian [1 ]
Zhang, Shaowu [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Univ New South Wales Canberra, Sch Engn & Informat Technol, Canberra, ACT, Australia
基金
中国国家自然科学基金;
关键词
Intelligent; text summarization; graph-based ranking; similarity calculation;
D O I
10.3233/JIFS-179315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
引用
收藏
页码:4797 / 4802
页数:6
相关论文
共 50 条
  • [21] Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization
    Li, Miao
    Qi, Jianzhong
    Lau, Jey Han
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13085 - 13093
  • [22] Survey on Graph and Cluster Based Approaches in Multi-document Text Summarization
    Meena, Yogesh Kumar
    Jain, Ashish
    Gopalani, Dinesh
    2014 RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2014,
  • [23] A Proposed Textual Graph Based Model for Arabic Multi-document Summarization
    Alwan, Muneer A.
    Onsi, Hoda M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (06) : 435 - 439
  • [24] An Intelligent Web Search Using Multi-Document Summarization
    Takale, Sheetal A.
    Kulkarni, Prakash J.
    Shah, Sahil K.
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2016, 6 (02) : 41 - 65
  • [25] Parallel Relationship Graph to Improve Multi-Document Summarization
    Lu, Menghua
    Liang, Lijia
    Liu, Gongshen
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 630 - 642
  • [26] StarSum: A Simple Star Graph for Multi-document Summarization
    Al-Dhelaan, Mohammed
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 715 - 718
  • [27] Aspect Based Multi-Document Summarization
    Sahoo, Deepak
    Balabantaray, Rakesh
    Phukon, Mridumoni
    Saikia, Saibali
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 873 - 877
  • [28] Summarizing learning materials using graph based multi-document summarization
    Krishnaveni P.
    Balasundaram S.R.
    International Journal of Web-Based Learning and Teaching Technologies, 2021, 16 (05) : 39 - 57
  • [29] Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization
    Cai, Xiaoyan
    Li, Wenjie
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1424 - 1433
  • [30] Manifold-Ranking Based Topic-Focused Multi-Document Summarization
    Wan, Xiaojun
    Yang, Jianwu
    Xiao, Jianguo
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2903 - 2908