Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

被引:0
|
作者
Shen, Chen [1 ]
Lin, Hongfei [1 ]
Hao, Huihui [1 ]
Yang, Zhihao [1 ,2 ]
Wang, Jian [1 ]
Zhang, Shaowu [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Univ New South Wales Canberra, Sch Engn & Informat Technol, Canberra, ACT, Australia
基金
中国国家自然科学基金;
关键词
Intelligent; text summarization; graph-based ranking; similarity calculation;
D O I
10.3233/JIFS-179315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
引用
收藏
页码:4797 / 4802
页数:6
相关论文
共 50 条
  • [11] Using query expansion in graph-based approach for query-focused multi-document summarization
    Zhao, Lin
    Wu, Lide
    Huang, Xuanjing
    INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (01) : 35 - 41
  • [12] Query Focused Multi-document Summarization Based on Five-Layered Graph and Universal Paraphrastic Embeddings
    Canhasi, Ercan
    ARTIFICIAL INTELLIGENCE TRENDS IN INTELLIGENT SYSTEMS, CSOC2017, VOL 1, 2017, 573 : 220 - 228
  • [13] Graph-based Keyphrase Extraction Using Word and Document Embeddings
    Zu, Xian
    Xie, Fei
    Liu, Xiaojian
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 70 - 76
  • [14] A Query Specific Graph Based Approach to Multi-document Text Summarization: Simultaneous Cluster and Sentence Ranking
    Pandit, Sandip R.
    Potey, M. A.
    2013 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND RESEARCH ADVANCEMENT (ICMIRA 2013), 2013, : 213 - 217
  • [15] A document-sensitive graph model for multi-document summarization
    Furu Wei
    Wenjie Li
    Qin Lu
    Yanxiang He
    Knowledge and Information Systems, 2010, 22 : 245 - 259
  • [16] A document-sensitive graph model for multi-document summarization
    Wei, Furu
    Li, Wenjie
    Lu, Qin
    He, Yanxiang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (02) : 245 - 259
  • [17] Clustering cliques for graph-based summarization of the biomedical research literature
    Han Zhang
    Marcelo Fiszman
    Dongwook Shin
    Bartlomiej Wilkowski
    Thomas C Rindflesch
    BMC Bioinformatics, 14
  • [18] Clustering cliques for graph-based summarization of the biomedical research literature
    Zhang, Han
    Fiszman, Marcelo
    Shin, Dongwook
    Wilkowski, Bartlomiej
    Rindflesch, Thomas C.
    BMC BIOINFORMATICS, 2013, 14
  • [19] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [20] Extractive multi-document text summarization based on graph independent sets
    Uckan, Taner
    Karci, Ali
    EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (03) : 145 - 157