Degree centrality for semantic abstraction summarization of therapeutic studies

被引:49
作者
Zhang, Han [1 ,2 ]
Fiszman, Marcelo [2 ]
Shin, Dongwook [2 ]
Miller, Christopher M. [2 ]
Rosemblat, Graciela [2 ]
Rindflesch, Thomas C. [2 ]
机构
[1] China Med Univ, Dept Med Informat, Shenyang, Peoples R China
[2] NIH, Natl Lib Med, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
Automatic summarization; Natural language processing; Graph theory; Degree centrality; Semantic processing; Disease treatment; MEDICAL LANGUAGE SYSTEM;
D O I
10.1016/j.jbi.2011.05.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatic summarization has been proposed to help manage the results of biomedical information retrieval systems. Semantic MEDLINE, for example, summarizes semantic predications representing assertions in MEDLINE citations. Results are presented as a graph which maintains links to the original citations. Graphs summarizing more than 500 citations are hard to read and navigate, however. We exploit graph theory for focusing these large graphs. The method is based on degree centrality, which measures connectedness in a graph. Four categories of clinical concepts related to treatment of disease were identified and presented as a summary of input text. A baseline was created using term frequency of occurrence. The system was evaluated on summaries for treatment of five diseases compared to a reference standard produced manually by two physicians. The results showed that recall for system results was 72%, precision was 73%, and F-score was 0.72. The system F-score was considerably higher than that for the baseline (0.47). Published by Elsevier Inc.
引用
收藏
页码:830 / 838
页数:9
相关论文
共 48 条
[1]   Summarization from medical documents: a survey [J].
Afantenos, S ;
Karkaletsis, V ;
Stamatopoulos, P .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) :157-177
[2]  
Ahlers CB, 2007, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2007, P209
[3]  
Amigo E., 2004, P 42 ANN M ASS COMP, P207
[4]  
[Anonymous], 2010, P 2010 WORKSH BIOM N
[5]  
[Anonymous], P TALN00
[6]  
[Anonymous], P ACL 02 WORKSH AUT
[7]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[8]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[9]   Improving graph-based random walks for complex question answering using syntactic, shallow semantic and extended string subsequence kernels [J].
Chali, Yllias ;
Hasan, Sadid A. ;
Joty, Shafiq R. .
INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (06) :843-855
[10]  
Chen Y, 2010, CONSUM COMM NETWORK, P1