Development and evaluation of RapTAT: A machine learning system for concept mapping of phrases from medical narratives

Authors
[1] Gobbel, Glenn T. (affiliations 1, 2)
[2] Reeves, Ruth (affiliation 1)
[3] Jayaramaraja, Shrimalini (affiliation 1)
[4] Giuse, Dario
[5] Speroff, Theodore (affiliations 1, 3)
[6] Brown, Steven H. (affiliation 1)
[7] Elkin, Peter L.
[8] Matheny, Michael E. (affiliations 1, 2, 3)
Source
Gobbel, G.T. (glenn.t.gobbel@vanderbilt.edu) | Academic Press Inc. | Vol. 48
Keywords
Artificial intelligence - Learning algorithms - Natural language processing systems - Medicine - Information retrieval - Terminology - Bayesian networks - Learning systems
DOI
Not available
Abstract
Rapid, automated determination of the mapping of free text phrases to pre-defined concepts could assist in the annotation of clinical notes and increase the speed of natural language processing systems. The aim of this study was to design and evaluate a token-order-specific naïve Bayes-based machine learning system (RapTAT) to predict associations between phrases and concepts. Performance was assessed using a reference standard generated from 2860 VA discharge summaries containing 567,520 phrases that had been mapped to 12,056 distinct Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts by the MCVS natural language processing system. It was also assessed on the manually annotated 2010 i2b2 challenge data. Performance was established with regard to precision, recall, and F-measure for each of the concepts within the VA documents using bootstrapping. Within that corpus, concepts identified by MCVS were broadly distributed throughout SNOMED CT, and the token-order-specific language model achieved better performance based on precision, recall, and F-measure (0.95 ± 0.15, 0.96 ± 0.16, and 0.95 ± 0.16, respectively; mean ± SD) than the bag-of-words-based naïve Bayes model (0.64 ± 0.45, 0.61 ± 0.46, and 0.60 ± 0.45, respectively) that has previously been used for concept mapping. Precision, recall, and F-measure on the i2b2 test set were 92.9%, 85.9%, and 89.2%, respectively, using the token-order-specific model. RapTAT required just 7.2 ms to map all phrases within a single discharge summary, and mapping rate did not decrease as the number of processed documents increased. The high performance attained by the tool in terms of both accuracy and speed was encouraging, and the mapping rate should be sufficient to support near-real-time, interactive annotation of medical narratives.
These results demonstrate the feasibility of rapidly and accurately mapping phrases to a wide range of medical concepts based on a token-order-specific naïve Bayes model and machine learning. © 2013.
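The contrast the abstract draws between a token-order-specific model and a bag-of-words model can be illustrated with a small sketch. This is not RapTAT's implementation, which the abstract does not describe in detail. It assumes that "token-order-specific" can be approximated by treating each (position, token) pair as a feature, and it assumes add-one smoothing. The class name `PhraseConceptNB`, the training phrases, and the concept labels `BP` and `ULCER` are all hypothetical stand-ins, not SNOMED CT identifiers.

```python
import math
from collections import Counter, defaultdict


class PhraseConceptNB:
    """Toy naive Bayes phrase-to-concept mapper (illustrative only).

    ordered=True  -> features are (position, token) pairs.
    ordered=False -> features are plain tokens (bag of words).
    """

    def __init__(self, ordered=True, alpha=1.0):
        self.ordered = ordered
        self.alpha = alpha  # add-alpha smoothing; an assumption of this sketch
        self.concept_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def _features(self, phrase):
        tokens = phrase.lower().split()
        return list(enumerate(tokens)) if self.ordered else tokens

    def fit(self, pairs):
        """Count features per concept from (phrase, concept) pairs."""
        for phrase, concept in pairs:
            self.concept_counts[concept] += 1
            for feature in self._features(phrase):
                self.feature_counts[concept][feature] += 1
                self.vocabulary.add(feature)
        return self

    def scores(self, phrase):
        """Return the smoothed log posterior (up to a constant) per concept."""
        total = sum(self.concept_counts.values())
        vocab_size = len(self.vocabulary)
        result = {}
        for concept, n in self.concept_counts.items():
            counts = self.feature_counts[concept]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(n / total)
            for feature in self._features(phrase):
                score += math.log((counts[feature] + self.alpha) / denom)
            result[concept] = score
        return result

    def predict(self, phrase):
        scores = self.scores(phrase)
        return max(scores, key=scores.get)


# Hypothetical training pairs; labels are placeholders, not SNOMED CT codes.
TRAINING = [
    ("blood pressure", "BP"),
    ("pressure ulcer", "ULCER"),
]

ordered_model = PhraseConceptNB(ordered=True).fit(TRAINING)
bag_model = PhraseConceptNB(ordered=False).fit(TRAINING)

print(ordered_model.predict("pressure sore"))
print(bag_model.scores("pressure sore"))
```

In this toy data the token "pressure" appears under both concepts, so the bag-of-words model scores them equally for the phrase "pressure sore", while the position-aware model prefers `ULCER` because "pressure" was seen in first position only for that concept. The example shows the mechanism only; it says nothing about the magnitude of the performance gap reported above.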