Development and evaluation of RapTAT: A machine learning system for concept mapping of phrases from medical narratives

Authors (affiliation numbers in parentheses)
[1] Gobbel, Glenn T. (1, 2)
[2] Reeves, Ruth (1)
[3] Jayaramaraja, Shrimalini (1)
[4] Giuse, Dario
[5] Speroff, Theodore (1, 3)
[6] Brown, Steven H. (1)
[7] Elkin, Peter L.
[8] Matheny, Michael E. (1, 2, 3)
Source
Gobbel, G.T. (glenn.t.gobbel@vanderbilt.edu) | Publisher: Academic Press Inc. | No. 48
Keywords
Artificial intelligence - Learning algorithms - Natural language processing systems - Medicine - Information retrieval - Terminology - Bayesian networks - Learning systems
Abstract
Rapid, automated determination of the mapping of free text phrases to pre-defined concepts could assist in the annotation of clinical notes and increase the speed of natural language processing systems. The aim of this study was to design and evaluate a token-order-specific naïve Bayes-based machine learning system (RapTAT) to predict associations between phrases and concepts. Performance was assessed using a reference standard generated from 2860 VA discharge summaries containing 567,520 phrases that had been mapped to 12,056 distinct Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts by the MCVS natural language processing system. It was also assessed on the manually annotated 2010 i2b2 challenge data. Performance was established with regard to precision, recall, and F-measure for each of the concepts within the VA documents using bootstrapping. Within that corpus, concepts identified by MCVS were broadly distributed throughout SNOMED CT, and the token-order-specific language model achieved better performance based on precision, recall, and F-measure (0.95 ± 0.15, 0.96 ± 0.16, and 0.95 ± 0.16, respectively; mean ± SD) than the bag-of-words-based naïve Bayes model (0.64 ± 0.45, 0.61 ± 0.46, and 0.60 ± 0.45, respectively) that has previously been used for concept mapping. Precision, recall, and F-measure on the i2b2 test set were 92.9%, 85.9%, and 89.2%, respectively, using the token-order-specific model. RapTAT required just 7.2 ms to map all phrases within a single discharge summary, and the mapping rate did not decrease as the number of processed documents increased. The high performance attained by the tool in terms of both accuracy and speed was encouraging, and the mapping rate should be sufficient to support near-real-time, interactive annotation of medical narratives.
These results demonstrate the feasibility of rapidly and accurately mapping phrases to a wide range of medical concepts based on a token-order-specific naïve Bayes model and machine learning. © 2013.
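
The contrast the abstract draws between a token-order-specific model and a bag-of-words model can be illustrated with a small sketch. This is not RapTAT's implementation: the class name `ToyNaiveBayesMapper`, the add-one (Laplace) smoothing, the whitespace tokenizer, and the two made-up concept labels are all assumptions chosen for illustration. The only idea taken from the abstract is that the order-specific variant treats a token together with its position in the phrase as the feature, whereas the bag-of-words variant ignores position.

```python
import math
from collections import Counter, defaultdict


class ToyNaiveBayesMapper:
    """Hypothetical phrase-to-concept mapper (illustration only, not RapTAT)."""

    def __init__(self, use_position=True, alpha=1.0):
        self.use_position = use_position  # True: (position, token) features
        self.alpha = alpha                # assumed add-alpha smoothing
        self.concept_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def _features(self, phrase):
        tokens = phrase.lower().split()
        # Order-specific: each token is paired with its index in the phrase.
        return list(enumerate(tokens)) if self.use_position else tokens

    def fit(self, pairs):
        for phrase, concept in pairs:
            self.concept_counts[concept] += 1
            for feature in self._features(phrase):
                self.feature_counts[concept][feature] += 1
                self.vocabulary.add(feature)
        return self

    def log_scores(self, phrase):
        total = sum(self.concept_counts.values())
        vocab_size = len(self.vocabulary)
        scores = {}
        for concept, n in self.concept_counts.items():
            counts = self.feature_counts[concept]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(n / total)  # log prior
            for feature in self._features(phrase):
                score += math.log((counts[feature] + self.alpha) / denom)
            scores[concept] = score
        return scores

    def predict(self, phrase):
        scores = self.log_scores(phrase)
        return max(scores, key=scores.get)


# Contrived training data: the two toy concepts share tokens in reverse order.
training = [
    ("bone marrow", "TOY_CONCEPT_A"),
    ("marrow bone", "TOY_CONCEPT_B"),
]

ordered = ToyNaiveBayesMapper(use_position=True).fit(training)
bag = ToyNaiveBayesMapper(use_position=False).fit(training)

print(ordered.predict("bone marrow"))  # TOY_CONCEPT_A
print(ordered.predict("marrow bone"))  # TOY_CONCEPT_B
print(bag.log_scores("marrow bone"))   # both concepts receive the same score
```

On this deliberately contrived data the bag-of-words variant cannot separate the two concepts because both contain the same tokens, while the position-aware variant can. The example says nothing about the size of the performance gap reported in the abstract, which depends on the actual corpus.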