A deep database of medical abbreviations and acronyms for natural language processing

被引:0
|
作者
Lisa Grossman Liu
Raymond H. Grossman
Elliot G. Mitchell
Chunhua Weng
Karthik Natarajan
George Hripcsak
David K. Vawdrey
机构
[1] Columbia University,Department of Biomedical Informatics
[2] Kensho Technologies,undefined
[3] LLC,undefined
[4] Steele Institute for Health Innovation,undefined
[5] Geisinger,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The recognition, disambiguation, and expansion of medical abbreviations and acronyms is of upmost importance to prevent medically-dangerous misinterpretation in natural language processing. To support recognition, disambiguation, and expansion, we present the Medical Abbreviation and Acronym Meta-Inventory, a deep database of medical abbreviations. A systematic harmonization of eight source inventories across multiple healthcare specialties and settings identified 104,057 abbreviations with 170,426 corresponding senses. Automated cross-mapping of synonymous records using state-of-the-art machine learning reduced redundancy, which simplifies future application. Additional features include semi-automated quality control to remove errors. The Meta-Inventory demonstrated high completeness or coverage of abbreviations and senses in new clinical text, a substantial improvement over the next largest repository (6–14% increase in abbreviation coverage; 28–52% increase in sense coverage). To our knowledge, the Meta-Inventory is the most complete compilation of medical abbreviations and acronyms in American English to-date. The multiple sources and high coverage support application in varied specialties and settings. This allows for cross-institutional natural language processing, which previous inventories did not support. The Meta-Inventory is available at https://bit.ly/github-clinical-abbreviations.
引用
收藏
相关论文
共 50 条
  • [21] Natural Language Processing Using Database Context
    Mincheva, Zheni
    Vasilev, Nikola
    Antonov, Anatoliy
    Nikolov, Ventsislav
    INTELLIGENT COMPUTING, VOL 2, 2022, 507 : 747 - 759
  • [22] Database Tuning using Natural Language Processing
    Trummer, Immanuel
    SIGMOD RECORD, 2021, 50 (03) : 27 - 28
  • [23] Specialist medical abbreviations as a foreign language
    Das-Purkayastha, P
    McLeod, K
    Canter, R
    JOURNAL OF THE ROYAL SOCIETY OF MEDICINE, 2004, 97 (09) : 456 - 456
  • [24] Deep learning of the natural language processing
    Allauzen, Alexandre
    Schuetze, Hinrich
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2018, 59 (02): : 7 - 14
  • [25] Deep Learning in Natural Language Processing
    Feng, Haoda
    Shi, Feng
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 373 - 375
  • [26] Deep Learning for Natural Language Processing and Language Modelling
    Klosowski, Piotr
    2018 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2018, : 223 - 228
  • [27] NATURAL LANGUAGE QUERY PROCESSING IN A TEMPORAL DATABASE.
    De, Suranjan
    Pan, Shuh-Shen
    Whinston, Andrew B.
    Data and Knowledge Engineering, 1985, 1 (01): : 3 - 15
  • [28] Deep Learning on Graphs for Natural Language Processing
    Wu, Lingfei
    Chen, Yu
    Ji, Heng
    Liu, Bang
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4084 - 4085
  • [29] Deep Natural Language Processing for Search Systems
    Guo, Weiwei
    Gao, Huiji
    Shi, Jun
    Long, Bo
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1405 - 1406
  • [30] Deep Learning Methods in Natural Language Processing
    Flores, Alexis Stalin Alulema
    APPLIED TECHNOLOGIES (ICAT 2019), PT II, 2020, 1194 : 92 - 107