Embedded Word Representations for Rich Indexing: A Case Study for Medical Records

被引:2
|
作者
Metcalf, Katherine [1 ]
Leake, David [1 ]
机构
[1] Indiana Univ, Sch Informat Comp & Engn, Bloomington, IN 47408 USA
来源
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2018 | 2018年 / 11156卷
关键词
Case-based reasoning for medicine; Electronic health records; Indexing; Textual case-based reasoning; Vector space embedding; TEXT; RETRIEVAL; UMLS;
D O I
10.1007/978-3-030-01081-2_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Case indexing decisions must often confront the tradeoff between rich semantic indexing schemes, which provide effective retrieval at large indexing cost, and shallower indexing schemes, which enable lowcost indexing but may be less reliable. Indexing for textual case-based reasoning is often based on information retrieval approaches that minimize index acquisition cost but sacrifice semantic information. This paper presents JointEmbed, a method for automatically generating rich indices. JointEmbed automatically generates continuous vector space embeddings that implicitly capture semantic information, leveraging multiple knowledge sources such as free text cases and pre-existing knowledge graphs. JointEmbed generates effective indices by applying pTransR, a novel approach for modelling knowledge graphs, to encode and summarize contents of domain knowledge resources. JointEmbed is applied to the medical CBR task of retrieving relevant patient electronic health records, for which potential health consequences make retrieval quality paramount. An evaluation supports that JointEmbed outperforms previous methods.
引用
收藏
页码:264 / 280
页数:17
相关论文
共 50 条
  • [1] Word indexing foe mobile device data representations
    Larkin, Henry
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 399 - 404
  • [2] Indexing of medical diagnoses by word affinity method
    Surján, G
    Héja, G
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 276 - 279
  • [3] Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
    Koutsomitropoulos, Dimitrios A.
    Andriopoulos, Andreas D.
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 343 - 354
  • [4] Do Japanese word-embedded representations obtained in the academic corpus retain the medical concepts of "infarction"?
    Yokokawa, Daiki
    Noda, Kazutaka
    Uehara, Takanori
    Yanagita, Yasutaka
    Ohira, Yoshiyuki
    Ikusaka, Masatomi
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 143
  • [5] Learning to Combine Representations for Medical Records Search
    Limsopatham, Nut
    Macdonald, Craig
    Ounis, Ladh
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 833 - 836
  • [6] Towards a semantic indexing model adapted on patient medical records
    Dinh, Duy
    Tamine, Lynda
    CORIA 2010: Actes de la COnference en Recherche d'Information et Applications - Proceedings of the Conference on Information Retrieval and Applications, 2010, : 325 - 336
  • [7] Case Study of Linking Dental and Medical Healthcare Records
    Theis, Mary Kay
    Reid, Robert J.
    Chaudhari, Monica
    Newton, Katherine M.
    Spangler, Leslie
    Grossman, David C.
    Inge, Ronald E.
    AMERICAN JOURNAL OF MANAGED CARE, 2010, 16 (02): : E51 - E56
  • [8] Analysis of Subword based Word Representations Case Study: Fasttext Malayalam
    Vivek, M. R.
    Chandran, Priya
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [9] Comparison of Word Embeddings for Extraction from Medical Records
    Dudchenko, Aleksei
    Kopanitsa, Georgy
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2019, 16 (22)
  • [10] MEDICAL SOCIAL CASE RECORDS
    Cannon, M. Antoinette
    FAMILY, 1930, 10 (09): : 286 - 287