Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

被引:0
|
作者
Varma, Maya [1 ]
Orr, Laurel [1 ]
Wu, Sen [1 ]
Leszczynski, Megan [1 ]
Ling, Xiao [2 ]
Re, Christopher [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Apple, Cupertino, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity disambiguation (NED), which involves mapping textual mentions to structured entities, is particularly challenging in the medical domain due to the presence of rare entities. Existing approaches are limited by the presence of coarse-grained structural resources in biomedical knowledge bases as well as the use of training datasets that provide low coverage over uncommon resources. In this work, we address these issues by proposing a cross-domain data integration method that transfers structural knowledge from a general text knowledge base to the medical domain. We utilize our integration scheme to augment structural resources and generate a large biomedical NED dataset for pretraining. Our pretrained model with injected structural knowledge achieves state-of-the-art performance on two benchmark medical NED datasets: MedMentions and BC5CDR. Furthermore, we improve disambiguation of rare entities by up to 57 accuracy points.
引用
收藏
页码:4566 / 4575
页数:10
相关论文
共 50 条
  • [21] An Entity-Aware Adversarial Domain Adaptation Network for Cross-Domain Named Entity Recognition (Student Abstract)
    Peng, Qi
    Zheng, Changmeng
    Cai, Yi
    Wang, Tao
    Xie, Haoran
    Li, Qing
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15865 - 15866
  • [22] Decoupled Hyperbolic Graph Attention Network for Cross-domain Named Entity Recognition
    Xu, Jingyun
    Cai, Yi
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 591 - 600
  • [23] Cross-Domain Tibetan Named Entity Recognition via Large Language Models
    Zhang, Jin
    Gao, Fan
    Yeshi, Lobsang
    Tashi, Dorje
    Wang, Xiangshi
    Tashi, Nyima
    Luosang, Gadeng
    ELECTRONICS, 2025, 14 (01):
  • [24] Harnessing Causal Structure Alignment for Enhanced Cross-Domain Named Entity Recognition
    Liu, Xiaoming
    Cao, Mengyuan
    Yang, Guan
    Liu, Jie
    Liu, Yang
    Wang, Hang
    ELECTRONICS, 2024, 13 (01)
  • [25] DOZEN: Cross-Domain Zero Shot Named Entity Recognition with Knowledge Graph
    Nguyen, Hoang Van
    Gelli, Francesco
    Poria, Soujanya
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1642 - 1646
  • [26] Named Entity Recognition System for the Biomedical Domain
    Sharma, Raghav
    Chauhan, Deependra
    Sharma, Raksha
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 837 - 840
  • [27] Cross-Domain Neurobiology Data Integration and Exploration
    Xuan, Weijian
    Dai, Manhong
    Josh, Buckner
    Mirel, Barbara
    Song, Jean
    Athey, Brian
    Watson, Stanley J.
    Meng, Fan
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 37 - +
  • [28] Cross-domain neurobiology data integration and exploration
    Weijian Xuan
    Manhong Dai
    Josh Buckner
    Barbara Mirel
    Jean Song
    Brian Athey
    Stanley J Watson
    Fan Meng
    BMC Genomics, 11
  • [29] Cross-domain neurobiology data integration and exploration
    Xuan, Weijian
    Dai, Manhong
    Buckner, Josh
    Mirel, Barbara
    Song, Jean
    Athey, Brian
    Watson, Stanley J.
    Meng, Fan
    BMC GENOMICS, 2010, 11
  • [30] Cross-Domain Labeled LDA for Cross-Domain Text Classification
    Jing, Baoyu
    Lu, Chenwei
    Wang, Deqing
    Zhuang, Fuzhen
    Niu, Cheng
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 187 - 196