Online biomedical named entities recognition by data and knowledge-driven model

被引:3
|
作者
Cao, Lulu [1 ]
Wu, Chaochen [3 ]
Luo, Guan [2 ]
Guo, Chao [4 ]
Zheng, Anni [2 ]
机构
[1] Peking Univ, Peoples Hosp, Dept Rheumatol & Immunol, Beijing 100044, Peoples R China
[2] Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence S, Inst Automat, Beijing, Peoples R China
[3] Renmin Univ China, Beijing 100872, Peoples R China
[4] CAMS & PUMC, Fuwai Hosp, Dept Cardiol, Beijing 100037, Peoples R China
基金
中国国家自然科学基金;
关键词
Biomedical named entity recognition; Neural network; Pre-training; Knowledge representation; Online text;
D O I
10.1016/j.artmed.2024.102813
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is an important task for the natural language processing of biomedical text. Currently, most NER studies standardized biomedical text, but NER for unstandardized biomedical text draws less attention from researchers. Named entities in online biomedical text exist with errors and polymorphisms, which negatively impact NER models' performance and impede support from knowledge representation methods. In this paper, we propose a neural network method that can effectively recognize entities in unstandardized online medical/health text. We introduce a new pre -training scheme that uses largescale online question -answering pairs to enhance transformers' model capacity on online biomedical text. Moreover, we supply models with knowledge representations from a knowledge base called multi -channel knowledge labels, and this method overcomes the restriction from languages, like Chinese, that require word segmentation tools to represent knowledge. Our model outperforms other baseline methods significantly in experiments on a dataset for Chinese online medical entity recognition and achieves state-of-the-art results.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Data and knowledge-driven named entity recognition for cyber security
    Chen Gao
    Xuan Zhang
    Hui Liu
    Cybersecurity, 4
  • [2] Data and knowledge-driven named entity recognition for cyber security
    Gao, Chen
    Zhang, Xuan
    Liu, Hui
    CYBERSECURITY, 2021, 4 (01)
  • [3] Sequential knowledge-driven scene recognition model
    Chernyak, DA
    Stark, LW
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 382 - 387
  • [4] Biomedical named entities recognition using conditional random fields model
    Sun, Chengjie
    Guan, Yi
    Wang, Xiaolong
    Lin, Lei
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 1279 - 1288
  • [5] Using semantic web technologies for knowledge-driven querying of biomedical data
    O'Connor, Martin
    Shankar, Ravi
    Tu, Samson
    Nyulas, Csongor
    Parrish, Dave
    Musen, Mark
    Das, Amar
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2007, 4594 : 267 - 276
  • [6] A knowledge-driven approach to biomedical document conceptualization
    Zheng, Hai-Tao
    Borchert, Charles
    Jiang, Yong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2010, 49 (02) : 67 - 78
  • [7] A Knowledge-driven Data Warehouse Model for Analysis Evolution
    Favre, Cecile
    Bentayeb, Fadila
    Boussaid, Omar
    LEADING THE WEB IN CONCURRENT ENGINEERING: NEXT GENERATION CONCURRENT ENGINEERING, 2006, 143 : 271 - +
  • [8] Knowledge-Driven Compositional Action Recognition
    Liu, Yang
    Liu, Fang
    Jiao, Licheng
    Bao, Qianyue
    Li, Shuo
    Li, Lingling
    Liu, Xu
    PATTERN RECOGNITION, 2025, 163
  • [9] Knowledge-Driven Activity Recognition in Intelligent Environments
    Chen, Liming
    Nugent, Chris
    Cook, Diane
    Yu, Zhiwen
    PERVASIVE AND MOBILE COMPUTING, 2011, 7 (03) : 285 - 286
  • [10] Comparative Study of Word Embedding Methods in Biomedical Named Entities Recognition
    Derbel, Houssemeddine
    Habacha Chaibi, Anja
    Benabdelkader, Chiraz
    Hajjami Ben Ghezala, Henda
    VISION 2025: EDUCATION EXCELLENCE AND MANAGEMENT OF INNOVATIONS THROUGH SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE, 2019, : 6356 - 6367