Robust Chinese Clinical Named Entity Recognition with information bottleneck and adversarial training

被引:0
|
作者
He, Yunfei [1 ]
Zhang, Zhiqiang [2 ]
Shen, Jinlong [1 ]
Li, Yuling [1 ]
Zhang, Yiwen [3 ]
Ding, Weiping [4 ,5 ]
Yang, Fei [1 ]
机构
[1] Anhui Med Univ, Sch Biomed Engn, Hefei 230601, Anhui, Peoples R China
[2] Bengbu First Peoples Hosp, Med Equipment Engn Dept, Bengbu 233000, Anhui, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[4] Nantong Univ, Sch Artificial Intelligence & Comp Sci, Nantong 226019, Jiangsu, Peoples R China
[5] City Univ Macau, Fac Data Sci, Macau 999078, Peoples R China
基金
中国国家自然科学基金;
关键词
Chinese Clinical Named Entity Recognition; Multifaceted text representation; Information bottleneck; Hilbert-Schmidt independence criterion; Adversarial training; NETWORKS;
D O I
10.1016/j.asoc.2024.112409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese Clinical Named Entity Recognition (CCNER) aims to extract entities with specific medical significance from Chinese clinical texts, which is an important part of medical data mining. Some existing CCNER models may assume perfect text data and design complex models to improve their accuracy. However, due to the complexity of Chinese clinical entity semantics and the professionalism of annotation, Chinese clinical texts are prone to contain irregular misrepresentations and sparse entity labeling. That would lead to noisy or incomplete text features extracted by CCNER, seriously threatening the robustness of recognition in real-world scenarios. To address these problems, we propose the Robust Chinese Clinical Named Entity Recognition model (RCCNER). RCCNER comprises three essential components: multifaceted text representation, robust feature extraction, and robust model training. For multifaceted text representation, the model enhances consistency and collaboration between feature representations by integrating word embedding, radical embedding, and dictionary embedding to help withstand textual noise. Then, guided by the information bottleneck and the Hilbert-Schmidt independence criterion, robust feature extraction compresses the dependency between text representation and extracted features, while enhancing the dependency between extracted features and labels, which consequently provides reliable text features for robust recognition. The robust model training aspect leverages adversarial training to diminish RCCNER's sensitivity to noise disturbances and sparse entity labeling, thereby reinforcing its robustness in entity recognition. RCCNER collaboratively enhances the noise immunity through text representation, text feature extraction and model training. Several experiments on two popular public datasets validate the effectiveness and robustness of RCCNER.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Adversarial training based lattice LSTM for Chinese clinical named entity recognition
    Zhao, Shan
    Cai, Zhiping
    Chen, Haiwen
    Wang, Ye
    Liu, Fang
    Liu, Anfeng
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 99
  • [2] Named entity recognition for Chinese based on global pointer and adversarial training
    Hongjun Li
    Mingzhe Cheng
    Zelin Yang
    Liqun Yang
    Yansong Chua
    Scientific Reports, 13
  • [3] Named entity recognition for Chinese based on global pointer and adversarial training
    Li, Hongjun
    Cheng, Mingzhe
    Yang, Zelin
    Yang, Liqun
    Chua, Yansong
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [4] Study on Chinese Named Entity Recognition Based on Dynamic Fusion and Adversarial Training
    Fan, Fei
    Yang, Linnan
    Wu, Xingyu
    Lin, Shengken
    Dong, Huijie
    Yin, Changshan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 3 - 14
  • [5] Named Entity Recognition for Chinese Social Media with Domain Adversarial Training and Language Modeling
    Xu, Yong
    Lu, Qi
    Zhu, Muhua
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 687 - 699
  • [6] Named Entity Recognition Based on Reinforcement Learning and Adversarial Training
    Peng, Shi
    Zhang, Yong
    Yu, Yuanfang
    Zuo, Haoyang
    Zhang, Kai
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 191 - 202
  • [7] Adversarial training for named entity recognition of rail fault text
    Qu, J.
    Su, S.
    Li, R.
    Wang, G.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1353 - 1358
  • [8] Utilizing Chinese Dictionary Information in Named Entity Recognition
    Hu, Yun
    Liao, Mingxue
    Lv, Pin
    Zheng, Changwen
    COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 17 - 26
  • [9] IMPROVING CHINESE NAMED ENTITY RECOGNITION WITH LEXICAL INFORMATION
    Fu, Guo-Hong
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 3487 - 3491
  • [10] Named entity recognition in the food field based on BERT and Adversarial training
    Dong, Zhe
    Shao, RuoQi
    Chen, YuLiang
    Chen, JiaWei
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2219 - 2226