Pattern-enhanced Named Entity Recognition with Distant Supervision

被引:7
|
作者
Wang, Xuan [1 ]
Guan, Yingjun [1 ]
Zhang, Yu [1 ]
Li, Qi [2 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA USA
关键词
named entity recognition; distant supervision; pattern mining; neural network;
D O I
10.1109/BigData50022.2020.9378052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised deep learning methods have achieved state-of-the-art performance on the task of named entity recognition (NER). However, such methods suffer from high cost and low efficiency in training data annotation, leading to highly specialized NER models that cannot be easily adapted to new domains. Recently, distant supervision has been applied to replace human annotation, thanks to the fast development of domain specific knowledge bases. However, the generated noisy labels pose significant challenges in learning effective neural models with distant supervision. We propose PATNER, a distantly supervised NER model that effectively deals with noisy distant supervision from domain-specific dictionaries. PATNER does not require human-annotated training data but only relies on unlabeled data and incomplete domain-specific dictionaries for distant supervision. It incorporates the distant labeling uncertainty into the neural model training to enhance distant supervision. We go beyond the traditional sequence labeling framework and propose a more effective fuzzy neural model using the tie-or break tagging scheme for the NER task. Extensive experiments on three benchmark datasets in two domains demonstrate the power of PATNER. Case studies on two additional real-world datasets demonstrate that PATNER improves the distant NER performance in both entity boundary detection and entity type recognition. The results show a great promise in supporting high quality named entity recognition with domain-specific dictionaries on a wide variety of entity types.
引用
收藏
页码:818 / 827
页数:10
相关论文
共 50 条
  • [41] Arabic Named Entity Recognition
    Benajiba, Yassine
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 151 - 152
  • [42] Dynamic Named Entity Recognition
    Luiggi, Tristan
    Soulier, Laure
    Guigue, Vincent
    Jendoubi, Siwar
    Baelde, Aurelien
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 890 - 897
  • [43] Speech recognition of a named entity
    Tomita, T
    Okimoto, Y
    Yamamoto, H
    Sagisaka, Y
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
  • [44] Named Entity Recognition in Query
    Guo, Jiafeng
    Xu, Gu
    Cheng, Xueqi
    Li, Hang
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 267 - 274
  • [45] Gazetteer-Enhanced Attentive Neural Networks for Named Entity Recognition
    Lin, Hongyu
    Lu, Yaojie
    Han, Xianpei
    Sun, Le
    Dong, Bin
    Jiang, Shanshan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6232 - 6237
  • [46] Enhanced Cascading Recognition with Positional Labels for Chinese Medicine Named Entity
    Wang, Xuyang
    Zhao, Lijie
    Zhang, Jiyuan
    Computer Engineering and Applications, 2024, 60 (02) : 121 - 128
  • [47] Gazetteer Enhanced Named Entity Recognition for Code-Mixed WebQueries
    Fetahu, Besnik
    Fang, Anjie
    Rokhlenko, Oleg
    Malmasi, Shervin
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1677 - 1681
  • [48] Enhanced neurologic concept recognition using a named entity recognition model based on transformers
    Azizi, Sima
    Hier, Daniel B.
    Wunsch II, Donald C. C.
    FRONTIERS IN DIGITAL HEALTH, 2022, 4
  • [49] Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition
    Tan, Chuanqi
    Qiu, Wei
    Chen, Mosha
    Wang, Rui
    Huang, Fei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9016 - 9023
  • [50] A Supervised Named Entity Recognition Method Based on Pattern Matching and Semantic Verification
    Gao, Nan
    Zhu, Zhenyang
    Weng, Zhengqiu
    Chen, Guolang
    Zhang, Min
    JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (07): : 1917 - 1928