Pattern-enhanced Named Entity Recognition with Distant Supervision

被引:7
|
作者
Wang, Xuan [1 ]
Guan, Yingjun [1 ]
Zhang, Yu [1 ]
Li, Qi [2 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA USA
关键词
named entity recognition; distant supervision; pattern mining; neural network;
D O I
10.1109/BigData50022.2020.9378052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised deep learning methods have achieved state-of-the-art performance on the task of named entity recognition (NER). However, such methods suffer from high cost and low efficiency in training data annotation, leading to highly specialized NER models that cannot be easily adapted to new domains. Recently, distant supervision has been applied to replace human annotation, thanks to the fast development of domain specific knowledge bases. However, the generated noisy labels pose significant challenges in learning effective neural models with distant supervision. We propose PATNER, a distantly supervised NER model that effectively deals with noisy distant supervision from domain-specific dictionaries. PATNER does not require human-annotated training data but only relies on unlabeled data and incomplete domain-specific dictionaries for distant supervision. It incorporates the distant labeling uncertainty into the neural model training to enhance distant supervision. We go beyond the traditional sequence labeling framework and propose a more effective fuzzy neural model using the tie-or break tagging scheme for the NER task. Extensive experiments on three benchmark datasets in two domains demonstrate the power of PATNER. Case studies on two additional real-world datasets demonstrate that PATNER improves the distant NER performance in both entity boundary detection and entity type recognition. The results show a great promise in supporting high quality named entity recognition with domain-specific dictionaries on a wide variety of entity types.
引用
收藏
页码:818 / 827
页数:10
相关论文
共 50 条
  • [21] Named Entity Recognition through Deep Representation Learning and Weak Supervision
    Parker, Jerrod
    Yu, Shi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3828 - 3839
  • [22] Enhanced character embedding for Chinese named entity recognition
    Jia, Bingjing
    Wu, Zhongli
    Wu, Bin
    Liu, Yutong
    Zhou, Pengpeng
    MEASUREMENT & CONTROL, 2020, 53 (9-10): : 1669 - 1681
  • [23] OpeNER: Open Polarity Enhanced Named Entity Recognition
    Agerri, Rodrigo
    Cuadros, Montse
    Gaines, Sean
    Rigau, German
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 215 - 218
  • [24] Lexicon enhanced Chinese named entity recognition with pointer network
    Guo, Qian
    Guo, Yi
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14535 - 14555
  • [25] OpeNER demo: Open Polarity Enhanced Named Entity Recognition
    Garcia-Pablos, Aitor
    Cuadros, Montse
    Gaines, Sean
    Rigau, German
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [26] Enhanced Named Entity Recognition through Joint Dependency Parsing
    Wang, Peng
    Wang, Zhe
    Zhang, Xiaowang
    Wang, Kewen
    Feng, Zhiyong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [27] Enhanced Named Entity Recognition algorithm for financial document verification
    Toprak, Ahmet
    Turan, Metin
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (17): : 19431 - 19451
  • [28] Lexicon enhanced Chinese named entity recognition with pointer network
    Qian Guo
    Yi Guo
    Neural Computing and Applications, 2022, 34 : 14535 - 14555
  • [29] Enhanced Named Entity Recognition algorithm for financial document verification
    Ahmet Toprak
    Metin Turan
    The Journal of Supercomputing, 2023, 79 : 19431 - 19451
  • [30] Pattern acquisition for Chinese named entity recognition: A supervised learning approach
    Fang, XS
    Sheng, HY
    ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 166 - 175