A comparative study of Chinese named entity recognition with different segment representations

被引:0
|
作者
Jun Pan
Chaohua Zhang
Haijun Wang
Zongda Wu
机构
[1] Zhejiang University of Science and Technology,Laboratory of Artificial Intelligence, School of Science
[2] Taizhou University,School of Electronic and Information Engineering
[3] Shaoxing University,Department of Computer Science and Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Named entity recognition; Segment representation; Machine learning; Neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Named entity recognition (NER) is a fundamental but crucial task in the field of natural language processing and has been widely studied. Nevertheless, little attention has been given to the segment representation (SR) schemes used to map multi-token entities into categories in Chinese NER. To address this issue, in this paper, we explore and compare the impact of using different SR schemes on Chinese NER. Our experiments are conducted on four benchmark Chinese NER datasets extended with labels to include seven well-known SR schemes: IO, IOB2, IOE2, IOBES, BI, IE, and BIES. Moreover, all seven SR schemes are investigated via two sets of classifiers: machine learning-based and neural network-based classifiers. The experimental results demonstrate that the proper selection of the best SR scheme is a complicated problem that depends on various factors, such as corpus size, corpus distribution, and the chosen classifier. We also provide a comparative analysis of the time consumption of each classifier in different SR schemes and discuss the impacts of using different SR schemes on NER in Chinese and other languages.
引用
收藏
页码:12457 / 12469
页数:12
相关论文
共 50 条
  • [1] A comparative study of Chinese named entity recognition with different segment representations
    Pan, Jun
    Zhang, Chaohua
    Wang, Haijun
    Wu, Zongda
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12457 - 12469
  • [2] Segment Representations in Named Entity Recognition
    Konkol, Michal
    Konopik, Miloslav
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 61 - 70
  • [3] Named entity recognition with multiple segment representations
    Cho, Han-Cheol
    Okazaki, Naoaki
    Miwa, Makoto
    Tsujii, Jun'ichi
    INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (04) : 954 - 965
  • [4] A Comparative Study of Segment Representation for Biomedical Named Entity Recognition
    Shashirekha, H. L.
    Nayel, Hamada A.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1046 - 1052
  • [5] A Comparative Study of Named Entity Recognition for Telugu
    Gorla, SaiKiranmai
    Murthy, N. L. Bhanu
    Malapati, Aruna
    PROCEEDINGS OF THE 9TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2017), 2017, : 21 - 24
  • [6] A comparative study for biomedical named entity recognition
    Xu Wang
    Chen Yang
    Renchu Guan
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 373 - 382
  • [7] A comparative study for biomedical named entity recognition
    Wang, Xu
    Yang, Chen
    Guan, Renchu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 373 - 382
  • [8] A Comparative Study of Named Entity Recognition on Myanmar Language
    Nandar, Tin Latt
    Soe, Thinn Lai
    Soe, Khin Mar
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 60 - 64
  • [9] Deep Span Representations for Named Entity Recognition
    Zhu, Enwei
    Liu, Yiyang
    Li, Jinpeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10565 - 10582
  • [10] Chinese Governmental Named Entity Recognition
    Liu, Qi
    Wang, Dong
    Zhou, Meilin
    Li, Peng
    Qi, Baoyuan
    Bin Wang
    INFORMATION RETRIEVAL TECHNOLOGY (AIRS 2018), 2018, 11292 : 16 - 28