A comparative study of Chinese named entity recognition with different segment representations

Cited by: 0
Authors
Jun Pan
Chaohua Zhang
Haijun Wang
Zongda Wu
Affiliations
[1] Zhejiang University of Science and Technology, Laboratory of Artificial Intelligence, School of Science
[2] Taizhou University, School of Electronic and Information Engineering
[3] Shaoxing University, Department of Computer Science and Engineering
Source
Applied Intelligence | 2022 / Vol. 52
Keywords
Named entity recognition; Segment representation; Machine learning; Neural network
DOI
Not available
Abstract
Named entity recognition (NER) is a fundamental but crucial task in the field of natural language processing and has been widely studied. Nevertheless, little attention has been given to the segment representation (SR) schemes used to map multi-token entities into categories in Chinese NER. To address this issue, in this paper, we explore and compare the impact of using different SR schemes on Chinese NER. Our experiments are conducted on four benchmark Chinese NER datasets extended with labels to include seven well-known SR schemes: IO, IOB2, IOE2, IOBES, BI, IE, and BIES. Moreover, all seven SR schemes are investigated via two sets of classifiers: machine learning-based and neural network-based classifiers. The experimental results demonstrate that the proper selection of the best SR scheme is a complicated problem that depends on various factors, such as corpus size, corpus distribution, and the chosen classifier. We also provide a comparative analysis of the time consumption of each classifier in different SR schemes and discuss the impacts of using different SR schemes on NER in Chinese and other languages.
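To make the idea of a segment representation concrete, the following is a minimal sketch of relabeling one character-level tag sequence from IOB2 into three of the other schemes named in the abstract (IO, IOE2, and IOBES). It is an illustration under common definitions of these schemes, not the authors' code: the function names `iob2_to_spans` and `spans_to_tags`, the example sentence, and the `PER`/`LOC` entity types are all assumptions introduced here. BI, IE, and BIES are left out because they also relabel non-entity tokens.

```python
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, entity_type)


def iob2_to_spans(tags: List[str]) -> List[Span]:
    """Extract entity spans from an IOB2 tag sequence (hypothetical helper).

    A stray I-X tag with no open span is ignored, i.e. treated like O.
    """
    spans: List[Span] = []
    start, etype = None, None
    for i, tag in enumerate(tags):
        # Close the open span when the current tag does not continue it.
        if start is not None and tag != f"I-{etype}":
            spans.append((start, i, etype))
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]
    if start is not None:
        spans.append((start, len(tags), etype))
    return spans


def spans_to_tags(spans: List[Span], length: int, scheme: str) -> List[str]:
    """Render entity spans in the IO, IOB2, IOE2, or IOBES scheme."""
    if scheme not in {"IO", "IOB2", "IOE2", "IOBES"}:
        raise ValueError(f"unsupported scheme: {scheme}")
    tags = ["O"] * length
    for start, end, etype in spans:
        last = end - 1
        # Every entity token starts as I-X; boundary tags are set below.
        for i in range(start, end):
            tags[i] = f"I-{etype}"
        if scheme == "IOB2":
            tags[start] = f"B-{etype}"
        elif scheme == "IOE2":
            tags[last] = f"E-{etype}"
        elif scheme == "IOBES":
            if start == last:
                tags[start] = f"S-{etype}"  # single-token entity
            else:
                tags[start] = f"B-{etype}"
                tags[last] = f"E-{etype}"
    return tags


# Assumed example: the six characters 王 小 明 在 北 京 with two entities.
iob2 = ["B-PER", "I-PER", "I-PER", "O", "B-LOC", "I-LOC"]
spans = iob2_to_spans(iob2)
for scheme in ("IO", "IOB2", "IOE2", "IOBES"):
    print(scheme, spans_to_tags(spans, len(iob2), scheme))
```

Note that IO carries no boundary marker, so two adjacent entities of the same type cannot be told apart after conversion; this is one reason the choice of scheme can interact with the corpus and the classifier.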
Pages: 12457-12469
Page count: 12