A comparative study of Chinese named entity recognition with different segment representations

Cited by: 0
Authors
Jun Pan
Chaohua Zhang
Haijun Wang
Zongda Wu
Affiliations
[1] Zhejiang University of Science and Technology, Laboratory of Artificial Intelligence, School of Science
[2] Taizhou University, School of Electronic and Information Engineering
[3] Shaoxing University, Department of Computer Science and Engineering
Source
Applied Intelligence | 2022 / Vol. 52
Keywords
Named entity recognition; Segment representation; Machine learning; Neural network
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Named entity recognition (NER) is a fundamental task in natural language processing and has been widely studied. Nevertheless, little attention has been given to the segment representation (SR) schemes used to map multi-token entities into categories in Chinese NER. To address this gap, we explore and compare the impact of different SR schemes on Chinese NER. Our experiments are conducted on four benchmark Chinese NER datasets, relabeled to cover seven well-known SR schemes: IO, IOB2, IOE2, IOBES, BI, IE, and BIES. All seven SR schemes are investigated with two sets of classifiers: machine learning-based and neural network-based. The experimental results demonstrate that selecting the best SR scheme is a complicated problem that depends on various factors, such as corpus size, corpus distribution, and the chosen classifier. We also provide a comparative analysis of the time consumption of each classifier under the different SR schemes and discuss the impact of SR schemes on NER in Chinese and other languages.
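To make the seven SR schemes named in the abstract concrete, the following is a minimal sketch that encodes entity spans into each scheme. It is not the authors' code: the abstract does not define the schemes, so the tagging rules here follow how they are commonly described in the SR literature and should be read as an assumption. In particular, BI, IE, and BIES are assumed to also segment non-entity text (B-O, I-O, E-O, S-O). The function names `encode` and `_segments` and the eight-character example sentence are hypothetical, chosen only for illustration.

```python
from typing import List, Tuple

# (start, end_exclusive, entity_type); spans are assumed not to overlap.
Span = Tuple[int, int, str]

SCHEMES = ("IO", "IOB2", "IOE2", "IOBES", "BI", "IE", "BIES")


def _segments(n: int, spans: List[Span]) -> List[Span]:
    """Split positions 0..n-1 into contiguous segments, labelling gaps 'O'."""
    segs: List[Span] = []
    pos = 0
    for start, end, label in sorted(spans):
        if start > pos:
            segs.append((pos, start, "O"))
        segs.append((start, end, label))
        pos = end
    if pos < n:
        segs.append((pos, n, "O"))
    return segs


def encode(n: int, spans: List[Span], scheme: str) -> List[str]:
    """Return one tag per token for the given scheme (assumed definitions)."""
    if scheme not in SCHEMES:
        raise ValueError(f"unknown scheme: {scheme}")
    use_b = scheme in {"IOB2", "IOBES", "BI", "BIES"}   # mark segment start
    use_e = scheme in {"IOE2", "IOBES", "IE", "BIES"}   # mark segment end
    use_s = scheme in {"IOBES", "BIES"}                 # mark single-token segments
    tag_outside = scheme in {"BI", "IE", "BIES"}        # segment non-entity text too

    tags: List[str] = []
    for start, end, label in _segments(n, spans):
        length = end - start
        if label == "O" and not tag_outside:
            tags.extend(["O"] * length)
            continue
        for i in range(length):
            if use_s and length == 1:
                prefix = "S"
            elif use_b and i == 0:
                prefix = "B"
            elif use_e and i == length - 1:
                prefix = "E"
            else:
                prefix = "I"
            tags.append(f"{prefix}-{label}")
    return tags


# Hypothetical example: character-level tokens of "浙江大学位于杭州",
# with "浙江大学" as ORG (positions 0-3) and "杭州" as LOC (positions 6-7).
chars = list("浙江大学位于杭州")
spans = [(0, 4, "ORG"), (6, 8, "LOC")]
for scheme in SCHEMES:
    print(f"{scheme:6s}", " ".join(encode(len(chars), spans, scheme)))
```

Under these assumed definitions, the same sentence yields anywhere from three distinct tags (IO) to six (BIES) in this example, which illustrates why the choice of scheme changes the size of the label set a classifier must learn.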
Pages: 12457-12469
Number of pages: 12