A comparative study of Chinese named entity recognition with different segment representations

被引:0
|
作者
Jun Pan
Chaohua Zhang
Haijun Wang
Zongda Wu
机构
[1] Zhejiang University of Science and Technology,Laboratory of Artificial Intelligence, School of Science
[2] Taizhou University,School of Electronic and Information Engineering
[3] Shaoxing University,Department of Computer Science and Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Named entity recognition; Segment representation; Machine learning; Neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Named entity recognition (NER) is a fundamental but crucial task in the field of natural language processing and has been widely studied. Nevertheless, little attention has been given to the segment representation (SR) schemes used to map multi-token entities into categories in Chinese NER. To address this issue, in this paper, we explore and compare the impact of using different SR schemes on Chinese NER. Our experiments are conducted on four benchmark Chinese NER datasets extended with labels to include seven well-known SR schemes: IO, IOB2, IOE2, IOBES, BI, IE, and BIES. Moreover, all seven SR schemes are investigated via two sets of classifiers: machine learning-based and neural network-based classifiers. The experimental results demonstrate that the proper selection of the best SR scheme is a complicated problem that depends on various factors, such as corpus size, corpus distribution, and the chosen classifier. We also provide a comparative analysis of the time consumption of each classifier in different SR schemes and discuss the impacts of using different SR schemes on NER in Chinese and other languages.
引用
收藏
页码:12457 / 12469
页数:12
相关论文
共 50 条
  • [41] Research on Chinese Named Entity Recognition Based on Ontology
    Chang, Weili
    Luo, Fang
    Qian, Jilai
    MECHANICAL ENGINEERING AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2012, 195-196 : 1180 - 1185
  • [42] Chinese Named Entity Recognition with Inducted Context Patterns
    Pang, Wenbo
    Fan, Xiaozhong
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 608 - 611
  • [43] SciCN: A Scientific Dataset for Chinese Named Entity Recognition
    Yang, Jing
    Ji, Bin
    Li, Shasha
    Ma, Jun
    Yu, Jie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 4303 - 4315
  • [44] Chinese Named Entity Recognition with CRFs: Two Levels
    Hu, Hongping
    Zhang, Hui
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 564 - 569
  • [45] A Sequence Transformation Model for Chinese Named Entity Recognition
    Wang, Qingyue
    Song, Yanjing
    Liu, Hao
    Cao, Yanan
    Liu, Yanbing
    Guo, Li
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 491 - 502
  • [46] Exploiting Multiple Embeddings for Chinese Named Entity Recognition
    Xu, Canwen
    Wang, Feiyang
    Han, Jialong
    Li, Chenliang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2269 - 2272
  • [47] Survey on Chinese named entity recognition with deep learning
    Kang Y.
    Sun L.
    Zhu R.
    Li M.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 44 - 53
  • [48] Chinese Chemical Named Entity Recognition Based on Morpheme
    Wang, Guirong
    Xia, Bo
    Xiao, Ye
    Rao, Gaoqi
    Xun, Endong
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 247 - 252
  • [49] Utilizing Chinese Dictionary Information in Named Entity Recognition
    Hu, Yun
    Liao, Mingxue
    Lv, Pin
    Zheng, Changwen
    COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 17 - 26
  • [50] Token Relation Aware Chinese Named Entity Recognition
    Huang, Zeyu
    Rong, Wenge
    Zhang, Xiaofeng
    Ouyang, Yuanxin
    Lin, Chenghua
    Xiong, Zhang
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)