Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision

被引:0
|
作者
Cai, Yi [1 ]
Su, Shuai [1 ]
Li, Zheng [2 ]
Han, Qinglong [2 ]
Zhang, Jianxia [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Beijing, Peoples R China
[3] China Construct Third Bur Digitalizat Engn CO LTD, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail fault texts; Named entity recognition; Distant supervision;
D O I
10.1109/ITSC57777.2023.10422388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most faults in rail field are recorded as texts, and neural network requires a large amount of labeled data which is used to mine and analyse the texts. However, manually labeled datasets are costly to obtain, so it is necessary to train a better model capable of recognising entities from small batches of manually annotated data. In this paper, a named entity recognition model based on large batches of distantly supervised data and small batches of manually annotated datasets is proposed, which increased the character representation. A reinforcement learning selector is used in the model to filter the distantly supervised data and a BERT encoder is implemented to enhance the character representation capability. Finally, the experiments on a real railway fault datasets are conducted with our proposed model, and the result shows that the model proposed in this paper outperforms other baseline models significantly, and is more adaptive with both reduced datasets.
引用
收藏
页码:939 / 944
页数:6
相关论文
共 50 条
  • [21] Named Entity Recognition for Short Text Messages
    Ek, Tobias
    Kirkegaard, Camilla
    Jonsson, Hakan
    Nugues, Pierre
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 178 - 187
  • [22] Product named entity recognition in Chinese text
    Zhao, Jun
    Liu, Feifan
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (02) : 197 - 217
  • [23] Research on Chinese Named Entity Recognition Based on Ontology
    Chang, Weili
    Luo, Fang
    Qian, Jilai
    MECHANICAL ENGINEERING AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2012, 195-196 : 1180 - 1185
  • [24] Extraction of Traditional Chinese Medicine Entity: Design of a Novel Span-Level Named Entity Recognition Method With Distant Supervision
    Jia, Qi
    Zhang, Dezheng
    Xu, Haifeng
    Xie, Yonghong
    JMIR MEDICAL INFORMATICS, 2021, 9 (06)
  • [25] One Class per Named Entity: Exploiting Unlabeled Text for Named Entity Recognition
    Wong, Yingchuan
    Ng, Hwee Tou
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1763 - 1768
  • [26] Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
    Hu, Xuming
    Jiang, Yong
    Liu, Aiwei
    Huang, Zhongqiang
    Xie, Pengjun
    Huang, Fei
    Wen, Lijie
    Yu, Philip S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9072 - 9087
  • [27] Debiased and Denoised Entity Recognition from Distant Supervision
    Wang, Haobo
    Dong, Yiwen
    Xiao, Ruixuan
    Huang, Fei
    Chen, Gang
    Zhao, Junbo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] Named Entity Recognition of Chinese Agricultural Text Based on Attention Mechanism
    Zhao, Pengfei
    Zhao, Chunjiang
    Wu, Huarui
    Wang, Wei
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (01): : 185 - 192
  • [29] Entity Recognition by Distant Supervision with Soft List Constraint
    Tu, Hongkui
    Ma, Zongyang
    Sun, Aixin
    Xu, Zhiqiang
    Wang, Xiaodong
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 681 - 694
  • [30] Named Entity Recognition Method for Fault Knowledge based on Deep Learning
    Chen, Zhicheng
    Liu, Xiaobao
    Yin, Yanchao
    Lu, Hongbiao
    ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 1 - 4