Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision

被引:0
|
作者
Cai, Yi [1 ]
Su, Shuai [1 ]
Li, Zheng [2 ]
Han, Qinglong [2 ]
Zhang, Jianxia [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Beijing, Peoples R China
[3] China Construct Third Bur Digitalizat Engn CO LTD, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail fault texts; Named entity recognition; Distant supervision;
D O I
10.1109/ITSC57777.2023.10422388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most faults in rail field are recorded as texts, and neural network requires a large amount of labeled data which is used to mine and analyse the texts. However, manually labeled datasets are costly to obtain, so it is necessary to train a better model capable of recognising entities from small batches of manually annotated data. In this paper, a named entity recognition model based on large batches of distantly supervised data and small batches of manually annotated datasets is proposed, which increased the character representation. A reinforcement learning selector is used in the model to filter the distantly supervised data and a BERT encoder is implemented to enhance the character representation capability. Finally, the experiments on a real railway fault datasets are conducted with our proposed model, and the result shows that the model proposed in this paper outperforms other baseline models significantly, and is more adaptive with both reduced datasets.
引用
收藏
页码:939 / 944
页数:6
相关论文
共 50 条
  • [31] Research on Named Entity Recognition Methods for Urban Underground Space Disasters Based on Text Information Extraction
    Li, Zhaowen
    Zhang, Xuedong
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 547 - 552
  • [32] Research on Named Entity Recognition Based on Gated Interaction Mechanisms
    Liu, Bin
    Chen, Wanyuan
    Tao, Jialing
    He, Lei
    Tang, Dan
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [33] Nested named entity recognition in historical archive text
    Byrne, Kate
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 589 - 596
  • [34] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01)
  • [35] Named Entity Recognition in Unstructured Medical Text Documents
    Pearson, Cole
    Seliya, Naeem
    Dave, Rushit
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 412 - 417
  • [36] Named Entity Recognition for Russian Judicial Rulings Text
    Averina, Maria
    Levanova, Olga
    Kasatkina, Natalia
    2022 32ND CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2022, : 49 - 55
  • [37] Named Entity Recognition in Twitter Using Images and Text
    Esteves, Diego
    Peres, Rafael
    Lehmann, Jens
    Napolitano, Giulio
    CURRENT TRENDS IN WEB ENGINEERING, ICWE 2017, 2018, 10544 : 191 - 199
  • [38] Named Entity Recognition Method for Process Planning Text
    Dong H.
    Li Y.
    Qiao L.
    Huang Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (02): : 313 - 320
  • [39] Research on Named Entity Recognition Method Based on BERT Model
    Xie, Shaopeng
    2024 IEEE 10TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND MACHINE LEARNING APPLICATIONS, BIGDATASERVICE 2024, 2024, : 92 - 96
  • [40] KrNER : A Novel Named Entity Recognition Method Based on Knowledge Enhancement and Remote Supervision
    Du, Jinhua
    Yin, Hao
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2323 - 2332