Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision

被引:0
|
作者
Cai, Yi [1 ]
Su, Shuai [1 ]
Li, Zheng [2 ]
Han, Qinglong [2 ]
Zhang, Jianxia [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Beijing, Peoples R China
[3] China Construct Third Bur Digitalizat Engn CO LTD, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail fault texts; Named entity recognition; Distant supervision;
D O I
10.1109/ITSC57777.2023.10422388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most faults in rail field are recorded as texts, and neural network requires a large amount of labeled data which is used to mine and analyse the texts. However, manually labeled datasets are costly to obtain, so it is necessary to train a better model capable of recognising entities from small batches of manually annotated data. In this paper, a named entity recognition model based on large batches of distantly supervised data and small batches of manually annotated datasets is proposed, which increased the character representation. A reinforcement learning selector is used in the model to filter the distantly supervised data and a BERT encoder is implemented to enhance the character representation capability. Finally, the experiments on a real railway fault datasets are conducted with our proposed model, and the result shows that the model proposed in this paper outperforms other baseline models significantly, and is more adaptive with both reduced datasets.
引用
收藏
页码:939 / 944
页数:6
相关论文
共 50 条
  • [1] Adversarial training for named entity recognition of rail fault text
    Qu, J.
    Su, S.
    Li, R.
    Wang, G.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1353 - 1358
  • [2] Named Entity Recognition for Cancer Immunology Research Using Distant Supervision
    Hai-Long Trieu
    Miwa, Makoto
    Ananiadou, Sophia
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 171 - 177
  • [3] Named Entity Recognition for Open Domain Data Based on Distant Supervision
    Wu, Junshuang
    Zhang, Richong
    Deng, Ting
    Huai, Jinpeng
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 185 - 197
  • [4] Pattern-enhanced Named Entity Recognition with Distant Supervision
    Wang, Xuan
    Guan, Yingjun
    Zhang, Yu
    Li, Qi
    Han, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 818 - 827
  • [5] Bagging-Based Active Learning Model for Named Entity Recognition with Distant Supervision
    Lee, Sunghee
    Song, Yeongkil
    Choi, Maengsik
    Kim, Harksoo
    2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 321 - 324
  • [6] A template augmented distant supervision framework for Chinese named entity recognition
    Qi, Chengwen
    Laili, Yuanjun
    Ren, Lei
    Zhang, Lin
    Li, Bowen
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2024, 15 (01)
  • [7] Adaptive Named Entity Recognition Using Distant Supervision for Contemporary Written Texts
    Kim, Juae
    Kim, Yejin
    Kang, Sangwoo
    Seo, Jungyun
    IEEE ACCESS, 2021, 9 : 80405 - 80414
  • [8] Data set entity recognition based on distant supervision
    Li, Pengcheng
    Liu, Qikai
    Cheng, Qikai
    Lu, Wei
    ELECTRONIC LIBRARY, 2021, 39 (03): : 435 - 449
  • [9] Adversarial Training Lattice LSTM for Named Entity Recognition of Rail Fault Texts
    Su, Shuai
    Qu, Jia
    Cao, Yuan
    Li, Ruoqing
    Wang, Guang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21201 - 21215
  • [10] Biomedical Named Entity Recognition with Less Supervision
    Ghiasvand, Omid
    Kate, Rohit J.
    2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, : 495 - 495