Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision

被引:0
|
作者
Cai, Yi [1 ]
Su, Shuai [1 ]
Li, Zheng [2 ]
Han, Qinglong [2 ]
Zhang, Jianxia [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Beijing, Peoples R China
[3] China Construct Third Bur Digitalizat Engn CO LTD, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail fault texts; Named entity recognition; Distant supervision;
D O I
10.1109/ITSC57777.2023.10422388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most faults in rail field are recorded as texts, and neural network requires a large amount of labeled data which is used to mine and analyse the texts. However, manually labeled datasets are costly to obtain, so it is necessary to train a better model capable of recognising entities from small batches of manually annotated data. In this paper, a named entity recognition model based on large batches of distantly supervised data and small batches of manually annotated datasets is proposed, which increased the character representation. A reinforcement learning selector is used in the model to filter the distantly supervised data and a BERT encoder is implemented to enhance the character representation capability. Finally, the experiments on a real railway fault datasets are conducted with our proposed model, and the result shows that the model proposed in this paper outperforms other baseline models significantly, and is more adaptive with both reduced datasets.
引用
收藏
页码:939 / 944
页数:6
相关论文
共 50 条
  • [41] Construction of a Geological Fault Corpus and Named Entity Recognition
    Wang, Huainuo
    Niu, Ruiqing
    Han, Yongyao
    Deng, Qinglu
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [42] Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition
    Sun, Wei
    Ji, Shaoxiong
    Denti, Tuulia
    Moen, Hans
    Kerro, Oleg
    Rannikko, Antti
    Marttinen, Pekka
    Koskinen, Miika
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 444 - 459
  • [43] Chinese Named Entity Recognition for Hazard And Operability Analysis Text Based on Albert
    Wang, Zhenhua
    Zhang, Beike
    Gao, Dong
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5641 - 5645
  • [44] Text Summarization based Named Entity Recognition for Certain Application using BERT
    Tummala, Indira Priyadarshini
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 1136 - 1141
  • [45] Benchmarking Named Entity Recognition Approaches for Extracting Research Infrastructure Information from Text
    Cheirmpos, Georgios
    Tabatabaei, Seyed Amin
    Kanoulas, Evangelos
    Tsatsaronis, Georgios
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 131 - 141
  • [46] An Association Rule Mining Method Based on Named Entity Recognition and Text Classification
    He, Bo
    Zhang, Jiru
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1503 - 1511
  • [47] An Association Rule Mining Method Based on Named Entity Recognition and Text Classification
    Bo He
    Jiru Zhang
    Arabian Journal for Science and Engineering, 2023, 48 : 1503 - 1511
  • [48] Named Entity Recognition for Electric Power Industry Based on Enhanced Text Features
    Liu W.
    Hu Z.
    Zhang J.
    Liu X.
    Lin F.
    Yu J.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2022, 46 (21): : 134 - 142
  • [49] Named entity recognition based on equipment and fault field of CNC machine tools
    Wang H.
    Zhu W.-Q.
    Wu Y.-Z.
    He P.-J.
    Wan L.-J.
    Wu, Yue-Zhong (yuezhong.wu@163.com), 1600, Science Press (42): : 476 - 482
  • [50] Survey of Chinese Named Entity Recognition Research
    Zhao, Jigui
    Qian, Yurong
    Wang, Kui
    Hou, Shuxiang
    Chen, Jiaying
    Computer Engineering and Applications, 2024, 60 (01) : 15 - 27