Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision

被引:0
|
作者
Cai, Yi [1 ]
Su, Shuai [1 ]
Li, Zheng [2 ]
Han, Qinglong [2 ]
Zhang, Jianxia [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Beijing, Peoples R China
[3] China Construct Third Bur Digitalizat Engn CO LTD, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail fault texts; Named entity recognition; Distant supervision;
D O I
10.1109/ITSC57777.2023.10422388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most faults in rail field are recorded as texts, and neural network requires a large amount of labeled data which is used to mine and analyse the texts. However, manually labeled datasets are costly to obtain, so it is necessary to train a better model capable of recognising entities from small batches of manually annotated data. In this paper, a named entity recognition model based on large batches of distantly supervised data and small batches of manually annotated datasets is proposed, which increased the character representation. A reinforcement learning selector is used in the model to filter the distantly supervised data and a BERT encoder is implemented to enhance the character representation capability. Finally, the experiments on a real railway fault datasets are conducted with our proposed model, and the result shows that the model proposed in this paper outperforms other baseline models significantly, and is more adaptive with both reduced datasets.
引用
收藏
页码:939 / 944
页数:6
相关论文
共 50 条
  • [11] Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature
    Wang, Xuan
    Song, Xiangchen
    Li, Bangzheng
    Zhou, Kang
    Li, Qi
    Han, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 491 - 494
  • [12] BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
    Liang, Chen
    Yu, Yue
    Jiang, Haoming
    Er, Siawpeng
    Wang, Ruijia
    Zhao, Tuo
    Zhang, Chao
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1054 - 1064
  • [13] Research on medical text named entity recognition based on Two-stage approach
    Sun, Fuquan
    Xu, Ximeng
    Dong, Xinyi
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 365 - 369
  • [14] Named Entity Recognition of Chinese Text Based on Attention Mechanism
    Shen, Tong-Ping
    Dumlao, Menchita
    Meng, Qing-Quan
    Zhan, Zhong-Hua
    Journal of Network Intelligence, 2023, 8 (02): : 505 - 518
  • [15] Persian Automatic Text Summarization Based on Named Entity Recognition
    Khademi, Mohammad Ebrahim
    Fakhredanesh, Mohammad
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2020,
  • [16] Research on College Academic Text Named Entity Recognition and Dataset Construction
    He, Chen
    Yuan, Yingchun
    Wang, Kejian
    Tao, Jia
    Computer Engineering and Applications, 2023, 59 (22) : 322 - 328
  • [17] Noise Detection for Distant Supervised Named Entity Recognition
    Wang J.
    Wang K.
    Wang H.
    Du W.
    He Z.
    Ruan T.
    Liu J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 916 - 928
  • [18] Product named entity recognition in Chinese text
    Jun Zhao
    Feifan Liu
    Language Resources and Evaluation, 2008, 42 : 197 - 217
  • [19] CHEMNER: Fine-Grained Chemistry Named Entity Recognition with Ontology-Guided Distant Supervision
    Wang, Xuan
    Hu, Vivian
    Song, Xiangchen
    Garg, Shweta
    Xiao, Jinfeng
    Han, Jiawei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5227 - 5240
  • [20] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94