BERT-based Chinese text classification for emergency management with a novel loss function

Cited by: 22
Authors
Wang, Zhongju [1, 2]
Wang, Long [1, 2, 3]
Huang, Chao [1, 2]
Sun, Shutong [4]
Luo, Xiong [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
Keywords
Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;
DOI
10.1007/s10489-022-03946-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper proposes an automatic Chinese text categorization method for classifying emergency event reports. Because bidirectional encoder representations from transformers (BERT) has achieved great success in natural language processing, it is used in this study to derive features from emergency texts. To overcome the imbalance in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. To avoid the negative impact of extreme learning rates, the AdaBound optimization algorithm, which makes a gradual, smooth transition from the Adam optimizer to the stochastic gradient descent optimizer, is used to learn the model parameters. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. The generic BERT, BERT ensemble LSTM-BERT (BERT-LB), attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. In addition, sampling methods, including random sampling, ADASYN, the synthetic minority over-sampling technique (SMOTE), and Borderline-SMOTE, are used to verify the performance of the proposed loss function on the imbalanced dataset. Compared with the benchmark methods, the proposed method achieves the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 score. The proposed method is therefore promising for real applications in smart emergency management systems.
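The abstract reports accuracy together with weighted average precision, recall, and F1. The following is a minimal sketch of how such support-weighted metrics are commonly computed, where each class's score is weighted by its share of the true labels. The function name `weighted_metrics` and the category labels in the example are hypothetical, chosen only for illustration; this is not the paper's evaluation code.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Return (accuracy, weighted precision, weighted recall, weighted F1).

    Each per-class score is weighted by that class's support, i.e. the
    fraction of true labels belonging to the class.
    """
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    n = len(y_true)
    support = Counter(y_true)          # true count per class
    predicted = Counter(y_pred)        # predicted count per class
    true_pos = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    precision = recall = f1 = 0.0
    for label, count in support.items():
        # Precision is defined as 0 when the class is never predicted.
        p = true_pos[label] / predicted[label] if predicted[label] else 0.0
        r = true_pos[label] / count
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = count / n
        precision += weight * p
        recall += weight * r
        f1 += weight * f

    accuracy = sum(true_pos.values()) / n
    return accuracy, precision, recall, f1


# Hypothetical imbalanced example: 8 reports of one category, 2 of another.
y_true = ["fire"] * 8 + ["flood"] * 2
y_pred = ["fire"] * 8 + ["fire", "flood"]
acc, prec, rec, f1 = weighted_metrics(y_true, y_pred)
```

With support weighting, the weighted average recall always equals the accuracy, so on an imbalanced dataset the weighted precision and F1 are the values that add information beyond accuracy.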
Pages: 10417-10428
Number of pages: 12