BERT-based Chinese text classification for emergency management with a novel loss function

Cited by: 22
Authors
Wang, Zhongju [1, 2]
Wang, Long [1, 2, 3]
Huang, Chao [1, 2]
Sun, Shutong [4]
Luo, Xiong [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
Keywords
Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;
DOI
10.1007/s10489-022-03946-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes an automatic Chinese text categorization method for classifying emergency event reports. Because bidirectional encoder representations from transformers (BERT) has achieved great success in natural language processing, it is employed in this study to derive features from emergency texts. To overcome the imbalance in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. To avoid the negative impact of extreme learning rates, the model parameters are learned with the AdaBound optimization algorithm, which makes a gradual, smooth transition from the Adam optimizer to the stochastic gradient descent optimizer. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. The generic BERT, BERT ensemble LSTM-BERT (BERT-LB), attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. In addition, sampling methods, including random sampling, ADASYN, the synthetic minority over-sampling technique (SMOTE), and Borderline-SMOTE, are employed to verify the performance of the proposed loss function on the imbalanced dataset. Compared with the benchmark methods, the proposed method achieves the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 score. The proposed method is therefore promising for real applications in smart emergency management systems.
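The abstract's description of AdaBound as a gradual transition from Adam to stochastic gradient descent can be illustrated with a minimal sketch. It is based on the commonly published AdaBound bound functions, not on this paper's code: the function names `adabound_bounds` and `clipped_step_size` are hypothetical, and the values `final_lr=0.1` and `gamma=1e-3` are assumed defaults, not settings reported in the abstract. The idea is that each per-parameter Adam step size is clipped into an interval that starts very wide (Adam-like behavior) and narrows toward a single constant (SGD-like behavior).

```python
def adabound_bounds(step, final_lr=0.1, gamma=1e-3):
    """Return (lower, upper) clipping bounds for the step size at `step` (step >= 1).

    Both bounds converge to `final_lr` as `step` grows; `final_lr` and `gamma`
    are assumed values for illustration only.
    """
    lower = final_lr * (1.0 - 1.0 / (gamma * step + 1.0))
    upper = final_lr * (1.0 + 1.0 / (gamma * step))
    return lower, upper


def clipped_step_size(adam_step_size, step, final_lr=0.1, gamma=1e-3):
    """Clip an Adam-style per-parameter step size into the current bounds."""
    lower, upper = adabound_bounds(step, final_lr, gamma)
    return min(max(adam_step_size, lower), upper)


if __name__ == "__main__":
    # Early steps: the interval is wide, so the adaptive step size passes through.
    # Late steps: the interval is narrow, so every parameter gets nearly final_lr.
    for step in (1, 100, 10_000, 1_000_000):
        lower, upper = adabound_bounds(step)
        print(f"step={step:>9}  lower={lower:.6f}  upper={upper:.6f}")
```

Under these assumed settings, an extreme adaptive step size (very large or very small) is left almost untouched early in training and pulled toward `final_lr` late in training, which is the sense in which extreme learning rates are avoided.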
Pages: 10417-10428
Number of pages: 12