BERT-based chinese text classification for emergency management with a novel loss function

被引:22
|
作者
Wang, Zhongju [1 ,2 ]
Wang, Long [1 ,2 ,3 ]
Huang, Chao [1 ,2 ]
Sun, Shutong [4 ]
Luo, Xiong [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
关键词
Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;
D O I
10.1007/s10489-022-03946-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an automatic Chinese text categorization method for solving the emergency event report classification problem. Since the bidirectional encoder representations from transformers (BERT) has achieved great success in the natural language processing domain, it is employed to derive emergency text features in this study. To overcome the data imbalance problem in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. Meanwhile, in order to avoid the negative impacts of the extreme learning rate, the Adabound optimization algorithm that achieves a gradual smooth transition from Adam optimizer to stochastic gradient descent optimizer is employed to learn the parameters of the model. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. Furthermore, the generic BERT, BERT ensemble LSTM-BERT (BERT-LB), Attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. Meanwhile, sampling methods, including random sampling, ADASYN, synthetic minority over-sampling techniques (SMOTE), and Borderline-SMOTE, are employed to verify the performance of the proposed loss function on the imbalance dataset. Compared with benchmarking methods, the proposed method has achieved the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 values. Therefore, it is promising to employ the proposed method for real applications in smart emergency management systems.
引用
收藏
页码:10417 / 10428
页数:12
相关论文
共 50 条
  • [21] Hierarchical graph-based text classification framework with contextual node embedding and BERT-based dynamic fusion
    Onan, Aytug
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [22] Realistic Image Generation from Text by Using BERT-Based Embedding
    Na, Sanghyuck
    Do, Mirae
    Yu, Kyeonah
    Kim, Juntae
    ELECTRONICS, 2022, 11 (05)
  • [23] Biomedical Abstract Sentence Classification by BERT-Based Reading Comprehension
    Jiang C.-Y.
    Fan Y.-C.
    SN Computer Science, 4 (4)
  • [24] Multimodal Prediction of Social Responsiveness Score with BERT-Based Text Features
    Saga, Takeshi
    Tanaka, Hiroki
    Iwasaka, Hidemi
    Nakamura, Satoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (03) : 578 - 586
  • [25] Interpreting BERT-based Text Similarity via Activation and Saliency Maps
    Malkiel, Itzik
    Ginzburg, Dvir
    Barkan, Oren
    Caciularu, Avi
    Weill, Jonathan
    Koenigstein, Noam
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3259 - 3268
  • [26] A Long-Text Classification Method of Chinese News Based on BERT and CNN
    Chen, Xinying
    Cong, Peimin
    Lv, Shuo
    IEEE ACCESS, 2022, 10 : 34046 - 34057
  • [27] BERT-based Transfer Learning in Sentence-level Anatomic Classification of Free-Text Radiology Reports
    Nishigaki, Daiki
    Suzuki, Yuki
    Wataya, Tomohiro
    Kita, Kosuke
    Yamagata, Kazuki
    Sato, Junya
    Kido, Shoji
    Tomiyama, Noriyuki
    RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2023, 5 (02)
  • [28] Assessing the use of attention weights to interpret BERT-based stance classification
    Cordova Saenz, Carlos Abel
    Becker, Karin
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 194 - 201
  • [29] Transformer models for text-based emotion detection: a review of BERT-based approaches
    Acheampong, Francisca Adoma
    Nunoo-Mensah, Henry
    Chen, Wenyu
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) : 5789 - 5829
  • [30] Transformer models for text-based emotion detection: a review of BERT-based approaches
    Francisca Adoma Acheampong
    Henry Nunoo-Mensah
    Wenyu Chen
    Artificial Intelligence Review, 2021, 54 : 5789 - 5829