BERT-based chinese text classification for emergency management with a novel loss function

被引：22

作者：

Wang, Zhongju ^{[1
,2
]}

Wang, Long ^{[1
,2
,3
]}

Huang, Chao ^{[1
,2
]}

Sun, Shutong ^{[4
]}

Luo, Xiong ^{[1
,2
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China

[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China

[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China

[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 09期

关键词：

Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;

D O I：

10.1007/s10489-022-03946-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an automatic Chinese text categorization method for solving the emergency event report classification problem. Since the bidirectional encoder representations from transformers (BERT) has achieved great success in the natural language processing domain, it is employed to derive emergency text features in this study. To overcome the data imbalance problem in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. Meanwhile, in order to avoid the negative impacts of the extreme learning rate, the Adabound optimization algorithm that achieves a gradual smooth transition from Adam optimizer to stochastic gradient descent optimizer is employed to learn the parameters of the model. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. Furthermore, the generic BERT, BERT ensemble LSTM-BERT (BERT-LB), Attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. Meanwhile, sampling methods, including random sampling, ADASYN, synthetic minority over-sampling techniques (SMOTE), and Borderline-SMOTE, are employed to verify the performance of the proposed loss function on the imbalance dataset. Compared with benchmarking methods, the proposed method has achieved the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 values. Therefore, it is promising to employ the proposed method for real applications in smart emergency management systems.

引用

页码：10417 / 10428

页数：12

共 50 条

[31] BERT-based semi-supervised domain adaptation for disastrous classification
Jing Wang
Kexin Wang
Multimedia Systems, 2022, 28 : 2237 - 2246
[32] A BERT-Based Two-Stage Model for Chinese Chengyu Recommendation
Tan, Minghuan
Jiang, Jing
Dai, Bing Tian
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
[33] fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP
Geng, Zhichao
Yan, Hang
Qiu, Xipeng
Huang, Xuanjing
ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 99 - 106
[34] A Study of Bert-Based Sentiment Analysis Algorithm for Short Text Movie Reviews
Gao, Mengshen
Feng, Xiwei
Sun, Enyu
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 106 - 111
[35] BERT-based semi-supervised domain adaptation for disastrous classification
Wang, Jing
Wang, Kexin
MULTIMEDIA SYSTEMS, 2022, 28 (06) : 2237 - 2246
[36] Chinese Grammatical Correction Using BERT-based Pre-trained Model
Wang, Hongfei
Kurosawa, Michiki
Katsumatat, Satoru
Komachi, Mamoru
1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 163 - 168
[37] Bert-Based Chinese Medical Keyphrase Extraction Model Enhanced with External Features
Ding, Liangping
Zhang, Zhixiong
Zhao, Yang
TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 167 - 176
[38] A BERT-Based Artificial Intelligence to Analyze Free-Text Clinical Notes for Binary Classification in Papillary Thyroid Carcinoma Recurrence
Nam, Jahyun
Choi, Jee-Woo
Shin, Yong-Goo
Park, Seung
2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE, 2023,
[39] Anti-Drugs Chatbot: Chinese BERT-Based Cognitive Intent Analysis
Lee, Jui-Hsuan
Wu, Eric Hsiao-Kuang
Ou, Yu-Yen
Lee, Yueh-Che
Lee, Cheng-Hsun
Chung, Chia-Ru
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01): : 514 - 521
[40] BERT-based Regression Model for Micro-edit Humor Classification Task
Chen, Yuancheng
Hou, Yi
Ye, Deqiang
Yu, Yuehang
2021 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, INFORMATION AND COMMUNICATION ENGINEERING, 2021, 11933

← 1 2 3 4 5 →