A Hybrid Neural Network BERT-Cap Based on Pre-Trained Language Model and Capsule Network for User Intent Classification

被引:3
|
作者
Liu, Hai [1 ,2 ]
Liu, Yuanxia [1 ]
Wong, Leung-Pun [3 ]
Lee, Lap-Kei [3 ]
Hao, Tianyong [1 ,4 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510000, Peoples R China
[2] Guangzhou Key Lab Big Data & Intelligent Educ, Guangzhou 510000, Peoples R China
[3] Open Univ Hong Kong, Sch Sci & Technol, Kowloon, Hong Kong 999077, Peoples R China
[4] South China Normal Univ, Inst Adv Study Educ Dev Guangdong Hong Kong Macao, Guangzhou 510000, Peoples R China
基金
中国国家自然科学基金;
关键词
Signal encoding - Semantics - Speech processing - Text processing - Encoding (symbols);
D O I
10.1155/2020/8858852
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
User intent classification is a vital component of a question-answering system or a task-based dialogue system. In order to understand the goals of users' questions or discourses, the system categorizes user text into a set of pre-defined user intent categories. User questions or discourses are usually short in length and lack sufficient context; thus, it is difficult to extract deep semantic information from these types of text and the accuracy of user intent classification may be affected. To better identify user intents, this paper proposes a BERT-Cap hybrid neural network model with focal loss for user intent classification to capture user intents in dialogue. The model uses multiple transformer encoder blocks to encode user utterances and initializes encoder parameters with a pre-trained BERT. Then, it extracts essential features using a capsule network with dynamic routing after utterances encoding. Experiment results on four publicly available datasets show that our model BERT-Cap achieves a F1 score of 0.967 and an accuracy of 0.967, outperforming a number of baseline methods, indicating its effectiveness in user intent classification.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] CAM-BERT: Chinese Aerospace Manufacturing Pre-trained Language Model
    Dai, Jinchi
    Wang, Shengren
    Wang, Peiyan
    Li, Ruiting
    Chen, Jiaxin
    Li, Xinrong
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 361 - 365
  • [22] Predictive Recognition of DNA-binding Proteins Based on Pre-trained Language Model BERT
    Ma, Yue
    Pei, Yongzhen
    Li, Changguo
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2023, 21 (06)
  • [23] Hippocampus segmentation and classification for dementia analysis using pre-trained neural network models
    Priyanka, Ahana
    Ganesan, Kavitha
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2021, 66 (06): : 581 - 592
  • [24] Classification of fresh water Zooplankton by pre-trained convolutional neural network in underwater microscopy
    Hong S.
    Mehdi S.R.
    Huang H.
    Shahani K.
    Zhang Y.
    Junaidullah
    Raza K.
    Khan M.A.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (07): : 252 - 258
  • [25] Classification of defects in wooden structures using pre-trained models of convolutional neural network
    Ehtisham, Rana
    Qayyum, Waqas
    Camp, Charles, V
    Plevris, Vagelis
    Mir, Junaid
    Khan, Qaiser-uz Zaman
    Ahmad, Afaq
    CASE STUDIES IN CONSTRUCTION MATERIALS, 2023, 19
  • [26] Development of a deep learning network using a pre-trained convolutional neural network
    Rooney, M.
    Mitchell, J.
    McLaren, D. B.
    Nailon, W. H.
    RADIOTHERAPY AND ONCOLOGY, 2019, 133 : S1051 - S1052
  • [27] INTENT CLASSIFICATION USING PRE-TRAINED LANGUAGE AGNOSTIC EMBEDDINGS FOR LOW RESOURCE LANGUAGES
    Yadav, Hemant
    Gupta, Akshat
    Rallabandi, Sai Krishna
    Black, Alan W.
    Shah, Rajiv Ratn
    INTERSPEECH 2022, 2022, : 3473 - 3477
  • [28] Classification of Shoulder Implant Manufacturer Using Pre-Trained DenseNet201 Combined With Capsule Network
    Jian, Xianzhong
    Zhou, Zhenling
    Zhang, Wuwen
    INTERNATIONAL JOURNAL OF MEDICAL ROBOTICS AND COMPUTER ASSISTED SURGERY, 2024, 20 (05):
  • [29] A pre-trained convolutional neural network based method for thyroid nodule diagnosis
    Ma, Jinlian
    Wu, Fa
    Zhu, Jiang
    Xu, Dong
    Kong, Dexing
    ULTRASONICS, 2017, 73 : 221 - 230
  • [30] EFFICIENT TEXT ANALYSIS WITH PRE-TRAINED NEURAL NETWORK MODELS
    Cui, Jia
    Lu, Heng
    Wang, Wenjie
    Kang, Shiyin
    He, Liqiang
    Li, Guangzhi
    Yu, Dong
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 671 - 676