An attention mechanism and multi-granularity-based Bi-LSTM model for Chinese Q&A system

Cited by: 1
Authors
Xiao-mei Yu
Wen-zhi Feng
Hong Wang
Qian Chu
Qi Chen
Affiliations
[1] Shandong Normal University, School of Information Science and Engineering
[2] Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology
Source
Soft Computing | 2020 / Vol. 24
Keywords
NLP; Artificial intelligence; Long short-term memory; Question-answering system
DOI
Not available
Abstract
Natural language processing (NLP) is one of the key techniques in intelligent question-answering (Q&A) systems. Although recurrent neural networks and long short-term memory (LSTM) networks show clear advantages on well-known English Q&A datasets, they still struggle with characteristics of Chinese such as indeterminateness, polysemy and the lack of morphological change, which complicate NLP on large and diverse Chinese Q&A datasets. In this paper, we first analyze the limitations of applying LSTM and bidirectional LSTM (Bi-LSTM) models to noisy Chinese Q&A datasets. We then integrate attention mechanisms and multi-granularity word segmentation into Bi-LSTM and propose an attention mechanism and multi-granularity-based Bi-LSTM model (AM–Bi-LSTM), which combines an improved attention mechanism with a novel treatment of multi-granularity word segmentation to handle the complex NLP in Chinese Q&A datasets. Furthermore, the similarity between questions and answers is formulated so that it can be computed quantitatively, which helps to achieve better performance in Chinese Q&A systems. Finally, we verify the proposed model on a noisy Chinese Q&A dataset. The experimental results show that AM–Bi-LSTM achieves significant improvements on evaluation metrics including accuracy and mean average precision, and that it outperforms baseline methods and other LSTM-based models.
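The abstract names mean average precision (MAP) as one of the evaluation metrics. The record does not give the paper's exact evaluation protocol, so the following is only a minimal sketch of the standard MAP definition for ranked answer candidates. The function names, the 0/1 relevance encoding and the sample rankings are all assumptions made for illustration, not taken from the paper.

```python
from typing import List


def average_precision(relevance: List[int]) -> float:
    """Average precision for one question.

    `relevance` lists the candidate answers in ranked order
    (best score first); 1 marks a correct answer, 0 an incorrect one.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(relevance, start=1):
        if is_relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    # Assumption: a question with no correct candidate contributes 0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(ranked_lists: List[List[int]]) -> float:
    """Mean of the per-question average precision values."""
    if not ranked_lists:
        return 0.0
    return sum(average_precision(r) for r in ranked_lists) / len(ranked_lists)


# Hypothetical rankings for three questions (illustrative data only).
rankings = [
    [1, 0, 0, 1],  # correct answers at ranks 1 and 4
    [0, 1, 0, 0],  # correct answer at rank 2
    [0, 0, 0, 0],  # no correct answer among the candidates
]
map_score = mean_average_precision(rankings)
```

In a Q&A setting the ranked order would come from the model's question-answer similarity scores; here the rankings are supplied directly so that the metric can be checked in isolation.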
Pages: 5831 - 5845
Page count: 14
Related papers
50 in total
  • [31] Attention-based Spatialized Word Embedding Bi-LSTM Model for Sentiment Analysis
    Zhu, Kun
    Samsudin, Nur Hana
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2024, 32 (01) : 79 - 98
  • [32] Partial Discharge Detection and Recognition in Insulated Overhead Conductor Based on Bi-LSTM with Attention Mechanism
    Xi, Yanhui
    Zhou, Feng
    Zhang, Weijie
    ELECTRONICS, 2023, 12 (11)
  • [33] Convolutional neural network based on attention mechanism and Bi-LSTM for bearing remaining life prediction
    Luo, Jiahang
    Zhang, Xu
    APPLIED INTELLIGENCE, 2022, 52 (01) : 1076 - 1091
  • [34] Convolutional neural network based on attention mechanism and Bi-LSTM for bearing remaining life prediction
    Jiahang Luo
    Xu Zhang
    Applied Intelligence, 2022, 52 : 1076 - 1091
  • [35] An improved Bi-LSTM method based on heterogeneous features fusion and attention mechanism for ECG recognition
    Song, Chaoyang
    Zhou, Zilong
    Yu, Yue
    Shi, Manman
    Zhang, Jingxiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [36] A Novel CNN-based Bi-LSTM parallel model with attention mechanism for human activity recognition with noisy data
    Xiaochun Yin
    Zengguang Liu
    Deyong Liu
    Xiaojun Ren
    Scientific Reports, 12
  • [37] Research on Chinese Sentiment Analysis Based on Bi-LSTM Networks
    Zhang, Taozheng
    Guo, Jiaqi
    2021 IEEE/ACIS 21ST INTERNATIONAL FALL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2021-FALL), 2021 : 198 - 203
  • [38] The Category Emotion Classification of Chinese Comments Based on BI-LSTM
    Wang, Fugang
    Wang, Xingkai
    Gong, Xueliang
    Liu, Xuan
    Chen, Yu
    Chang, Zirun
    Liu, Zirui
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024 : 98 - 102
  • [39] A Novel CNN-based Bi-LSTM parallel model with attention mechanism for human activity recognition with noisy data
    Yin, Xiaochun
    Liu, Zengguang
    Liu, Deyong
    Ren, Xiaojun
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [40] CoBiCo: A model using multi-stage ConvNet with attention-based Bi-LSTM for efficient sentiment classification
    Ranjan, Roop
    Daniel, A. K.
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2023, 27 (01) : 1 - 24