An attention mechanism and multi-granularity-based Bi-LSTM model for Chinese Q&A system

被引:1
|
作者
Xiao-mei Yu
Wen-zhi Feng
Hong Wang
Qian Chu
Qi Chen
机构
[1] Shandong Normal University,School of Information Science and Engineering
[2] Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
NLP; Artificial intelligence; Long short-term memory; Question-answering system;
D O I
暂无
中图分类号
学科分类号
摘要
Natural language processing (NLP) is one of the key techniques in intelligent question-answering (Q&A) systems. Although recurrent neural networks and long short-term memory (LSTM) networks exhibit obvious advantages on well-known English Q&A datasets, they still suffer from several defects including indeterminateness, polysemy and the lack of changing morphology in Chinese, which results in complex NLP on large and diverse Chinese Q&A datasets. In this paper, we first analyze limitations of applying LSTM and bidirectional LSTM (Bi-LSTM) models to noisy Chinese Q&A datasets. Then, we focus on integrating attention mechanisms and multi-granularity word segmentation into Bi-LSTM and propose an attention mechanism and multi-granularity-based Bi-LSTM model (AM–Bi-LSTM) which combines the improved attention mechanism with a novel processing of multi-granularity word segmentation to handle the complex NLP in Chinese Q&A datasets. Furthermore, similarity of questions and answers is formulated to implement the quantitative computation which helps to achieve better performance in Chinese Q&A systems. Finally, we verify the proposed model on a noisy Chinese Q&A dataset. The experimental results demonstrate that the novel AM–Bi-LSTM model achieves significant improvement on evaluation metrics of accuracy, mean average precision and so on. Moreover, the experimental results indicate that the novel AM–Bi-LSTM model outperforms baseline methods and other LSTM-based models.
引用
收藏
页码:5831 / 5845
页数:14
相关论文
共 50 条
  • [41] Improving Dialogue in Intelligent Q&A System in Financial Sector Using Multi-Granularity and Multi-Attention Interactive Matching Model
    Shu, Zhou
    Hao, Wang
    Guoliang, Shi
    Bin, Shi
    Jingwen, Qiu
    Data Analysis and Knowledge Discovery, 2024, 8 (8-9) : 240 - 250
  • [42] Attention-Based Bi-LSTM Network for Abusive Language Detection
    Nelatoori, Kiran Babu
    Kommanti, Hima Bindu
    IETE JOURNAL OF RESEARCH, 2023, 69 (11) : 7884 - 7892
  • [43] Classification and extraction of medical clinical trial screening standard texts based on Bi-LSTM and Attention mechanism
    Du, Lin
    Cao, Dong
    Li, Jinghua
    Ye, Hui
    2020 ASIA CONFERENCE ON GEOLOGICAL RESEARCH AND ENVIRONMENTAL TECHNOLOGY, 2021, 632
  • [44] Power quality disturbance signal segmentation and classification based on modified BI-LSTM with double attention mechanism
    Khetarpal, Poras
    Nagpal, Neelu
    Siano, Pierluigi
    Al-Numay, Mohammed
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, 18 (01) : 50 - 62
  • [45] REMAINING USEFUL LIFE PREDICTION OF AIRCRAFT ENGINE BASED ON BI-LSTM NETWORK INTEGRATED WITH ATTENTION MECHANISM
    Qu, Guixian
    Qiu, Tian
    Ding, Shuiting
    Ma, Long
    Yuan, Qiyu
    Ma, Qinglin
    Si, Yang
    PROCEEDINGS OF ASME TURBO EXPO 2024: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, GT2024, VOL 1, 2024,
  • [46] Short-Term Load Forecasting with an Ensemble Model Using Densely Residual Block and Bi-LSTM Based on the Attention Mechanism
    Chen, Wenhao
    Han, Guangjie
    Zhu, Hongbo
    Liao, Lyuchao
    SUSTAINABILITY, 2022, 14 (24)
  • [47] Web Services Classification Based on Wide & Bi-LSTM Model
    Ye, Hongfan
    Cao, Buqing
    Peng, Zhenlian
    Chen, Ting
    Wen, Yiping
    Liu, Jianxun
    IEEE ACCESS, 2019, 7 : 43697 - 43706
  • [48] Trajectory outlier detection algorithm based on Bi-LSTM model
    Han Z.
    Huang T.
    Ren W.
    Xu G.
    Journal of Radars, 2019, 8 (01) : 36 - 43
  • [49] Ship Trajectory Prediction Model Based on Improved Bi-LSTM
    Li, Weifeng
    Lian, Yifan
    Liu, Yaochen
    Shi, Guoyou
    ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2024, 10 (03):
  • [50] A Novel Wireless Propagation Model Based on Bi-LSTM Algorithm
    Yang, Yu Lu
    Wan, Guo Chun
    Tong, Mei Song
    IEEE ACCESS, 2022, 10 : 43837 - 43847