An attention mechanism and multi-granularity-based Bi-LSTM model for Chinese Q&A system

被引:1
|
作者
Xiao-mei Yu
Wen-zhi Feng
Hong Wang
Qian Chu
Qi Chen
机构
[1] Shandong Normal University,School of Information Science and Engineering
[2] Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
NLP; Artificial intelligence; Long short-term memory; Question-answering system;
D O I
暂无
中图分类号
学科分类号
摘要
Natural language processing (NLP) is one of the key techniques in intelligent question-answering (Q&A) systems. Although recurrent neural networks and long short-term memory (LSTM) networks exhibit obvious advantages on well-known English Q&A datasets, they still suffer from several defects including indeterminateness, polysemy and the lack of changing morphology in Chinese, which results in complex NLP on large and diverse Chinese Q&A datasets. In this paper, we first analyze limitations of applying LSTM and bidirectional LSTM (Bi-LSTM) models to noisy Chinese Q&A datasets. Then, we focus on integrating attention mechanisms and multi-granularity word segmentation into Bi-LSTM and propose an attention mechanism and multi-granularity-based Bi-LSTM model (AM–Bi-LSTM) which combines the improved attention mechanism with a novel processing of multi-granularity word segmentation to handle the complex NLP in Chinese Q&A datasets. Furthermore, similarity of questions and answers is formulated to implement the quantitative computation which helps to achieve better performance in Chinese Q&A systems. Finally, we verify the proposed model on a noisy Chinese Q&A dataset. The experimental results demonstrate that the novel AM–Bi-LSTM model achieves significant improvement on evaluation metrics of accuracy, mean average precision and so on. Moreover, the experimental results indicate that the novel AM–Bi-LSTM model outperforms baseline methods and other LSTM-based models.
引用
收藏
页码:5831 / 5845
页数:14
相关论文
共 50 条
  • [21] Attention-based Bi-LSTM Model for Anomalous HTTP Traffic Detection
    Yu, Yuqi
    Liu, Guannan
    Yan, Hanbing
    Li, Hong
    Guan, Hongchao
    2018 15TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2018,
  • [22] Extraction and Classification of TCM Medical Records Based on BERT and Bi-LSTM With Attention Mechanism
    Hui, Ye
    Du, Lin
    Lin, Shuyuan
    Qu, Yiqian
    Cao, Dong
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1626 - 1631
  • [23] Epileptic Seizure Detection Based on Path Signature and Bi-LSTM Network With Attention Mechanism
    Tang, Yixuan
    Wu, Qianyi
    Mao, Haifeng
    Guo, Lihua
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32 : 304 - 313
  • [24] Chinese Medical Q&A Matching Model Based on Multi-Granularity Semantic Information and Knowledge Graph
    Guan, Liben
    Li, Shi
    Computer Engineering and Applications, 2024, 60 (14) : 152 - 161
  • [25] Forecasting Bus Passenger Flow Using Bi-LSTM with Attention Mechanism
    Chen, Tao
    Fang, Jie
    Liu, Zhijia
    Xiao, Pinghui
    CICTP 2021: ADVANCED TRANSPORTATION, ENHANCED CONNECTION, 2021, : 54 - 65
  • [26] Sentiment Analysis Model Using Word2vec, Bi-LSTM and Attention Mechanism
    Jaca-Madariaga, M.
    Zarrabeitia-Bilbao, E.
    Rio-Belver, R. M.
    Moens, M. F.
    IOT AND DATA SCIENCE IN ENGINEERING MANAGEMENT, 2023, 160 : 239 - 244
  • [27] Attention Based Graph Bi-LSTM Networks for Traffic Forecasting
    Zhao, Han
    Yang, Huan
    Wang, Yu
    Wang, Danwei
    Su, Rong
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [28] An Attention Based Bi-LSTM DenseNet Model for Named Entity Recognition in English Texts
    VeeraSekharReddy, B.
    Rao, Koppula Srinivas
    Koppula, Neerja
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 130 (02) : 1435 - 1448
  • [29] ABCNet: A comprehensive highway visibility prediction model based on attention, Bi-LSTM and CNN
    Li, Wen
    Yang, Xuekun
    Yuan, Guowu
    Xu, Dan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4397 - 4420
  • [30] An Attention Based Bi-LSTM DenseNet Model for Named Entity Recognition in English Texts
    B. VeeraSekharReddy
    Koppula Srinivas Rao
    Neerja Koppula
    Wireless Personal Communications, 2023, 130 : 1435 - 1448