HSAN: A HIERARCHICAL SELF-ATTENTION NETWORK FOR MULTI-TURN DIALOGUE GENERATION

被引:9
|
作者
Kong, Yawei [1 ,2 ]
Zhang, Lu [1 ,2 ]
Ma, Can [2 ]
Cao, Cong [2 ]
机构
[1] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
关键词
Dialogue generation; Hierarchical network; Self attention;
D O I
10.1109/ICASSP39728.2021.9413753
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the multi-turn dialogue system, response generation is not only related to the sentences in context but also relies on the words in each utterance. Although there are lots of methods that pay attention to model words and utterances, there still exist problems such as tending to generate common responses. In this paper, we propose a hierarchical self-attention network, named HSAN, which attends to the important words and utterances in context simultaneously. Firstly, we use the hierarchical encoder to update the word and utterance representations with their position information respectively. Secondly, the response representations are updated by the mask self-attention module in the decoder. Finally, the relevance between utterances and response is computed by another self-attention module and used for the next response decoding process. In terms of automatic metrics and human judgements, experimental results show that HSAN significantly outperforms all baselines on two common public datasets.
引用
收藏
页码:7433 / 7437
页数:5
相关论文
共 50 条
  • [1] ReCoSa: Detecting the Relevant Contexts with Self-Attention for Multi-turn Dialogue Generation
    Zhang, Hainan
    Lan, Yanyan
    Pang, Liang
    Guo, Jiafeng
    Cheng, Xueqi
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3721 - 3730
  • [2] Multi-turn dialogue model based on the improved hierarchical recurrent attention network
    Miao J.
    Wu J.
    International Journal for Engineering Modelling, 2021, 34 (02) : 17 - 29
  • [3] A Hierarchical Structured Multi-Head Attention Network for Multi-Turn Response Generation
    Lin, Fei
    Zhang, Cong
    Liu, Shengqiang
    Ma, Hong
    IEEE ACCESS, 2020, 8 : 46802 - 46810
  • [4] HKA: A HIERARCHICAL KNOWLEDGE ATTENTION MECHANISM FOR MULTI-TURN DIALOGUE SYSTEM
    Song, Jian
    Zhang, Kailai
    Zhou, Xuesi
    Wu, Ji
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3512 - 3516
  • [5] Multi-turn Dialogue Generation Model with Dialogue Structure
    Jiang X.-T.
    Wang Z.-Q.
    Li S.-S.
    Zhou G.-D.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (11): : 4239 - 4250
  • [6] Dialogue Generation Using Self-Attention Generative Adversarial Network
    Hatua, Amartya
    Nguyen, Trung T.
    Sung, Andrew H.
    2019 IEEE INTERNATIONAL CONFERENCE ON CONVERSATIONAL DATA & KNOWLEDGE ENGINEERING (CDKE), 2019, : 33 - 38
  • [7] Envisioning Future from the Past: Hierarchical Duality Learning for Multi-Turn Dialogue Generation
    Lv, Ang
    Li, Jinpeng
    Xie, Shufang
    Yan, Rui
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7382 - 7394
  • [8] Modeling Topical Relevance for Multi-Turn Dialogue Generation
    Zhang, Hainan
    Lan, Yanyan
    Pang, Liang
    Chen, Hongshen
    Ding, Zhuoye
    Yin, Dawei
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3737 - 3743
  • [9] Multi-Level Curriculum Learning for Multi-Turn Dialogue Generation
    Chen, Guanhua
    Zhan, Runzhe
    Wong, Derek F.
    Chao, Lidia S.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3958 - 3967
  • [10] Memory Graph with Message Rehearsal for Multi-Turn Dialogue Generation
    Cai, Xiaoyu
    Fu, Yao
    Zhao, Hong
    Jiang, Weihao
    Pu, Shiliang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 108 - 117