Exemplar Encoder-Decoder for Neural Conversation Generation

被引:0
|
作者
Pandey, Gaurav [1 ]
Contractor, Danish [1 ]
Kumar, Vineet [1 ]
Joshi, Sachindra [1 ]
机构
[1] IBM Res AI, New Delhi, India
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present the Exemplar Encoder-Decoder network (EED), a novel conversation model that learns to utilize similar examples from training data to generate responses. Similar conversation examples (context-response pairs) from training data are retrieved using a traditional TF-IDF based retrieval model. The retrieved responses are used to create exemplar vectors that are used by the decoder to generate the response. The contribution of each retrieved response is weighed by the similarity of corresponding context with the input context. We present detailed experiments on two large data sets and find that our method outperforms state of the art sequence to sequence generative models on several recently proposed evaluation metrics. We also observe that the responses generated by the proposed EED model are more informative and diverse compared to existing state-of-the-art method.
引用
收藏
页码:1329 / 1338
页数:10
相关论文
共 50 条
  • [41] Interpretable Transformations with Encoder-Decoder Networks
    Worrall, Daniel E.
    Garbin, Stephan J.
    Turmukhambetov, Daniyar
    Brostow, Gabriel J.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5737 - 5746
  • [42] Automatic Generation of Chinese Couplets with Attention Based Encoder-Decoder Model
    Yuan, Shengqiong
    Zhong, Luo
    Li, Lin
    Zhang, Rui
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 65 - 70
  • [43] Labeled Data Generation with Encoder-decoder LSTM for Semantic Slot Filling
    Kurata, Gakuto
    Xiang, Bing
    Zhou, Bowen
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 725 - 729
  • [44] Encoder-Decoder Based Route Generation Model for Flexible Travel Recommendation
    Zhang, Jiale
    Ma, Mingqian
    Gao, Xiaofeng
    Chen, Guihai
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 905 - 920
  • [45] Seismic Stratum Segmentation Using an Encoder-Decoder Convolutional Neural Network
    Wang, Detao
    Chen, Guoxiong
    MATHEMATICAL GEOSCIENCES, 2021, 53 (06) : 1355 - 1374
  • [46] Encoder-decoder recurrent network model for interactive character animation generation
    Wang, Yumeng
    Che, Wujun
    Xu, Bo
    VISUAL COMPUTER, 2017, 33 (6-8): : 971 - 980
  • [47] Understanding Geometry of Encoder-Decoder CNNs
    Ye, Jong Chul
    Sung, Woon Kyoung
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [48] Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping
    Sak, Hasim
    Shannon, Matt
    Rao, Kanishka
    Beaufays, Francoise
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1298 - 1302
  • [49] Encoder-Decoder Neural Network with Attention Mechanism for Types Detection in Linked Data
    Hamel, Oussama
    Fareh, Messaouda
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 733 - 739
  • [50] The local ternary pattern encoder-decoder neural network for dental image segmentation
    Salih, Omran
    Duffy, Kevin Jan
    IET IMAGE PROCESSING, 2022, 16 (06) : 1520 - 1530