Neural Machine Translation with Key-Value Memory-Augmented Attention

被引:0
|
作者
Meng, Fandong [1 ]
Tu, Zhaopeng [1 ]
Cheng, Yong [1 ]
Wu, Haiyang [1 ]
Zhai, Junjie [1 ]
Yang, Yuekui [1 ]
Wang, Di [1 ]
机构
[1] Tencent AI Lab, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although attention-based Neural Machine Translation (NMT) has achieved remarkable progress in recent years, it still suffers from issues of repeating and dropping translations. To alleviate these issues, we propose a novel key-value memory-augmented attention model for NMT, called KVMEMATT. Specifically, we maintain a timely updated key-memory to keep track of attention history and a fixed value-memory to store the representation of source sentence throughout the whole translation process. Via nontrivial transformations and iterative interactions between the two memories, the decoder focuses on more appropriate source word(s) for predicting the next target word at each decoding step, therefore can improve the adequacy of translations. Experimental results on Chinese double right arrow English and WMT17 German double left right arrow English translation tasks demonstrate the superiority of the proposed model.
引用
收藏
页码:2574 / 2580
页数:7
相关论文
共 50 条
  • [31] Evaluation and Analysis of In-Memory Key-Value Systems
    Cao, Wenqi
    Sahin, Semih
    Liu, Ling
    Bao, Xianqiang
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 26 - 33
  • [32] Knowledge Tracing with Sequential Key-Value Memory Networks
    Abdelrahman, Ghodai
    Wang, Qing
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 175 - 184
  • [33] MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks
    Jang, Hanhwi
    Kim, Joonsung
    Jo, Jae-Eon
    Lee, Jaewon
    Kim, Jangwoo
    PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 250 - 263
  • [34] Lightweight and Accurate Memory Allocation in Key-Value Cache
    Cheng Pan
    Lan Zhou
    Yingwei Luo
    Xiaolin Wang
    Zhenlin Wang
    International Journal of Parallel Programming, 2019, 47 : 451 - 466
  • [35] LibreKV: A Persistent in-Memory Key-Value Store
    Liu, Hao
    Huang, Linpeng
    Zhu, Yanmin
    Shen, Yanyan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (04) : 916 - 927
  • [36] FloDB: Unlocking Memory in Persistent Key-Value Stores
    Balmau, Oana
    Guerraoui, Rachid
    Trigonakis, Vasileios
    Zablotchi, Igor
    PROCEEDINGS OF THE TWELFTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS 2017), 2017, : 80 - 94
  • [37] Self-Attention Memory-Augmented Wavelet-CNN for Anomaly Detection
    Wu, Kun
    Zhu, Lei
    Shi, Weihang
    Wang, Wenwu
    Wu, Jin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1374 - 1385
  • [38] Memory-Augmented Non-Local Attention for Video Super-Resolution
    Yu, Jiyang
    Liu, Jingen
    Bo, Liefeng
    Mei, Tao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17813 - 17822
  • [39] Secure In-memory Key-Value Storage with SGX
    Kim, Taehoon
    Park, Joongun
    Woo, Jaewook
    Jeon, Seungheun
    Huh, Jaehyuk
    PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, : 507 - 507
  • [40] ChameleonDB: a Key-value Store for Optane Persistent Memory
    Zhang, Wenhui
    Zhao, Xingsheng
    Jiang, Song
    Jiang, Hong
    PROCEEDINGS OF THE SIXTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '21), 2021, : 194 - 209