Adaptive Nearest Neighbor Machine Translation

Authors
Zheng, Xin [1]
Zhang, Zhirui [2]
Guo, Junliang [3]
Huang, Shujian [1]
Chen, Boxing [2]
Luo, Weihua [2]
Chen, Jiajun [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Alibaba DAMO Acad, Machine Intelligence Technol Lab, Shanghai, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
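Funding
National Key Research and Development Program of China; US National Science Foundation
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405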
Abstract
kNN-MT, recently proposed by Khandelwal et al. (2020a), successfully combines a pretrained neural machine translation (NMT) model with token-level k-nearest-neighbor (kNN) retrieval to improve translation accuracy. However, the traditional kNN algorithm used in kNN-MT retrieves the same number of nearest neighbors for every target token, which may cause prediction errors when the retrieved neighbors include noise. In this paper, we propose Adaptive kNN-MT, which dynamically determines the value of k for each target token. We achieve this by introducing a lightweight Meta-k Network, which can be trained efficiently with only a few training samples. On four benchmark machine translation datasets, we demonstrate that the proposed method effectively filters out noise in the retrieval results and significantly outperforms the vanilla kNN-MT model. Even more noteworthy, the Meta-k Network learned on one domain can be directly applied to other domains and obtain consistent improvements, illustrating the generality of our method. Our implementation is open-sourced at https://github.com/zhengxxn/adaptive-knn-mt.
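The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (see the repository linked above for that), and several details are assumptions that the abstract does not state: the candidate set of k values, the softmax over negative distances with a `temperature`, and the rule that the weight given to k = 0 goes to the NMT model's own distribution. The function names `knn_distribution` and `adaptive_mix` are hypothetical, and the per-k weights, which a Meta-k Network would predict, are simply supplied by hand here.

```python
import numpy as np


def knn_distribution(distances, token_ids, vocab_size, k, temperature=10.0):
    """Vocabulary distribution built from the k nearest retrieved neighbors.

    `distances` must be sorted in ascending order; `token_ids` holds the
    target token stored with each neighbor. The temperature is an assumption.
    """
    probs = np.zeros(vocab_size)
    if k == 0:
        return probs
    d = np.asarray(distances[:k], dtype=float)
    # Softmax over negative distances, shifted for numerical stability.
    weights = np.exp(-(d - d.min()) / temperature)
    weights /= weights.sum()
    # Neighbors that share a target token pool their weight.
    np.add.at(probs, np.asarray(token_ids[:k]), weights)
    return probs


def adaptive_mix(p_nmt, distances, token_ids, candidate_ks, k_weights):
    """Mix the NMT distribution with kNN distributions for several values of k.

    `k_weights` stands in for the output of a Meta-k Network (assumed to sum
    to 1); the weight attached to k = 0 is applied to the NMT distribution.
    """
    p_nmt = np.asarray(p_nmt, dtype=float)
    mixed = np.zeros_like(p_nmt)
    for k, w in zip(candidate_ks, k_weights):
        if k == 0:
            mixed += w * p_nmt
        else:
            mixed += w * knn_distribution(distances, token_ids, len(p_nmt), k)
    return mixed


# Toy example: the two closest neighbors agree on token 1, the rest are noise.
p_nmt = [0.7, 0.1, 0.1, 0.1]
distances = [1.0, 2.0, 50.0, 60.0]
token_ids = [1, 1, 3, 2]
mixed = adaptive_mix(p_nmt, distances, token_ids,
                     candidate_ks=[0, 1, 2, 4],
                     k_weights=[0.2, 0.3, 0.5, 0.0])
print(mixed)
```

In this toy case the hand-picked weights put no mass on k = 4, so the two distant, noisy neighbors never influence the prediction. A fixed k = 4 would have let them contribute to every token.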
Pages: 368-374
Number of pages: 7