Adaptive Nearest Neighbor Machine Translation

被引:0
|
作者
Zheng, Xin [1 ]
Zhang, Zhirui [2 ]
Guo, Junliang [3 ]
Huang, Shujian [1 ]
Chen, Boxing [2 ]
Luo, Weihua [2 ]
Chen, Jiajun [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Alibaba DAMO Acad, Machine Intelligence Technol Lab, Shanghai, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
kNN-MT, recently proposed by Khandelwal et al. (2020a), successfully combines pretrained neural machine translation (NMT) model with token-level k-nearest-neighbor (kNN) retrieval to improve the translation accuracy. However, the traditional kNN algorithm used in kNN-MT simply retrieves a same number of nearest neighbors for each target token, which may cause prediction errors when the retrieved neighbors include noises. In this paper, we propose Adaptive kNN-MT to dynamically determine the number of k for each target token. We achieve this by introducing a light-weight Meta-k Network, which can be efficiently trained with only a few training samples. On four benchmark machine translation datasets, we demonstrate that the proposed method is able to effectively filter out the noises in retrieval results and significantly outperforms the vanilla kNN-MT model. Even more noteworthy is that the Meta-k Network learned on one domain could be directly applied to other domains and obtain consistent improvements, illustrating the generality of our method. Our implementation is open-sourced at https://github. com/zhengxxn/adaptive-knn-mt.
引用
收藏
页码:368 / 374
页数:7
相关论文
共 50 条
  • [31] Random nearest neighbor graphs: The translation invariant case
    Bock, Bounghun
    Damron, Michael
    Hanson, Jack
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2023, 59 (02): : 849 - 866
  • [32] Distributed adaptive nearest neighbor classifier: algorithm and theory
    Liu, Ruiqi
    Xu, Ganggang
    Shang, Zuofeng
    STATISTICS AND COMPUTING, 2023, 33 (05)
  • [33] An adaptive nearest neighbor classification algorithm for data streams
    Law, YN
    Zaniolo, C
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 108 - 120
  • [34] An adaptive large margin nearest neighbor classification algorithm
    Yang, Liu
    Yu, Jian
    Jing, Liping
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (11): : 2269 - 2277
  • [35] On nearest neighbor classification using adaptive choice of k
    Ghosh, Anil K.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (02) : 482 - 502
  • [36] Adaptive κ-nearest-neighbor classification using a dynamic number of nearest neighbors
    Ougiaroglou, Stefanos
    Nanopoulos, Alexandros
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    Welzer-Druzovec, Tatjana
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4690 : 66 - +
  • [37] Overfit prevention in adaptive weighted distance nearest neighbor
    Parvinnia, Elham
    Moosavi, Mohammad R.
    Jahromi, Mansoor Z.
    Ziarati, Koorush
    WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
  • [38] Multiview Adaptive K-Nearest Neighbor Classification
    School of Science, East China Jiaotong University, Nanchang
    330013, China
    不详
    330013, China
    不详
    IEEE. Trans. Artif. Intell., 2024, 3 (1221-1234): : 1221 - 1234
  • [39] Distributed adaptive nearest neighbor classifier: algorithm and theory
    Ruiqi Liu
    Ganggang Xu
    Zuofeng Shang
    Statistics and Computing, 2023, 33
  • [40] Adaptive Binary Quantization for Fast Nearest Neighbor Search
    Li, Zhujin
    Liu, Xianglong
    Wu, Junjie
    Su, Hao
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 64 - 72