Adaptive Nearest Neighbor Machine Translation

被引:0
|
作者
Zheng, Xin [1 ]
Zhang, Zhirui [2 ]
Guo, Junliang [3 ]
Huang, Shujian [1 ]
Chen, Boxing [2 ]
Luo, Weihua [2 ]
Chen, Jiajun [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Alibaba DAMO Acad, Machine Intelligence Technol Lab, Shanghai, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
kNN-MT, recently proposed by Khandelwal et al. (2020a), successfully combines pretrained neural machine translation (NMT) model with token-level k-nearest-neighbor (kNN) retrieval to improve the translation accuracy. However, the traditional kNN algorithm used in kNN-MT simply retrieves a same number of nearest neighbors for each target token, which may cause prediction errors when the retrieved neighbors include noises. In this paper, we propose Adaptive kNN-MT to dynamically determine the number of k for each target token. We achieve this by introducing a light-weight Meta-k Network, which can be efficiently trained with only a few training samples. On four benchmark machine translation datasets, we demonstrate that the proposed method is able to effectively filter out the noises in retrieval results and significantly outperforms the vanilla kNN-MT model. Even more noteworthy is that the Meta-k Network learned on one domain could be directly applied to other domains and obtain consistent improvements, illustrating the generality of our method. Our implementation is open-sourced at https://github. com/zhengxxn/adaptive-knn-mt.
引用
收藏
页码:368 / 374
页数:7
相关论文
共 50 条
  • [1] Fast Nearest Neighbor Machine Translation
    Meng, Yuxian
    Li, Xiaoya
    Zheng, Xiayu
    Wu, Fei
    Sun, Xiaofei
    Zhang, Tianwei
    Li, Jiwei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 555 - 565
  • [2] Multilingual κ-Nearest-Neighbor Machine Translation
    Stap, David
    Monz, Christof
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9200 - 9208
  • [3] Datastore Distillation for Nearest Neighbor Machine Translation
    Dai, Yuhan
    Zhang, Zhirui
    Du, Yichao
    Liu, Shengcai
    Liu, Lemao
    Xu, Tong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 807 - 817
  • [4] Subset Retrieval Nearest Neighbor Machine Translation
    Deguchi, Hiroyuki
    Watanabe, Taro
    Matsui, Yusuke
    Utiyama, Masao
    Tanaka, Hideki
    Sumita, Eiichiro
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 174 - 189
  • [5] Revisiting Source Context in Nearest Neighbor Machine Translation
    Li, Xuanhong
    Li, Peng
    Hu, Po
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 8087 - 8098
  • [6] Nearest Neighbor Knowledge Distillation for Neural Machine Translation
    Yang, Zhixian
    Sun, Renliang
    Wan, Xiaojun
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5546 - 5556
  • [7] INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
    Zhu, Wenhao
    Xu, Jingjing
    Huang, Shujian
    Kong, Lingpeng
    Chen, Jiajun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15948 - 15959
  • [8] Towards Faster k-Nearest-Neighbor Machine Translation
    Shi, Xiangyu
    Liang, Yunlong
    Xu, Jinan
    Chen, Yufeng
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (01): : 1943 - 1958
  • [9] Adaptive nearest neighbor classifier
    Ghosh, Anil K.
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 281 - 284
  • [10] Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer
    Gaol, Ruize
    Zhang, Zhirui
    Dui, Yichao
    Liu, Lemao
    Wang, Rui
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 15592 - 15608