Multilingual Pre-training Model-Assisted Contrastive Learning Neural Machine Translation

Cited by: 0
Authors
Sun, Shuo [1 ]
Hou, Hong-xu [1 ]
Yang, Zong-heng [1 ]
Wang, Yi-song [1 ]
Affiliations
[1] Inner Mongolia Univ, Coll Comp Sci, Natl & Local Joint Engn Res Ctr Intelligent Infor, Inner Mongolia Key Lab Mongolian Informat Proc Te, Hohhot, Peoples R China
Keywords
Low-Resource NMT; Pre-training Model; Contrastive Learning; Dynamic Training;
DOI
10.1109/IJCNN54540.2023.10191766
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Since pre-training and fine-tuning have become a successful paradigm in Natural Language Processing (NLP), this paper adopts the state-of-the-art pre-training model CeMAT as a strong assistant for low-resource ethnic-language translation tasks. To address the exposure bias problem that arises during fine-tuning, we use a contrastive learning framework and propose a new method for generating contrastive examples, which uses the model's self-generated predictions as contrastive examples so that the model is exposed during training to the errors it makes at inference time. Moreover, to make effective use of the limited bilingual data in low-resource tasks, this paper proposes a dynamic training strategy that fine-tunes the model and refines it step by step, taking word embedding norm and model uncertainty as the criteria for evaluating the data and the model, respectively. Experimental results demonstrate that our method significantly improves translation quality over the baselines, verifying its effectiveness.
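To make the contrastive-example idea concrete, the following is a minimal, hypothetical PyTorch sketch (not the authors' released code): the gold reference acts as the positive and a self-generated prediction as a hard negative in an InfoNCE-style loss over pooled sentence representations. All names (`contrastive_nmt_loss`, `src_repr`, the temperature value, and the pooling assumption) are illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn.functional as F


def contrastive_nmt_loss(src_repr: torch.Tensor,
                         gold_repr: torch.Tensor,
                         pred_repr: torch.Tensor,
                         temperature: float = 0.1) -> torch.Tensor:
    """InfoNCE-style loss over pooled sentence representations (illustrative sketch).

    src_repr:  (batch, dim) pooled encoder states of the source sentences
    gold_repr: (batch, dim) pooled decoder states of the reference translations (positives)
    pred_repr: (batch, dim) pooled decoder states of self-generated predictions,
               used here as hard negatives that expose the model to its own errors
    """
    src = F.normalize(src_repr, dim=-1)
    gold = F.normalize(gold_repr, dim=-1)
    pred = F.normalize(pred_repr, dim=-1)

    pos = (src * gold).sum(dim=-1, keepdim=True)           # similarity to the gold target
    neg = (src * pred).sum(dim=-1, keepdim=True)           # similarity to the model's own output
    logits = torch.cat([pos, neg], dim=-1) / temperature   # (batch, 2)

    # The positive (gold) sits at index 0 for every example in the batch.
    labels = torch.zeros(logits.size(0), dtype=torch.long, device=logits.device)
    return F.cross_entropy(logits, labels)


if __name__ == "__main__":
    # Random tensors stand in for pooled model states in this toy usage example.
    batch, dim = 4, 512
    loss = contrastive_nmt_loss(torch.randn(batch, dim),
                                torch.randn(batch, dim),
                                torch.randn(batch, dim))
    print(loss.item())
```

In this sketch the loss pushes the source representation toward the reference translation and away from the model's own (potentially erroneous) output; the paper's actual formulation of contrastive examples and the dynamic training criteria (word embedding norm and uncertainty) may differ in detail.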
Pages: 7