Pre-training neural machine translation with alignment information via optimal transport

Cited: 0
Authors
Su, Xueping [1 ]
Zhao, Xingkai [1 ]
Ren, Jie [1 ]
Li, Yunhong [1 ]
Raetsch, Matthias [2 ]
Affiliations
[1] Xian Polytech Univ, Sch Elect & Informat, Xian, Peoples R China
[2] Reutlingen Univ, Dept Engn, Interact & Mobile Robot & Artificial Intelligence, Reutlingen, Germany
Funding
National Natural Science Foundation of China;
Keywords
Optimal Transport; Alignment Information; Pre-training; Neural Machine Translation;
DOI
10.1007/s11042-023-17479-z
CLC Classification
TP [Automation Technology, Computer Technology];
Subject Classification Code
0812;
Abstract
With the rapid development of globalization, the demand for translation between different languages is also increasing. Although pre-training has achieved excellent results in neural machine translation, existing neural machine translation models have almost no high-quality alignment information suitable for specific domains, so this paper proposes pre-training neural machine translation with alignment information via optimal transport. First, this paper narrows the representation gap between different languages by using OTAP to generate domain-specific data for information alignment, learning richer semantic information. Second, this paper proposes a lightweight model, DR-Reformer, which uses Reformer as the backbone network and adds Dropout layers and Reduction layers, reducing model parameters without losing accuracy and improving computational efficiency. Experiments on the Chinese-English datasets of AI Challenger 2018 and WMT-17 show that the proposed algorithm outperforms existing algorithms.
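The abstract does not detail the OTAP formulation, but the core optimal-transport idea it invokes can be illustrated with a minimal Sinkhorn sketch in NumPy. This is a generic sketch under stated assumptions, not the paper's implementation: the toy embeddings, cost choice, and hyperparameters below are all hypothetical.

```python
import numpy as np

def sinkhorn(cost, a, b, eps=0.1, n_iters=500):
    """Entropy-regularized optimal transport via Sinkhorn iterations.

    cost : (m, n) pairwise cost between source and target tokens
    a, b : marginal weights of source / target tokens (each sums to 1)
    Returns the (m, n) transport plan softly aligning the two sequences.
    """
    K = np.exp(-cost / eps)              # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (K.T @ u)                # rescale to match target marginals
        u = a / (K @ v)                  # rescale to match source marginals
    return u[:, None] * K * v[None, :]   # transport plan P

# Toy example: align 3 source tokens with 4 target tokens.
rng = np.random.default_rng(0)
src = rng.normal(size=(3, 8))            # hypothetical source embeddings
tgt = rng.normal(size=(4, 8))            # hypothetical target embeddings
cost = ((src[:, None, :] - tgt[None, :, :]) ** 2).sum(-1)
cost = cost / cost.max()                 # scale costs for numerical stability
a = np.full(3, 1 / 3)                    # uniform source marginals
b = np.full(4, 1 / 4)                    # uniform target marginals
P = sinkhorn(cost, a, b)                 # soft source-target alignment
```

Each row of `P` is a soft alignment distribution over target tokens for the corresponding source token; an alignment signal of this kind is what the paper's pre-training reportedly exploits to narrow the cross-lingual representation gap.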
Pages: 48377-48397
Number of pages: 21
Source: Multimedia Tools and Applications, 2024, 83: 48377-48397