Pre-training neural machine translation with alignment information via optimal transport

Times Cited: 0
Authors
Su, Xueping [1 ]
Zhao, Xingkai [1 ]
Ren, Jie [1 ]
Li, Yunhong [1 ]
Raetsch, Matthias [2 ]
Affiliations
[1] Xian Polytech Univ, Sch Elect & Informat, Xian, Peoples R China
[2] Reutlingen Univ, Dept Engn, Interact & Mobile Robot & Artificial Intelligence, Reutlingen, Germany
Funding
National Natural Science Foundation of China;
Keywords
Optimal Transport; Alignment Information; Pre-training; Neural Machine Translation;
DOI
10.1007/s11042-023-17479-z
CLC Number
TP [Automation Technology, Computer Technology];
Discipline Code
0812 ;
Abstract
With the rapid development of globalization, the demand for translation between different languages is increasing. Although pre-training has achieved excellent results in neural machine translation, existing neural machine translation models have almost no high-quality alignment information suitable for specific domains, so this paper proposes pre-training neural machine translation with alignment information via optimal transport. First, this paper narrows the representation gap between different languages by using OTAP to generate domain-specific data for information alignment, learning richer semantic information. Second, this paper proposes a lightweight model, DR-Reformer, which uses Reformer as the backbone network and adds Dropout layers and Reduction layers, reducing model parameters and improving computational efficiency without losing accuracy. Experiments on the Chinese-English datasets of AI Challenger 2018 and WMT-17 show that the proposed algorithm outperforms existing algorithms.
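The abstract does not spell out how optimal transport produces alignments, so the following is a minimal illustrative sketch, not the paper's OTAP method: it uses the entropic-regularized Sinkhorn algorithm (a standard way to solve optimal transport) to compute a soft alignment between toy source and target token embeddings. All names, sizes, and the cost function here are assumptions for illustration.

```python
import numpy as np

def sinkhorn(cost, a, b, eps=0.05, n_iters=300):
    """Entropic-regularized optimal transport via Sinkhorn iterations.

    cost : (m, n) cost matrix; a : (m,) source marginal; b : (n,) target marginal.
    Returns the (m, n) transport plan P with row sums a and column sums ~ b.
    """
    K = np.exp(-cost / eps)          # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (K.T @ u)            # scale columns toward target marginal
        u = a / (K @ v)              # scale rows toward source marginal
    return u[:, None] * K * v[None, :]

# Toy example: align 3 source-token embeddings with 4 target-token embeddings.
rng = np.random.default_rng(0)
src = rng.normal(size=(3, 8))
tgt = rng.normal(size=(4, 8))

# Cost = squared Euclidean distance, rescaled to [0, 1] for numerical stability.
cost = ((src[:, None, :] - tgt[None, :, :]) ** 2).sum(-1)
cost = cost / cost.max()

a = np.full(3, 1 / 3)                # uniform mass over source tokens
b = np.full(4, 1 / 4)                # uniform mass over target tokens
P = sinkhorn(cost, a, b)
# Each row of P gives soft alignment weights from one source token
# to every target token; total transported mass sums to 1.
```

In an alignment-aware pre-training setting, a plan like `P` could serve as a soft cross-lingual alignment signal between token representations; the paper's actual OTAP formulation may differ.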
Pages: 48377 - 48397
Number of pages: 21
Related Papers
50 records
  • [21] Exploring the Role of Monolingual Data in Cross-Attention Pre-training for Neural Machine Translation
    Khang Pham
    Long Nguyen
    Dien Dinh
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 179 - 190
  • [22] Graph Neural Pre-training for Recommendation with Side Information
    Liu, Siwei
    Meng, Zaiqiao
    Macdonald, Craig
    Ounis, Iadh
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (03)
  • [23] Character-Aware Low-Resource Neural Machine Translation with Weight Sharing and Pre-training
    Cao, Yichao
    Li, Miao
    Feng, Tao
    Wang, Rujing
    CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 321 - 333
  • [24] Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning
    Geng, Xiang
    Zhang, Yu
    Li, Jiahuan
    Huang, Shujian
    Yang, Hao
    Tao, Shimin
    Chen, Yimeng
    Xie, Ning
    Chen, Jiajun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 12827 - 12835
  • [25] Cross-lingual Visual Pre-training for Multimodal Machine Translation
    Caglayan, Ozan
    Kuyu, Menekse
    Amac, Mustafa Sercan
    Madhyastha, Pranava
    Erdem, Erkut
    Erdem, Aykut
    Specia, Lucia
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1317 - 1324
  • [26] XLIT: A Method to Bridge Task Discrepancy in Machine Translation Pre-training
    Pham, Khang
    Nguyen, Long
    Dinh, Dien
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (10)
  • [27] Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation
    Mao, Zhuoyuan
    Chu, Chenhui
    Kurohashi, Sadao
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [28] Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation
    Liu, Zihan
    Winata, Genta Indra
    Fung, Pascale
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2706 - 2718
  • [29] Cross-Lingual Pre-Training Based Transfer for Zero-Shot Neural Machine Translation
    Ji, Baijun
    Zhang, Zhirui
    Duan, Xiangyu
    Zhang, Min
    Chen, Boxing
    Luo, Weihua
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 115 - 122
  • [30] Explicit Cross-lingual Pre-training for Unsupervised Machine Translation
    Ren, Shuo
    Wu, Yu
    Liu, Shujie
    Zhou, Ming
    Ma, Shuai
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 770 - 779