Low-Resource Neural Machine Translation Using XLNet Pre-training Model

Cited: 3
Authors
Wu, Nier [1 ]
Hou, Hongxu [1 ]
Guo, Ziyue [1 ]
Zheng, Wei [1 ]
Affiliations
[1] Inner Mongolia Univ, Coll Comp Sci, Coll Software, Hohhot, Inner Mongolia, Peoples R China
Keywords
Low-resource; Machine translation; XLNet; Pre-training;
DOI
10.1007/978-3-030-86383-8_40
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The methods for improving the quality of low-resource neural machine translation (NMT) include: changing the token granularity to reduce the number of low-frequency words; generating a pseudo-parallel corpus from large-scale monolingual data to optimize model parameters; and using the auxiliary knowledge of a pre-trained model to train the NMT model. However, reducing token granularity results in a large number of invalid operations and increases the complexity of local reordering on the target side. Pseudo-parallel corpora contain noise that hinders model convergence. Pre-training methods also limit translation quality because of artificially introduced errors and the assumption of conditional independence. Therefore, we propose an XLNet-based pre-training method that corrects these defects of the pre-training model and enhances the NMT model's context feature extraction. Experiments are carried out on the CCMT2019 Mongolian-Chinese (Mo-Zh), Uyghur-Chinese (Ug-Zh) and Tibetan-Chinese (Ti-Zh) tasks. The results show that both the generalization ability and the BLEU scores of our method improve over the baseline, which verifies the effectiveness of the method.
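For illustration only, the following is a minimal sketch (not the authors' exact architecture) of how a pre-trained XLNet can supply contextual features to a Transformer NMT encoder, in the spirit the abstract describes. The checkpoint name `xlnet-base-cased`, the gated-fusion step, and the assumption that both token streams share one tokenization are placeholders for a runnable example; the paper's Mongolian/Uyghur/Tibetan-Chinese setting would require an XLNet trained on those languages.

```python
import torch
import torch.nn as nn
from transformers import XLNetModel, XLNetTokenizer


class XLNetFusedEncoder(nn.Module):
    """Transformer NMT encoder that fuses frozen XLNet features via a gate."""

    def __init__(self, vocab_size: int, d_model: int = 512,
                 xlnet_name: str = "xlnet-base-cased"):
        super().__init__()
        # Pre-trained XLNet used as a frozen contextual feature extractor.
        self.xlnet = XLNetModel.from_pretrained(xlnet_name)
        for p in self.xlnet.parameters():
            p.requires_grad = False
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=6)
        self.proj = nn.Linear(self.xlnet.config.d_model, d_model)
        self.gate = nn.Linear(2 * d_model, d_model)

    def forward(self, src_ids, xlnet_ids, xlnet_mask):
        # XLNet's permutation language modeling avoids BERT-style [MASK]
        # symbols and the conditional-independence assumption, which is the
        # motivation the abstract gives for adopting it.
        with torch.no_grad():
            ctx = self.xlnet(input_ids=xlnet_ids,
                             attention_mask=xlnet_mask).last_hidden_state
        ctx = self.proj(ctx)
        h = self.encoder(self.embed(src_ids))
        # Gated fusion (assumes both streams have the same length; a real
        # system would align or re-tokenize consistently).
        g = torch.sigmoid(self.gate(torch.cat([h, ctx], dim=-1)))
        return g * h + (1.0 - g) * ctx


if __name__ == "__main__":
    tok = XLNetTokenizer.from_pretrained("xlnet-base-cased")
    batch = tok(["a toy low-resource source sentence"], return_tensors="pt")
    enc = XLNetFusedEncoder(vocab_size=tok.vocab_size)
    # Toy usage: reuse the XLNet ids as the NMT source ids.
    out = enc(batch["input_ids"], batch["input_ids"], batch["attention_mask"])
    print(out.shape)  # (batch, seq_len, 512)
```

Freezing the XLNet parameters and gating its features against the encoder states is one common way to inject pre-trained context without destabilizing low-resource training; the paper itself may integrate XLNet differently.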
Pages: 503-514
Number of pages: 12
Related Papers
(50 records in total)
  • [21] Low-resource Neural Machine Translation: Methods and Trends
    Shi, Shumin
    Wu, Xing
    Su, Rihai
    Huang, Heyan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [22] Neural Machine Translation for Low-resource Languages: A Survey
    Ranathunga, Surangika
    Lee, En-Shiun Annie
    Skenduli, Marjana Prifti
    Shekhar, Ravi
    Alam, Mehreen
    Kaur, Rishemjit
    ACM COMPUTING SURVEYS, 2023, 55 (11)
  • [23] Data Augmentation for Low-Resource Neural Machine Translation
    Fadaee, Marzieh
    Bisazza, Arianna
    Monz, Christof
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 567 - 573
  • [24] Recent advances of low-resource neural machine translation
    Haque, Rejwanul
    Liu, Chao-Hong
    Way, Andy
    MACHINE TRANSLATION, 2021, 35 (04) : 451 - 474
  • [25] DEEP: DEnoising Entity Pre-training for Neural Machine Translation
    Hu, Junjie
    Hayashi, Hiroaki
    Cho, Kyunghyun
    Neubig, Graham
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1753 - 1766
  • [26] Multi-Stage Pre-training for Low-Resource Domain Adaptation
    Zhang, Rong
    Reddy, Revanth Gangi
    Sultan, Md Arafat
    Castelli, Vittorio
    Ferritto, Anthony
    Florian, Radu
    Kayi, Efsun Sarioglu
    Roukos, Salim
    Sil, Avirup
    Ward, Todd
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5461 - 5468
  • [27] On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation
    Liu, Xuebo
    Wang, Longyue
    Wong, Derek F.
    Ding, Liang
    Chao, Lidia S.
    Shi, Shuming
    Tu, Zhaopeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2900 - 2907
  • [28] Multilingual Pre-training Model-Assisted Contrastive Learning Neural Machine Translation
    Sun, Shuo
    Hou, Hong-xu
    Yang, Zong-heng
    Wang, Yi-song
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] A Strategy for Referential Problem in Low-Resource Neural Machine Translation
    Ji, Yatu
    Shi, Lei
    Su, Yila
    Ren, Qing-dao-er-ji
    Wu, Nier
    Wang, Hongbin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 321 - 332
  • [30] Machine Translation in Low-Resource Languages by an Adversarial Neural Network
    Sun, Mengtao
    Wang, Hao
    Pasquine, Mark
    Hameed, Ibrahim A.
    APPLIED SCIENCES-BASEL, 2021, 11 (22):