A two-stage fine-tuning method for low-resource cross-lingual summarization

Times cited: 3
Authors
Zhang, Kaixiong [1 ,2 ]
Zhang, Yongbing [1 ,2 ]
Yu, Zhengtao [1 ,2 ]
Huang, Yuxin [1 ,2 ]
Tan, Kaiwen [1 ,2 ]
Affiliations
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
cross-lingual; low-resource; summarization; fine-tuning;
DOI
10.3934/mbe.2024047
Chinese Library Classification
Q [Biological Sciences];
Subject Classification Codes
07; 0710; 09;
Abstract
Cross-lingual summarization (CLS) is the task of condensing a lengthy source-language text into a concise summary in a target language. This presents a dual challenge, demanding both cross-language semantic understanding (i.e., semantic alignment) and effective information compression. Traditionally, researchers have tackled these challenges with two types of methods: pipeline methods (e.g., translate-then-summarize) and end-to-end methods. The former are intuitive but prone to error propagation, particularly for low-resource languages. The latter have shown impressive performance, thanks to multilingual pre-trained models (mPTMs). However, mPTMs (e.g., mBART) are primarily trained on resource-rich languages, which limits their semantic alignment capabilities for low-resource languages. To address these issues, this paper integrates the intuitiveness of pipeline methods with the effectiveness of mPTMs and proposes a two-stage fine-tuning method for low-resource cross-lingual summarization (TFLCLS). In the first stage, recognizing the deficient semantic alignment of mPTMs for low-resource languages, a semantic alignment fine-tuning method is employed to enhance the mPTM's understanding of such languages. In the second stage, considering that mPTMs are not originally tailored for information compression and that CLS requires the model to align and compress simultaneously, an adaptive joint fine-tuning method is introduced; it further enhances the semantic alignment and information compression abilities of the model trained in the first stage. To evaluate TFLCLS, a low-resource CLS dataset named Vi2ZhLow is constructed from scratch; moreover, two additional low-resource CLS datasets, En2ZhLow and Zh2EnLow, are synthesized from widely used large-scale CLS datasets. Experimental results show that TFLCLS outperforms state-of-the-art methods by 18.88%, 12.71% and 16.91% in ROUGE-2 on the three datasets, respectively, even when limited to only 5,000 training samples.
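To make the two-stage procedure in the abstract concrete, the following is a minimal sketch, assuming an mBART-50 backbone from Hugging Face Transformers and a Vietnamese-to-Chinese setting; the model name, language codes, toy data and hyperparameters are illustrative assumptions, and the authors' adaptive weighting in the joint fine-tuning stage is not reproduced (only a plain summarization loss is shown).

import torch
from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

model_name = "facebook/mbart-large-50"          # assumed mPTM backbone, not the paper's exact model
tokenizer = MBart50TokenizerFast.from_pretrained(model_name)
model = MBartForConditionalGeneration.from_pretrained(model_name)
model.train()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

def train_step(src_texts, tgt_texts, src_lang, tgt_lang):
    # One sequence-to-sequence fine-tuning step on a (source, target) batch.
    tokenizer.src_lang = src_lang
    tokenizer.tgt_lang = tgt_lang
    batch = tokenizer(src_texts, text_target=tgt_texts,
                      padding=True, truncation=True, return_tensors="pt")
    loss = model(**batch).loss                  # cross-entropy over target tokens
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()

# Toy data, purely illustrative.
alignment_pairs = [("Xin chào thế giới.", "你好，世界。")]               # parallel sentences
cls_pairs = [("Một văn bản dài bằng tiếng Việt ...", "一句中文摘要。")]   # document -> summary

# Stage 1: semantic-alignment fine-tuning on parallel sentence pairs,
# a translation-style objective that strengthens the mPTM's grasp of the
# low-resource source language.
for vi_sentence, zh_translation in alignment_pairs:
    train_step([vi_sentence], [zh_translation], src_lang="vi_VN", tgt_lang="zh_CN")

# Stage 2: joint fine-tuning on CLS data (source document -> target summary),
# which asks the stage-1 model to align and compress at the same time.
for vi_document, zh_summary in cls_pairs:
    train_step([vi_document], [zh_summary], src_lang="vi_VN", tgt_lang="zh_CN")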
Pages: 1125-1143
Number of pages: 19
Related Papers
50 records in total
  • [41] Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning
    Agrawal, Ashish Sunil
    Fazili, Barah
    Jyothi, Preethi
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024: 319-329
  • [42] Knowledge Collaborative Fine-tuning for Low-resource Knowledge Graph Completion
    Zhang N.-Y.
    Xie X.
    Chen X.
    Deng S.-M.
    Ye H.-B.
    Chen H.-J.
Ruan Jian Xue Bao/Journal of Software, 2022, 33(10): 3531-3545
  • [43] Cross-lingual subspace Gaussian mixture models for low-resource speech recognition
Institute of Electrical and Electronics Engineers Inc., United States, vol. 22
  • [44] Cross-lingual offensive speech identification with transfer learning for low-resource languages
    Shi, Xiayang
    Liu, Xinyi
    Xu, Chun
    Huang, Yuanyuan
    Chen, Fang
    Zhu, Shaolin
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [45] AgglutiFiT: Efficient Low-Resource Agglutinative Language Model Fine-Tuning
    Li, Zhe
    Li, Xiuhong
    Sheng, Jiabao
    Slamu, Wushour
IEEE ACCESS, 2020, 8: 148489-148499
  • [46] Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages
    Schlichtkrull, Michael Sejr
    Sogaard, Anders
15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017: 220-229
  • [48] CAM: A cross-lingual adaptation framework for low-resource language speech recognition
    Hu, Qing
    Zhang, Yan
    Zhang, Xianlei
    Han, Zongyu
    Yu, Xilong
    INFORMATION FUSION, 2024, 111
  • [49] Is Translation Helpful? An Exploration of Cross-Lingual Transfer in Low-Resource Dialog Generation
    Shen, Lei
    Yu, Shuai
    Shen, Xiaoyu
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024
  • [50] SUBSPACE MIXTURE MODEL FOR LOW-RESOURCE SPEECH RECOGNITION IN CROSS-LINGUAL SETTINGS
    Miao, Yajie
    Metze, Florian
    Waibel, Alex
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013: 7339-7343