Domain Adaptation for Arabic Machine Translation: Financial Texts as a Case Study

Authors
Alghamdi, Emad A. [1, 2]
Zakraoui, Jezia [2 ]
Abanmy, Fares A. [2 ]
Affiliations
[1] King Abdulaziz Univ, Ctr Excellence AI & Data Sci, Jeddah 21589, Saudi Arabia
[2] ASAS AI Lab, Riyadh 13518, Saudi Arabia
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 16
Keywords
machine translation; Arabic MT; domain adaptation; financial domain
DOI
10.3390/app14167088
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Neural machine translation (NMT) has shown impressive performance when trained on large-scale corpora. However, generic NMT systems perform poorly on out-of-domain translation. To mitigate this issue, several domain adaptation methods have recently been proposed, which often lead to better translation quality than generic NMT systems. While NMT for English and other European languages has seen continuous progress, domain adaptation in Arabic has received little attention in the literature. The current study, therefore, aims to explore the effectiveness of domain-specific adaptation for Arabic MT (AMT) in a yet unexplored domain: financial news articles. To this end, we developed a parallel corpus for Arabic-English (AR-EN) translation in the financial domain to benchmark different domain adaptation methods. We then fine-tuned several pre-trained NMT models and large language models, including ChatGPT-3.5 Turbo, on our dataset. The results showed that fine-tuning pre-trained NMT models on a few well-aligned in-domain AR-EN segments led to noticeable improvement. The quality of ChatGPT translation was superior to that of the other models according to both automatic and human evaluations. To the best of our knowledge, this is the first work on fine-tuning ChatGPT for financial domain transfer learning. To contribute to research in domain translation, we have made our datasets and fine-tuned models available.
Pages: 15