Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [1] Semantic Connections in the Complex Sentences for Post-Editing Machine Translation in the Kazakh Language
    Turganbayeva, Aliya
    Rakhimova, Diana
    Karyukin, Vladislav
    Karibayeva, Aidana
    Turarbek, Asem
    INFORMATION, 2022, 13 (09)
  • [2] POST-EDITING FOR THE KAZAKH LANGUAGE USING OPENNMT
    Rakhimova, D. R.
    Zhunussova, A. Z.
    JOURNAL OF MATHEMATICS MECHANICS AND COMPUTER SCIENCE, 2022, 113 (01): : 118 - 123
  • [3] Development and Study of a Post-editing Model for Russian-Kazakh and English-Kazakh Translation Based on Machine Learning
    Rakhimova, Diana
    Sagat, Kamila
    Zhakypbaeva, Kamila
    Zhunussova, Aliya
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 525 - 534
  • [4] Training in machine translation post-editing for foreign language students
    Zhang, Hong
    Torres-Hostench, Olga
    LANGUAGE LEARNING & TECHNOLOGY, 2022, 26 (01):
  • [5] Multidimensional strategy for the selection of machine translation candidates for post-editing
    de Gibert, Ona
    Aranberri, Nora
    LINGUAMATICA, 2019, 11 (02): : 3 - 16
  • [6] On the correctness of machine translation: A machine translation post-editing task
    Koponen, Maarit
    Salmi, Leena
    JOURNAL OF SPECIALISED TRANSLATION, 2015, (23): : 117 - 135
  • [7] Second language learners' post-editing strategies for machine translation errors
    Shin, Dongkawang
    Chon, Yuah V.
    LANGUAGE LEARNING & TECHNOLOGY, 2023, 27 (01):
  • [8] The Task of Post-Editing Machine Translation for the Low-Resource Language
    Rakhimova, Diana
    Karibayeva, Aidana
    Turarbek, Assem
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [9] Evaluating the use of machine translation post-editing in the foreign language class
    Nino, Ana
    COMPUTER ASSISTED LANGUAGE LEARNING, 2008, 21 (01) : 29 - 49
  • [10] Is machine translation post-editing worth the effort? A survey of research into post-editing and effort
    Koponen, Maarit
    JOURNAL OF SPECIALISED TRANSLATION, 2016, (25): : 131 - 148