Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [21] Indices of cognitive effort in machine translation post-editing
    Vieira, Lucas Nunes
    MACHINE TRANSLATION, 2014, 28 (3-4) : 187 - 216
  • [22] Mind the gap The nature of machine translation post-editing
    Rico, Celia
    BABEL-REVUE INTERNATIONALE DE LA TRADUCTION-INTERNATIONAL JOURNAL OF TRANSLATION, 2022, 68 (05): : 697 - 722
  • [23] Neural Machine Translation Quality and Post-Editing Performance
    Zouhar, Vilem
    Tamchyna, Ales
    Popel, Martin
    Bojar, Ondrej
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10204 - 10214
  • [24] Post-editing neural machine translation versus translation memory segments
    Sanchez-Gijon, Pilar
    Moorkens, Joss
    Way, Andy
    MACHINE TRANSLATION, 2019, 33 (1-2) : 31 - 59
  • [25] The Trials and Tribulations of Predicting Machine Translation Post-Editing Productivity
    Marg, Lena
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 23 - 26
  • [26] Social groups in machine translation post-editing A SCOT analysis
    Sakamoto, Akiko
    Yamada, Masaru
    TRANSLATION SPACES, 2020, 9 (01) : 78 - 97
  • [27] Multi-Modal Approaches for Post-Editing Machine Translation
    Herbig, Nico
    Pal, Santanu
    van Genabith, Josef
    Krueger, Antonio
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [28] The impact of post-editing and machine translation on creativity and reading experience
    Guerberof-Arenas, Ana
    Toral, Antonio
    TRANSLATION SPACES, 2020, 9 (02) : 255 - 282
  • [29] Comparing the Quality of Neural Machine Translation and Professional Post-Editing
    Vardaro, Jennifer
    Schaeffer, Moritz
    Hansen-Schirra, Silvia
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2019,
  • [30] System for Post-Editing and Automatic Error Classification of Machine Translation
    Munkova, Dasa
    Kapusta, Jozef
    Drlik, Martin
    DIVAI 2016: 11TH INTERNATIONAL SCIENTIFIC CONFERENCE ON DISTANCE LEARNING IN APPLIED INFORMATICS, 2016, : 571 - 579