Translation Quality and Error Recognition in Professional Neural Machine Translation Post-Editing

被引:21
|
作者
Vardaro, Jennifer [1 ]
Schaeffer, Moritz [1 ]
Hansen-Schirra, Silvia [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, English Linguist & Translat Studies, D-76726 Mainz, Germany
来源
INFORMATICS-BASEL | 2019年 / 6卷 / 03期
关键词
neural machine translation; post-editing; revision; error annotations; Hjerson; MQM; European Commission (DGT); eye-tracking; key-logging; post-editing effort; ENGLISH;
D O I
10.3390/informatics6030041
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study aims to analyse how translation experts from the German department of the European Commission's Directorate-General for Translation (DGT) identify and correct different error categories in neural machine translated texts (NMT) and their post-edited versions (NMTPE). The term translation expert encompasses translator, post-editor as well as revisor. Even though we focus on neural machine-translated segments, translator and post-editor are used synonymously because of the combined workflow using CAT-Tools as well as machine translation. Only the distinction between post-editor, which refers to a DGT translation expert correcting the neural machine translation output, and revisor, which refers to a DGT translation expert correcting the post-edited version of the neural machine translation output, is important and made clear whenever relevant. Using an automatic error annotation tool and the more fine-grained manual error annotation framework to identify characteristic error categories in the DGT texts, a corpus analysis revealed that quality assurance measures by post-editors and revisors of the DGT are most often necessary for lexical errors. More specifically, the corpus analysis showed that, if post-editors correct mistranslations, terminology or stylistic errors in an NMT sentence, revisors are likely to correct the same error type in the same post-edited sentence, suggesting that the DGT experts were being primed by the NMT output. Subsequently, we designed a controlled eye-tracking and key-logging experiment to compare participants' eye movements for test sentences containing the three identified error categories (mistranslations, terminology or stylistic errors) and for control sentences without errors. We examined the three error types' effect on early (first fixation durations, first pass durations) and late eye movement measures (e.g., total reading time and regression path durations). Linear mixed-effects regression models predict what kind of behaviour of the DGT experts is associated with the correction of different error types during the post-editing process.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] PET: a Tool for Post-editing and Assessing Machine Translation
    Aziz, Wilker
    de Sousa, Sheila C. M.
    Specia, Lucia
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3982 - 3987
  • [22] Machine translation post-editing through emotional narratives
    Ramos, Maria del Mar Sanchez
    TRANSLATION AND TRANSLANGUAGING IN MULTILINGUAL CONTEXTS, 2025, 11 (01) : 31 - 47
  • [23] The effectiveness of online queries in machine translation post-editing
    Zhang, Hong
    Torres-Hostench, Olga
    CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2023, (93): : 289 - 303
  • [24] Ranking Machine Translation Systems via Post-editing
    Aziz, Wilker
    Mitkov, Ruslan
    Specia, Lucia
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 410 - 418
  • [25] Indices of cognitive effort in machine translation post-editing
    Vieira, Lucas Nunes
    MACHINE TRANSLATION, 2014, 28 (3-4) : 187 - 216
  • [26] Mind the gap The nature of machine translation post-editing
    Rico, Celia
    BABEL-REVUE INTERNATIONALE DE LA TRADUCTION-INTERNATIONAL JOURNAL OF TRANSLATION, 2022, 68 (05): : 697 - 722
  • [27] Translators' perceptions of literary post-editing using statistical and neural machine translation
    Moorkens, Joss
    Toral, Antonio
    Castilho, Sheila
    Way, Andy
    TRANSLATION SPACES, 2018, 7 (02) : 240 - 262
  • [28] Predicting Post-Editing Effort for English-Chinese Neural Machine Translation
    Ye, Na
    Jiang, Ling
    Ma, Dandan
    Zhang, Yingxin
    Zhao, Sanyuan
    Cai, Dongfeng
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 154 - 158
  • [29] Automatic Post-Editing of Machine Translation: A Neural Programmer-Interpreter Approach
    Vu, Thuy-Trang
    Haffari, Gholamreza
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3048 - 3053
  • [30] Translation Tasks for Learning Collocations: Effects of Machine Translation plus Post-editing and Sight Translation
    Chang, Chia-chien
    Yamada, Masaru
    ENGLISH TEACHING AND LEARNING, 2021, 45 (01): : 27 - 44