Translation Quality and Error Recognition in Professional Neural Machine Translation Post-Editing

Cited by: 21
Authors
Vardaro, Jennifer [1]
Schaeffer, Moritz [1]
Hansen-Schirra, Silvia [1]
Affiliations
[1] Johannes Gutenberg Univ Mainz, English Linguist & Translat Studies, D-76726 Mainz, Germany
Source
INFORMATICS-BASEL | 2019 / Volume 6 / Issue 03
Keywords
neural machine translation; post-editing; revision; error annotations; Hjerson; MQM; European Commission (DGT); eye-tracking; key-logging; post-editing effort; ENGLISH;
DOI
10.3390/informatics6030041
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
This study aims to analyse how translation experts from the German department of the European Commission's Directorate-General for Translation (DGT) identify and correct different error categories in neural machine translated texts (NMT) and their post-edited versions (NMTPE). The term translation expert encompasses translator, post-editor as well as revisor. Even though we focus on neural machine-translated segments, translator and post-editor are used synonymously because of the combined workflow using CAT-Tools as well as machine translation. Only the distinction between post-editor, which refers to a DGT translation expert correcting the neural machine translation output, and revisor, which refers to a DGT translation expert correcting the post-edited version of the neural machine translation output, is important and made clear whenever relevant. Using an automatic error annotation tool and the more fine-grained manual error annotation framework to identify characteristic error categories in the DGT texts, a corpus analysis revealed that quality assurance measures by post-editors and revisors of the DGT are most often necessary for lexical errors. More specifically, the corpus analysis showed that, if post-editors correct mistranslations, terminology or stylistic errors in an NMT sentence, revisors are likely to correct the same error type in the same post-edited sentence, suggesting that the DGT experts were being primed by the NMT output. Subsequently, we designed a controlled eye-tracking and key-logging experiment to compare participants' eye movements for test sentences containing the three identified error categories (mistranslations, terminology or stylistic errors) and for control sentences without errors. We examined the three error types' effect on early (first fixation durations, first pass durations) and late eye movement measures (e.g., total reading time and regression path durations). 
Linear mixed-effects regression models predict what kind of behaviour of the DGT experts is associated with the correction of different error types during the post-editing process.
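The early and late eye movement measures named in the abstract can be illustrated with a minimal sketch. It uses the standard reading-research definitions of these measures, not the authors' own pipeline; the `Fixation` record, the `reading_measures` function, and the example data are hypothetical and assume fixations have already been mapped to numbered word regions in reading order.

```python
from typing import NamedTuple, Optional


class Fixation(NamedTuple):
    region: int       # index of the word/region fixated (hypothetical coding, left to right)
    duration_ms: int  # fixation duration in milliseconds


def reading_measures(fixations: list[Fixation], target: int) -> dict[str, Optional[int]]:
    """Compute common reading measures for one target region.

    Assumes the usual textbook definitions; a real study may differ in details
    such as the handling of skipped regions.
    """
    # Total reading time: every fixation on the target, whenever it occurs.
    total = sum(f.duration_ms for f in fixations if f.region == target)

    # Find the first-pass entry: the first fixation on the target that occurs
    # before any region to its right has been fixated.
    first_idx = None
    for i, f in enumerate(fixations):
        if f.region > target:
            break  # target was skipped during first pass
        if f.region == target:
            first_idx = i
            break

    if first_idx is None:
        return {"first_fixation": None, "first_pass": None,
                "regression_path": None, "total_reading_time": total}

    # First fixation duration: the first fixation of the first pass.
    first_fixation = fixations[first_idx].duration_ms

    # First pass duration: fixations on the target until gaze leaves it.
    first_pass = 0
    for f in fixations[first_idx:]:
        if f.region != target:
            break
        first_pass += f.duration_ms

    # Regression path duration: everything from first entry until gaze moves
    # past the target to the right, including regressions to earlier regions.
    regression_path = 0
    for f in fixations[first_idx:]:
        if f.region > target:
            break
        regression_path += f.duration_ms

    return {"first_fixation": first_fixation, "first_pass": first_pass,
            "regression_path": regression_path, "total_reading_time": total}


# Made-up fixation sequence over three regions; region 2 plays the role of an error word.
example = [Fixation(1, 200), Fixation(2, 250), Fixation(2, 180),
           Fixation(1, 150), Fixation(2, 220), Fixation(3, 210), Fixation(2, 100)]
measures = reading_measures(example, target=2)
```

In this made-up sequence the reader regresses from region 2 back to region 1 before moving on, so the regression path duration exceeds the first pass duration, while the total reading time also counts the later return to region 2.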
Pages: 29
Related papers
  • [1] Comparing the Quality of Neural Machine Translation and Professional Post-Editing
    Vardaro, Jennifer
    Schaeffer, Moritz
    Hansen-Schirra, Silvia
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2019
  • [2] Neural Machine Translation Quality and Post-Editing Performance
    Zouhar, Vilem
    Tamchyna, Ales
    Popel, Martin
    Bojar, Ondrej
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021: 10204-10214
  • [3] Post-editing neural machine translation versus translation memory segments
    Sanchez-Gijon, Pilar
    Moorkens, Joss
    Way, Andy
    MACHINE TRANSLATION, 2019, 33 (1-2): 31-59
  • [4] The raw machine translation to the professional advanced post-editing: the case of financial translation
    Peraldi, Sandrine
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2016, 21 (01): 67-90
  • [5] System for Post-Editing and Automatic Error Classification of Machine Translation
    Munkova, Dasa
    Kapusta, Jozef
    Drlik, Martin
    DIVAI 2016: 11TH INTERNATIONAL SCIENTIFIC CONFERENCE ON DISTANCE LEARNING IN APPLIED INFORMATICS, 2016: 571-579
  • [6] On the correctness of machine translation: A machine translation post-editing task
    Koponen, Maarit
    Salmi, Leena
    JOURNAL OF SPECIALISED TRANSLATION, 2015, (23): 117-135
  • [7] IntelliCAT: Intelligent Machine Translation Post-Editing with Quality Estimation and Translation Suggestion
    Lee, Dongjun
    Ahn, Junhyeong
    Park, Heesoo
    Jo, Jaemin
    ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021: 11-19
  • [8] Man or machine? Comparing the difficulty of human translation versus neural machine translation post-editing
    Jia, Yanfang
    Sun, Sanjun
    PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2023, 31 (05): 950-968
  • [9] The Role of Machine Translation Quality Estimation in the Post-Editing Workflow
    Bechara, Hannah
    Orasan, Constantin
    Escartin, Carla Parra
    Zampieri, Marcos
    Lowe, William
    INFORMATICS-BASEL, 2021, 8 (03)
  • [10] Post-Editing Machine Translation As an FSL Exercise
    Kliffer, Michael D.
    PORTA LINGUARUM, 2008, (09): 53-67