Analysis of Edit Operations for Post-editing Systems

Cited by: 3
Authors
Kapusta, Jozef [1]
Benko, L'ubomir [1]
Munkova, Dasa [1]
Munk, Michal [1, 2]
Affiliations
[1] Constantine Philosopher Univ Nitra, Dept Comp Sci, Tr A Hlinku 1, Nitra 94974, Slovakia
[2] Univ Pardubice, Sci & Res Ctr, Studentska 84, Pardubice 53210, Czech Republic
Keywords
Machine translation; Natural language processing; Recommender tool; Post-editing; POS tagging; Sequence rules; Metrics
DOI
10.1007/s44196-021-00048-3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Post-editing has become an important part not only of translation research but also of the global translation industry. While computer-aided translation tools, such as translation memories, are an established part of a translator's work, machine translation (MT) systems have lately been accepted by human translators as well. However, many human translators are still adapting to the changes that translation technologies have brought to the industry. This paper introduces a novel approach to finding suitable pairs of corresponding n-grams between MT output and post-edited MT output, and to recommending them according to the type of text (manual or administrative) and the MT system used. A tool that recommends corrections and speeds up the correction of MT output was developed to help post-editors with their work. It is based on the analysis of words with the same lemmas and on the analysis of n-gram recommendations. These recommendations are extracted from sequence patterns of the mismatched words (MisMatch) between MT output and post-edited MT output. The paper aims to show how morphological analysis can be used to recommend post-edit operations, and it describes how mismatched words are used in the n-gram recommendations for the post-edited MT output. The contribution consists of a methodology for finding suitable pairs of words and n-grams and, additionally, of showing the importance of taking metadata into account (the type and/or style of the text and the MT system) when recommending post-edit operations.
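As a rough illustration of the idea of pairing mismatched n-grams between MT output and post-edited MT output, the sketch below aligns two token sequences and counts the substituted spans per (text type, MT system) key. It is not the authors' tool: the use of Python's difflib for alignment, the function names mismatch_pairs and count_recommendations, the whitespace tokenization, and the sample sentences and the "SystemA" label are all assumptions made for illustration. The paper itself works with lemmas, POS tags and sequence rules, which this sketch does not reproduce.

```python
from collections import Counter
from difflib import SequenceMatcher


def mismatch_pairs(mt_tokens, pe_tokens):
    """Return (mt_ngram, pe_ngram) pairs for spans that were substituted.

    Hypothetical helper: alignment is done with difflib, which is an
    assumption of this sketch, not the method described in the paper.
    """
    matcher = SequenceMatcher(a=mt_tokens, b=pe_tokens, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # "replace" marks a span of MT tokens rewritten by the post-editor.
        if tag == "replace":
            pairs.append((tuple(mt_tokens[i1:i2]), tuple(pe_tokens[j1:j2])))
    return pairs


def count_recommendations(corpus):
    """Count mismatch pairs separately for each (text_type, mt_system) key.

    `corpus` is an iterable of (text_type, mt_system, mt_text, pe_text)
    tuples; tokenization is a plain whitespace split (an assumption).
    """
    counts = {}
    for text_type, mt_system, mt_text, pe_text in corpus:
        key = (text_type, mt_system)
        pairs = mismatch_pairs(mt_text.split(), pe_text.split())
        counts.setdefault(key, Counter()).update(pairs)
    return counts


# Made-up sample data; "SystemA" is a placeholder MT system name.
sample = [
    ("manual", "SystemA", "the printer are connected", "the printer is connected"),
    ("manual", "SystemA", "both cables are loose", "both cables is loose"),
]
recommendations = count_recommendations(sample)
```

Keeping a separate counter per (text type, MT system) key mirrors the abstract's point that such metadata matters: the most frequent pair under one key need not be the most frequent under another.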
Pages: 12