Ebaluatoia: crowd evaluation for English–Basque machine translation

Authors
Nora Aranberri
Gorka Labaka
Arantza Díaz de Ilarraza
Kepa Sarasola
Affiliation
[1] University of the Basque Country (UPV/EHU), IXA Group, Faculty of Computer Science
Keywords
Machine translation; Crowd evaluation; Pair-wise comparison; English; Basque
Abstract
This work explores the feasibility of a crowd-based pair-wise comparison evaluation to get feedback on machine translation progress for under-resourced languages. Specifically, we propose a task based on simple work units to compare the outputs of five English-to-Basque systems, which we implement in a web application. In our design, we put forward two key aspects that we believe community collaboration initiatives should consider in order to attract and maintain participants, that is, providing both a community challenge and a personal challenge. We describe how these aspects can comply with a strict methodology to ensure research validity. In particular, we consider the evaluation set size and the characteristics of the test sentences, the number of evaluators per comparison pair, and a mechanism to identify dishonest participation (or participants with insufficient linguistic knowledge). We also describe our dissemination effort, which targeted both general users and interest groups. Over 500 people participated actively in the Ebaluatoia campaign and we were able to collect over 35,000 evaluations in a short period of 10 days. From the results, we complete the ranking of the systems under evaluation and establish whether the difference in quality between the systems is significant.
Pages: 1053-1084
Page count: 31