Ebaluatoia: crowd evaluation for English–Basque machine translation

被引:0
|
作者
Nora Aranberri
Gorka Labaka
Arantza Díaz de Ilarraza
Kepa Sarasola
机构
[1] University of the Basque Country (UPV/EHU),IXA Group, Faculty of Computer Science
来源
关键词
Machine translation; Crowd evaluation; Pair-wise comparison; English; Basque;
D O I
暂无
中图分类号
学科分类号
摘要
This work explores the feasibility of a crowd-based pair-wise comparison evaluation to get feedback on machine translation progress for under-resourced languages. Specifically, we propose a task based on simple work units to compare the outputs of five English-to-Basque systems, which we implement in a web application. In our design, we put forward two key aspects that we believe community collaboration initiatives should consider in order to attract and maintain participants, that is, providing both a community challenge and a personal challenge. We describe how these aspects can comply with a strict methodology to ensure research validity. In particular, we consider the evaluation set size and the characteristics of the test sentences, the number of evaluators per comparison pair, and a mechanism to identify dishonest participation (or participants with insufficient linguistic knowledge). We also describe our dissemination effort, which targeted both general users and interest groups. Over 500 people participated actively in the Ebaluatoia campaign and we were able to collect over 35,000 evaluations in a short period of 10 days. From the results, we complete the ranking of the systems under evaluation and establish whether the difference in quality between the systems is significant.
引用
收藏
页码:1053 / 1084
页数:31
相关论文
共 50 条
  • [21] Neural Machine Translation for Amharic-English Translation
    Gezmu, Andargachew Mekonne
    Nuernberger, Andreas
    Bati, Tesfaye Bayu
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021, : 526 - 532
  • [22] Improvements in Machine Translation for English/Iraqi Speech Translation
    Saleem, S.
    Subramanian, K.
    Prasad, R.
    Stallard, D.
    Kao, C.
    Natarajan, P.
    Suleiman, R.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2165 - 2168
  • [23] A Crowd-Source Based Corpus on Bangla to English Translation
    Nowshin, Nafisa
    Ritu, Zakia Sultana
    Ismail, Sabir
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [24] A Constructive Machine Translation System for English to Odia Translation
    Das, Aditya Kaustav
    Pradhan, Manabhanjan
    Dash, Amiya Kumar
    Pradhan, Chittaranjan
    Das, Himansu
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 854 - 857
  • [25] Spoken English Grading: Machine Learning with Crowd Intelligence
    Shashidhar, Vinay
    Pandey, Nishant
    Aggarwal, Varun
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 2089 - 2097
  • [26] English/Arabic/English machine translation: A historical perspective
    Zughoul, MR
    Abu-Alshaar, AM
    META, 2005, 50 (03) : 1022 - 1041
  • [27] English generation in Japanese-to-English machine translation
    NTT Communication Science Lab, Japan
    NTT R&D, 12 (1419-1424):
  • [28] Evaluating Arabic to English Machine Translation
    Hadla, Laith S.
    Hailat, Taghreed M.
    Al-Kabi, Mohammed N.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (11) : 68 - 73
  • [29] Neural machine translation of Hindi and English
    Bhatnagar, Sahil
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2071 - 2079
  • [30] Neural machine translation for Tamil to English
    Jain, Minni
    Punia, Ravneet
    Hooda, Ishika
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2020, 23 (07) : 1251 - 1264