An Empirical Study of Generation Order for Machine Translation

Cited by: 0
Authors
Chan, William [1]
Stern, Mitchell [2]
Affiliations
[1] Google Res, Mountain View, CA 94043 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we present an empirical study of generation order for machine translation. Building on recent advances in insertion-based modeling, we first introduce a soft order-reward framework that enables us to train models to follow arbitrary oracle generation policies. We then make use of this framework to explore a large variety of generation orders, including uninformed orders, location-based orders, frequency-based orders, content-based orders, and model-based orders. Curiously, we find that for the WMT'14 English → German and WMT'18 English → Chinese translation tasks, order does not have a substantial impact on output quality. Moreover, for English → German, we even discover that unintuitive orderings such as alphabetical and shortest-first can match the performance of a standard Transformer, suggesting that traditional left-to-right generation may not be necessary to achieve high performance.
Pages: 5764 - 5773
Page count: 10
Related papers
50 in total
  • [31] The link between translation difficulty and the quality of machine translation: a literature review and empirical investigation
    Araghi, Sahar
    Palangkaraya, Alfons
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (04) : 1093 - 1114
  • [32] Putting Human Assessments of Machine Translation Systems in Order
    Lopez, Adam, 2012, Association for Computational Linguistics (ACL)
  • [33] Measuring translation difficulty: An empirical study
    Sun, Sanjun
    Shreve, Gregory M.
    TARGET-INTERNATIONAL JOURNAL OF TRANSLATION STUDIES, 2014, 26 (01) : 98 - 127
  • [34] An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation
    Chu, Chenhui
    Dabre, Raj
    Kurohashi, Sadao
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 385 - 391
  • [35] A Study on Output Sentence Generation Method for Question Answering Using Statistical Machine Translation
    Yamada, Tessei
    Arakawa, Tatsuya
    2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013), 2013, : 1199 - 1202
  • [36] An Empirical Analysis of Data Selection Techniques in Statistical Machine Translation
    Chinea-Rios, Mara
    Sanchis-Triches, German
    Casacuberta, Francisco
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (55) : 101 - 108
  • [37] Second-order Optimization for Non-convex Machine Learning: an Empirical Study
    Xu, Peng
    Roosta, Fred
    Mahoney, Michael W.
    PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 199 - 207
  • [38] Generation of Translation Tables Adequate for Example-Based Machine Translation by Analogy
    Kimura, Tatsuya
    Matsuoka, Jin
    Nishikawa, Yusuke
    Lepage, Yves
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 200 - 203
  • [39] Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation
    Alinejad, Ashkan
    Shavarani, Hassan S.
    Sarkar, Anoop
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1734 - 1744
  • [40] Contextual Parameter Generation for Universal Neural Machine Translation
    Platanios, Emmanouil Antonios
    Sachan, Mrinmaya
    Neubig, Graham
    Mitchell, Tom M.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 425 - 435