Insertion Transformer: Flexible Sequence Generation via Insertion Operations

被引:0
|
作者
Stern, Mitchell [1 ,2 ]
Chan, William [1 ]
Kiros, Jamie [1 ]
Uszkoreit, Jakob [1 ]
机构
[1] Google Brain, Berlin, Germany
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the Insertion Transformer, an iterative, partially autoregressive model for sequence generation based on insertion operations. Unlike typical autoregressive models which rely on a fixed, often left-to-right ordering of the output, our approach accommodates arbitrary orderings by allowing for tokens to be inserted anywhere in the sequence during decoding. This flexibility confers a number of advantages: for instance, not only can our model be trained to follow specific orderings such as left-to-right generation or a binary tree traversal, but it can also be trained to maximize entropy over all valid insertions for robustness. In addition, our model seamlessly accommodates both fully autoregressive generation (one insertion at a time) and partially autoregressive generation (simultaneous insertions at multiple locations). We validate our approach by analyzing its performance on the WMT 2014 English-German machine translation task under various settings for training and decoding. We find that the Insertion Transformer outperforms many prior non-autoregressive approaches to translation at comparable or better levels of parallelism, and successfully recovers the performance of the original Transformer while requiring only logarithmically many iterations during decoding.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Transposition of a bacterial insertion sequence in chloroplasts
    Kohl, Stefan
    Bock, Ralph
    PLANT JOURNAL, 2009, 58 (03): : 423 - 436
  • [22] Insertion sequence IS900 revisited
    Semret, M
    Turenne, CY
    Behr, MA
    JOURNAL OF CLINICAL MICROBIOLOGY, 2006, 44 (03) : 1081 - 1083
  • [23] Estimating the fitness effect of an insertion sequence
    Manuel Bichsel
    A. D. Barbour
    Andreas Wagner
    Journal of Mathematical Biology, 2013, 66 : 95 - 114
  • [24] Insertion sequence elements in Lactococcus garvieae
    Eraclio, Giovanni
    Ricci, Giovanni
    Fortina, Maria Grazia
    GENE, 2015, 555 (02) : 291 - 296
  • [25] Estimating the fitness effect of an insertion sequence
    Bichsel, Manuel
    Barbour, A. D.
    Wagner, Andreas
    JOURNAL OF MATHEMATICAL BIOLOGY, 2013, 66 (1-2) : 95 - 114
  • [26] Insertion sequence elements and transposons in Bacillus
    Mahillon, J
    APPLICATIONS AND SYSTEMATICS OF BACILLUS AND RELATIVES, 2002, : 236 - 253
  • [27] GAMMA-DELTA SEQUENCE OF F IS AN INSERTION SEQUENCE
    GUYER, MS
    JOURNAL OF MOLECULAR BIOLOGY, 1978, 126 (03) : 347 - 365
  • [28] A new model to calculate insertion loss of DSL transformer
    Su, Hua
    Zhang, Huaiwu
    Tang, Xiaoli
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 147 - +
  • [29] Insertion of Distributed Generation into Rural Feeders
    Mohr, R. A.
    Moreno, R.
    Rudnick, H.
    2009 CIGRE/IEEE PES JOINT SYMPOSIUM INTEGRATION OF WIDE-SCALE RENEWABLE RESOURCES INTO THE POWER DELIVERY SYSTEM, 2009, : 260 - 269
  • [30] Study of deformation and insertion tasks of a flexible wire
    Nakagaki, H
    Kitagaki, K
    Ogasawara, T
    Tsukune, H
    1997 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION - PROCEEDINGS, VOLS 1-4, 1997, : 2397 - 2402