Insertion Transformer: Flexible Sequence Generation via Insertion Operations

Cited by: 0
Authors
Stern, Mitchell [1, 2]
Chan, William [1]
Kiros, Jamie [1]
Uszkoreit, Jakob [1]
Affiliations
[1] Google Brain, Berlin, Germany
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
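Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code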
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present the Insertion Transformer, an iterative, partially autoregressive model for sequence generation based on insertion operations. Unlike typical autoregressive models which rely on a fixed, often left-to-right ordering of the output, our approach accommodates arbitrary orderings by allowing for tokens to be inserted anywhere in the sequence during decoding. This flexibility confers a number of advantages: for instance, not only can our model be trained to follow specific orderings such as left-to-right generation or a binary tree traversal, but it can also be trained to maximize entropy over all valid insertions for robustness. In addition, our model seamlessly accommodates both fully autoregressive generation (one insertion at a time) and partially autoregressive generation (simultaneous insertions at multiple locations). We validate our approach by analyzing its performance on the WMT 2014 English-German machine translation task under various settings for training and decoding. We find that the Insertion Transformer outperforms many prior non-autoregressive approaches to translation at comparable or better levels of parallelism, and successfully recovers the performance of the original Transformer while requiring only logarithmically many iterations during decoding.
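The abstract's claim of "logarithmically many iterations" can be illustrated with a small sketch. This is not the authors' implementation: the function name `parallel_insertion_rounds` is hypothetical, and an oracle that already knows the target sentence stands in for a trained model. In each round the sketch inserts the middle token of every still-empty span at once, which corresponds to a balanced binary tree ordering with simultaneous insertions at multiple locations.

```python
def parallel_insertion_rounds(target):
    """Rebuild `target` by parallel insertion in balanced-binary-tree order.

    Assumption: an oracle with access to `target` replaces the model, so this
    only demonstrates the ordering and the round count, not any learning.
    Returns (number_of_rounds, list_of_partial_outputs_per_round).
    """
    canvas = []   # (index_in_target, token) pairs, kept sorted by index
    history = []  # partial output after each round
    rounds = 0
    while len(canvas) < len(target):
        # Positions already filled, with sentinels for the two sequence ends.
        placed = [-1] + [i for i, _ in canvas] + [len(target)]
        new_items = []
        for left, right in zip(placed, placed[1:]):
            if right - left > 1:            # this span still has missing tokens
                mid = (left + right) // 2   # insert the span's middle token
                new_items.append((mid, target[mid]))
        canvas = sorted(canvas + new_items)  # all insertions happen in one round
        rounds += 1
        history.append([tok for _, tok in canvas])
    return rounds, history


if __name__ == "__main__":
    sentence = "the cat sat on the mat today".split()
    n_rounds, steps = parallel_insertion_rounds(sentence)
    for step in steps:
        print(" ".join(step))
    print("rounds:", n_rounds)
```

For a sequence of n tokens this schedule finishes in floor(log2 n) + 1 rounds, whereas inserting one token per step (the fully autoregressive setting) would take n steps.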
Pages: 10