Simultaneous neural machine translation with a reinforced attention mechanism

Cited by: 6
Authors
Lee, YoHan [1 ]
Shin, JongHun [1 ]
Kim, YoungKil [1 ]
Affiliations
[1] Electronics and Telecommunications Research Institute (ETRI), Language Intelligence Research Section, Daejeon, South Korea
Keywords
attention mechanism; neural network; reinforcement learning; simultaneous machine translation;
DOI
10.4218/etrij.2020-0358
Chinese Library Classification
TM [Electrical Technology]; TN [Electronics and Communication Technology]
Discipline Classification Codes
0808; 0809
Abstract
To translate in real time, a simultaneous translation system must decide when to stop reading source tokens and begin generating target tokens for the partial source sentence read up to that point. Conventional attention-based neural machine translation (NMT) models cannot translate with adequate latency in online scenarios because they wait for the complete source sentence before computing the alignment between source and target tokens. To address this issue, we propose a reinforcement learning (RL)-based attention mechanism, the reinforced attention mechanism, which allows a neural translation model to jointly train the stopping criterion and a partial translation model. The proposed mechanism comprises two modules, one that ensures translation quality and one that controls latency. Unlike previous RL-based simultaneous translation systems, which learn the stopping criterion from a fixed NMT model, the two modules are trained jointly with a novel reward function. In our experiments, the proposed model achieves better translation quality than previous models at comparable latency.
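The abstract describes the general technique at a high level: an RL-trained READ/WRITE policy that attends over the source prefix read so far, optimized with a reward that trades translation quality against latency. The following is a minimal, self-contained sketch of that general setup, not the authors' implementation; the names (`ReadWritePolicy`, `quality_weight`, `latency_weight`), the reward form, and the toy decoder update are all illustrative assumptions.

```python
# Hypothetical sketch of an RL-based READ/WRITE policy for simultaneous NMT.
# Not the paper's model: module/parameter names and the reward are assumptions.
import torch
import torch.nn as nn

class ReadWritePolicy(nn.Module):
    """Scores READ (0) vs. WRITE (1) from the attention context over the
    source prefix read so far and the current decoder state."""
    def __init__(self, hidden_dim):
        super().__init__()
        self.score = nn.Linear(2 * hidden_dim, 2)  # logits over {READ, WRITE}

    def forward(self, context, dec_state):
        logits = self.score(torch.cat([context, dec_state], dim=-1))
        return torch.log_softmax(logits, dim=-1)

def attention_context(enc_prefix, dec_state):
    """Dot-product attention restricted to the source tokens read so far."""
    weights = torch.softmax(enc_prefix @ dec_state, dim=0)  # (prefix_len,)
    return weights @ enc_prefix                             # (hidden_dim,)

def episode_reward(token_logprobs, delays, src_len,
                   quality_weight=1.0, latency_weight=0.1):
    """Assumed joint reward: a quality term (mean target-token log-probability
    as a proxy for BLEU) minus a latency term (mean fraction of the source
    read per emitted token, a simplification of average lagging)."""
    quality = torch.stack(token_logprobs).mean()
    latency = sum(delays) / (len(delays) * src_len)
    return quality_weight * quality - latency_weight * latency

# --- Toy rollout and a single REINFORCE update ---
hidden_dim, src_len, max_tgt_len = 8, 6, 6
enc_states = torch.randn(src_len, hidden_dim)  # stand-in for real encoder states
policy = ReadWritePolicy(hidden_dim)
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

dec_state = torch.zeros(hidden_dim)
read, written = 1, 0                    # start after reading one source token
log_pi, token_logprobs, delays = [], [], []
while written < max_tgt_len:
    context = attention_context(enc_states[:read], dec_state)
    logp = policy(context, dec_state)
    action = torch.distributions.Categorical(logits=logp).sample()
    log_pi.append(logp[action])
    if action.item() == 0 and read < src_len:
        read += 1                       # READ: reveal one more source token
    else:
        # WRITE: a full system would emit the NMT decoder's next target token;
        # here its log-prob is faked with the policy's own WRITE log-prob.
        token_logprobs.append(logp[1])
        delays.append(read)             # source tokens consumed at this emission
        written += 1
        dec_state = 0.5 * dec_state + 0.5 * context.detach()  # crude decoder update

reward = episode_reward(token_logprobs, delays, src_len)
loss = -reward.detach() * torch.stack(log_pi).sum()  # REINFORCE objective
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

In a full system matching the paper's joint-training setup, the WRITE branch would query a trainable NMT decoder and the quality term would come from its log-likelihood over the partial translation, so that the stopping criterion and the translation model improve together rather than the policy being fit to a fixed NMT model.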
Pages: 775-786
Page count: 12
Related papers (50 in total)
  • [21] Neural Machine Translation with Target-Attention Model
    Yang, Mingming
    Zhang, Min
    Chen, Kehai
    Wang, Rui
    Zhao, Tiejun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (03) : 684 - 694
  • [22] Syntax-Directed Attention for Neural Machine Translation
    Chen, Kehai
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4792 - 4799
  • [23] Dynamic Attention Aggregation with BERT for Neural Machine Translation
    Zhang, JiaRui
    Li, HongZheng
    Shi, ShuMin
    Huang, HeYan
    Hu, Yue
    Wei, XiangPeng
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [24] Synchronous Syntactic Attention for Transformer Neural Machine Translation
    Deguchi, Hiroyuki
    Tamura, Akihiro
    Ninomiya, Takashi
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 348 - 355
  • [25] Attention based English to Punjabi neural machine translation
    Singh, Shivkaran
    Kumar, M. Anand
    Soman, K. P.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1551 - 1559
  • [26] Measuring and Improving Faithfulness of Attention in Neural Machine Translation
    Moradi, Pooya
    Kambhatla, Nishant
    Sarkar, Anoop
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2791 - 2802
  • [27] A Reinforced Generation of Adversarial Examples for Neural Machine Translation
    Zou, Wei
    Huang, Shujian
    Xie, Jun
    Dai, Xinyu
    Chen, Jiajun
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3486 - 3497
  • [28] Gaussian Multi-head Attention for Simultaneous Machine Translation
    Zhang, Shaolei
    Feng, Yang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3019 - 3030
  • [29] ATTENTION AND SIMULTANEOUS TRANSLATION
    LAWSON, EA
    LANGUAGE AND SPEECH, 1967, 10 : 29 - &
  • [30] Attention over Heads: A Multi-Hop Attention for Neural Machine Translation
    Iida, Shohei
    Kimura, Ryuichiro
    Cui, Hongyi
    Hung, Po-Hsuan
    Utsuro, Takehito
    Nagata, Masaaki
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019): STUDENT RESEARCH WORKSHOP, 2019, : 217 - 222