ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020

Cited by: 0

Authors:

Elbayad, Maha [1,4]

Ha Nguyen [1,2]

Bougares, Fethi [3]

Tomashenko, Natalia [2]

Caubriere, Antoine [3]

Lecouteux, Benjamin [1]

Esteve, Yannick [2]

Besacier, Laurent [1]

Affiliations:

[1] LIG Univ Grenoble Alpes, St Martin Dheres, France

[2] LIA Avignon Univ, Avignon, France

[3] LIUM Le Mans Univ, Le Mans, France

[4] INRIA, Grenoble, France

Source:

17TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2020) | 2020

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2020, offline speech translation and simultaneous speech translation. ON-TRAC Consortium is composed of researchers from three French academic laboratories: LIA (Avignon Université), LIG (Université Grenoble Alpes), and LIUM (Le Mans Université). Attention-based encoder-decoder models, trained end-to-end, were used for our submissions to the offline speech translation track. Our contributions focused on data augmentation and ensembling of multiple models. In the simultaneous speech translation track, we build on Transformer-based wait-k models for the text-to-text subtask. For speech-to-text simultaneous translation, we attach a wait-k MT system to a hybrid ASR system. We propose an algorithm to control the latency of the ASR+MT cascade and achieve a good latency-quality trade-off on both subtasks.

Pages: 35-43

Number of pages: 9
