Domain adaptation of a speech translation system for lectures by utilizing frequently appearing parallel phrases in-domain

被引：0

作者：

Goto, Norioki ^{[1
]}

Yamamoto, Kazumasa ^{[1
]}

Nakagawa, Seiichi ^{[1
]}

机构：

[1] Toyohashi Univ Technol, Aichi, Japan

来源：

2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2016年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper describes our scheme to translate spoken English lectures into Japanese consisting of an English automatic speech recognition system (ASR) that utilizes a deep neural network (DNN) and an English to Japanese phrase-based statistical machine translation system (SMT). We focused on domain adaptation of the acoustic and translation models. For domain adaptation of the translation model, frequently appearing English-phrases consisting of multiple words are extracted from transcripts of in-domain lectures based on n-gram words or a part of syntax tree. Then we translated the English phrases into Japanese-phrases by hand semi-automatically. These phrase pairs of source and target language are used to learn an SMT model for domain adaptation. An adaptation method directly inserts these phrase pairs into a phrase table or adds them to a parallel corpus. In the experiments, n-gram and syntax tree based methods are compared whilst extracting frequent English-phrases. Furthermore, the adapted phrase table and the parallel corpus are compared. When the frequent English and Japanese phrase pairs based on syntax tree were added to the phrase table, the baseline model was improved.

引用

页数：4

共 14 条

[1] Robust Speech Translation by Domain Adaptation
He, Xiaodong
Deng, Li
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2116 - 2119
[2] Open domain speech recognition & translation:: Lectures and speeches
Fuegen, C.
Kolss, M.
Bernreuther, D.
Paulik, M.
Stueker, S.
Vogel, S.
Waibel, A.
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 569 - 572
[3] Acoustic model adaptation using in-domain background models for dysarthric speech recognition
Sharma, Harsh Vardhan
Hasegawa-Johnson, Mark
COMPUTER SPEECH AND LANGUAGE, 2013, 27 (06): : 1147 - 1162
[4] Adapting a Speech into Sign Language Translation System to a new domain
Lopez-Ludena, V.
San-Segundo, R.
Gonzalez-Morcillo, C.
Lopez, J. C.
Ferreiro, E.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1163 - 1167
[5] Automatic Speech Recognition Adaptation to the IoT Domain Dialogue System
Zembrzuski, Maciej
Jeon, Heesik
Marhula, Joanna
Beksa, Katarzyna
Sikorski, Szymon
Latkowski, Tomasz
Bujnowski, Pawel
FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 215 - 226
[6] Noise robust in-domain children speech enhancement for automatic Punjabi recognition system under mismatched conditions
Bawa, Puneet
Kadyan, Virender
APPLIED ACOUSTICS, 2021, 175
[7] A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
Hosseini-Asl, Ehsan
Zhou, Yingbo
Xiong, Caiming
Socher, Richard
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3758 - 3762
[8] Large Scale Speech-to-Text Translation with Out-of-Domain Corpora using Better Context-Based Models and Domain Adaptation
Junczys-Dowmunt, Marcin
Przybysz, Pawel
Staszuk, Arleta
Kim, Eun-Kyoung
Lee, JaeWon
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2272 - 2276
[9] Content-Equivalent Translated Parallel News Corpus and Extension of Domain Adaptation for Neural Machine Translation
Mino, Hideya
Tanaka, Hideki
Ito, Hitoshi
Goto, Isao
Yamada, Ichiro
Tokunaga, Takenobu
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3616 - 3622
[10] A speech translation system applied to a real-world task/domain and its evaluation using real-world speech data
Nakamura, A
Naito, M
Tsukada, H
Gruhn, R
Sumita, E
Kashioka, N
Nakajima, H
Shimizu, T
Sagisaka, Y
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01): : 142 - 154

← 1 2 →