Domain adaptation of a speech translation system for lectures by utilizing frequently appearing parallel phrases in-domain

被引:0
|
作者
Goto, Norioki [1 ]
Yamamoto, Kazumasa [1 ]
Nakagawa, Seiichi [1 ]
机构
[1] Toyohashi Univ Technol, Aichi, Japan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes our scheme to translate spoken English lectures into Japanese consisting of an English automatic speech recognition system (ASR) that utilizes a deep neural network (DNN) and an English to Japanese phrase-based statistical machine translation system (SMT). We focused on domain adaptation of the acoustic and translation models. For domain adaptation of the translation model, frequently appearing English-phrases consisting of multiple words are extracted from transcripts of in-domain lectures based on n-gram words or a part of syntax tree. Then we translated the English phrases into Japanese-phrases by hand semi-automatically. These phrase pairs of source and target language are used to learn an SMT model for domain adaptation. An adaptation method directly inserts these phrase pairs into a phrase table or adds them to a parallel corpus. In the experiments, n-gram and syntax tree based methods are compared whilst extracting frequent English-phrases. Furthermore, the adapted phrase table and the parallel corpus are compared. When the frequent English and Japanese phrase pairs based on syntax tree were added to the phrase table, the baseline model was improved.
引用
收藏
页数:4
相关论文
共 14 条
  • [1] Robust Speech Translation by Domain Adaptation
    He, Xiaodong
    Deng, Li
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2116 - 2119
  • [2] Open domain speech recognition & translation:: Lectures and speeches
    Fuegen, C.
    Kolss, M.
    Bernreuther, D.
    Paulik, M.
    Stueker, S.
    Vogel, S.
    Waibel, A.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 569 - 572
  • [3] Acoustic model adaptation using in-domain background models for dysarthric speech recognition
    Sharma, Harsh Vardhan
    Hasegawa-Johnson, Mark
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (06): : 1147 - 1162
  • [4] Adapting a Speech into Sign Language Translation System to a new domain
    Lopez-Ludena, V.
    San-Segundo, R.
    Gonzalez-Morcillo, C.
    Lopez, J. C.
    Ferreiro, E.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1163 - 1167
  • [5] Automatic Speech Recognition Adaptation to the IoT Domain Dialogue System
    Zembrzuski, Maciej
    Jeon, Heesik
    Marhula, Joanna
    Beksa, Katarzyna
    Sikorski, Szymon
    Latkowski, Tomasz
    Bujnowski, Pawel
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 215 - 226
  • [6] Noise robust in-domain children speech enhancement for automatic Punjabi recognition system under mismatched conditions
    Bawa, Puneet
    Kadyan, Virender
    APPLIED ACOUSTICS, 2021, 175
  • [7] A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
    Hosseini-Asl, Ehsan
    Zhou, Yingbo
    Xiong, Caiming
    Socher, Richard
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3758 - 3762
  • [8] Large Scale Speech-to-Text Translation with Out-of-Domain Corpora using Better Context-Based Models and Domain Adaptation
    Junczys-Dowmunt, Marcin
    Przybysz, Pawel
    Staszuk, Arleta
    Kim, Eun-Kyoung
    Lee, JaeWon
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2272 - 2276
  • [9] Content-Equivalent Translated Parallel News Corpus and Extension of Domain Adaptation for Neural Machine Translation
    Mino, Hideya
    Tanaka, Hideki
    Ito, Hitoshi
    Goto, Isao
    Yamada, Ichiro
    Tokunaga, Takenobu
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3616 - 3622
  • [10] A speech translation system applied to a real-world task/domain and its evaluation using real-world speech data
    Nakamura, A
    Naito, M
    Tsukada, H
    Gruhn, R
    Sumita, E
    Kashioka, N
    Nakajima, H
    Shimizu, T
    Sagisaka, Y
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01): : 142 - 154