Cross-language Bootstrapping for Unsupervised Acoustic Model Training: Rapid Development of a Polish Speech Recognition System

被引:0
|
作者
Loeoef, Jonas [1 ]
Gollan, Christian [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Lehrstuhl Informat 6, Dept Comp Sci, Aachen, Germany
关键词
speech recognition; unsupervised training; cross-language bootstrapping;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the rapid development of a Polish language speech recognition system. The system development was performed without access to any transcribed acoustic training data. This was achieved through the combined use of cross-language bootstrapping and confidence based unsupervised acoustic model training. A Spanish acoustic model was ported to Polish, through the use of a manually constructed phoneme mapping. This initial model was refined through iterative recognition and retraining of the untranscribed audio data. The system was trained and evaluated on recordings from the European Parliament, and included several state-of-the-art speech recognition techniques in addition to the use of unsupervised model training. Confidence based speaker adaptive training using features space transform adaptation, as well as vocal tract length normalization and maximum likelihood linear regression, was used to refine the acoustic model. Through the combination of the different techniques, good performance was achieved on the domain of parliamentary speeches.
引用
收藏
页码:96 / 99
页数:4
相关论文
共 50 条
  • [21] Unsupervised training of acoustic models for large vocabulary continuous speech recognition
    Wessel, F
    Ney, H
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 307 - 310
  • [22] An improved cross-language model adaptation method for speech synthesis
    Liu, Hang
    Ling, Zhen-Hua
    Guo, Wu
    Dai, Li-Rong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2011, 24 (04): : 457 - 463
  • [23] CROSS-LANGUAGE SPEECH-PERCEPTION IN ADULTS - PHONEMIC, PHONETIC, AND ACOUSTIC CONTRIBUTIONS
    POLKA, L
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (06): : 2961 - 2977
  • [24] SPEAKER ADAPTATION OF A MULTILINGUAL ACOUSTIC MODEL FOR CROSS-LANGUAGE SYNTHESIS
    Himawan, Ivan
    Aryal, Sandesh
    Ouyang, Iris
    Kang, Sam
    Lanchantin, Pierre
    King, Simon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7629 - 7633
  • [25] RAPID BOOTSTRAPPING OF A UKRAINIAN LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    Schlippe, Tim
    Volovyk, Mykola
    Yurchenko, Kateryna
    Schultz, Tanja
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7329 - 7333
  • [26] Development of integral model of speech recognition system for Uzbek language
    Musaev, Muhammadjon
    Khujayorov, Ilyos
    Ochilov, Mannon
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [27] An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface
    Walter, Oliver
    Despotovic, Vladimir
    Haeb-Umbach, Reinhold
    Gemnzeke, Jort F.
    Ons, Bart
    Van Hamme, Hugo
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1013 - 1017
  • [28] Chinese-English bilingual phone modeling for cross-language speech recognition
    Yu, SM
    Zhang, SW
    Xu, B
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 917 - 920
  • [29] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [30] Recognition of Cross-Language Acoustic Emotional Valence Using Stacked Ensemble Learning
    Zvarevashe, Kudakwashe
    Olugbara, Oludayo O.
    ALGORITHMS, 2020, 13 (10)