Cross-language Bootstrapping for Unsupervised Acoustic Model Training: Rapid Development of a Polish Speech Recognition System

Cited by: 0

Authors:

Loeoef, Jonas [1]

Gollan, Christian [1]

Ney, Hermann [1]

Affiliations:

[1] Rhein Westfal TH Aachen, Lehrstuhl Informat 6, Dept Comp Sci, Aachen, Germany

Source:

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009

Keywords:

speech recognition; unsupervised training; cross-language bootstrapping

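DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract: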

This paper describes the rapid development of a Polish speech recognition system. The system was developed without access to any transcribed acoustic training data, through the combined use of cross-language bootstrapping and confidence-based unsupervised acoustic model training. A Spanish acoustic model was ported to Polish using a manually constructed phoneme mapping. This initial model was then refined through iterative recognition and retraining on the untranscribed audio data. The system was trained and evaluated on recordings from the European Parliament, and included several state-of-the-art speech recognition techniques in addition to unsupervised model training. Confidence-based speaker adaptive training using feature space transform adaptation, together with vocal tract length normalization and maximum likelihood linear regression, was used to refine the acoustic model. Through the combination of these techniques, good performance was achieved on the domain of parliamentary speeches.
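The two-stage procedure the abstract describes (port a source-language model through a phoneme mapping, then alternate recognition and confidence-filtered retraining) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the mapping entries, the `Hypothesis` type, the `recognize` and `retrain` callables, the threshold of 0.7, and the iteration count are all hypothetical placeholders, since the abstract gives none of these details.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical Polish -> Spanish phoneme mapping; the entries are made up
# for illustration and are not taken from the paper.
PL_TO_ES: Dict[str, str] = {"a": "a", "e": "e", "sz": "s", "w": "b"}


def port_model(source_model: Dict[str, object],
               mapping: Dict[str, str]) -> Dict[str, object]:
    """Initialise each target phoneme with its mapped source phoneme's parameters."""
    return {target: source_model[source] for target, source in mapping.items()}


@dataclass
class Hypothesis:
    """A recognised utterance with one confidence score per word (assumed format)."""
    utterance_id: str
    words: List[str]
    confidences: List[float]


def select_confident(hyps: List[Hypothesis],
                     threshold: float = 0.7) -> Dict[str, List[Tuple[str, float]]]:
    """Keep only the words whose confidence reaches the threshold."""
    selected: Dict[str, List[Tuple[str, float]]] = {}
    for hyp in hyps:
        kept = [(w, c) for w, c in zip(hyp.words, hyp.confidences) if c >= threshold]
        if kept:  # drop utterances with no confident words
            selected[hyp.utterance_id] = kept
    return selected


def unsupervised_training(model, audio,
                          recognize: Callable, retrain: Callable,
                          iterations: int = 3, threshold: float = 0.7):
    """Alternate recognition and retraining on automatically transcribed audio."""
    for _ in range(iterations):
        hyps = recognize(model, audio)  # transcribe with the current model
        model = retrain(model, select_confident(hyps, threshold))
    return model
```

In this sketch the recogniser and trainer are passed in as callables, so the loop itself stays independent of any particular speech toolkit.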

Pages: 96-99

Page count: 4
