Unsupervised acoustic model training

被引:0
|
作者
Lamel, L [1 ]
Gauvain, JL [1 ]
Adda, G [1 ]
机构
[1] CNRS, LIMSI, Spoken Language Proc Grp, F-91403 Orsay, France
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes some recent experiments using unsupervised techniques for acoustic model training in order to reduce the system development cost. The approach uses a speech recognizer to transcribe unannotated raw broadcast news data. The hypothesized transcription is used to create labels for the training data. Experiments providing supervision only via the language model training materials show that including texts which are contemporaneous with the audio data is not crucial for success of the approach, and that the acoustic models can be initialized with as little as 10 minutes of manually annotated data. These experiments demonstrate that unsupervised training is a viable training scheme and can dramatically reduce the cost of building acoustic models.
引用
收藏
页码:877 / 880
页数:4
相关论文
共 50 条
  • [1] Unsupervised Acoustic Model Training for the Korean Language
    Laurent, Antoine
    Hartmann, William
    Lamel, Lori
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 469 - 473
  • [2] Lightly supervised and unsupervised acoustic model training
    Lamel, L
    Gauvain, JL
    Adda, G
    COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 115 - 129
  • [3] LATTICE-BASED UNSUPERVISED ACOUSTIC MODEL TRAINING
    Fraga-Silva, Thiago
    Gauvain, Jean-Luc
    Lamel, Lori
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4656 - 4659
  • [4] An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface
    Walter, Oliver
    Despotovic, Vladimir
    Haeb-Umbach, Reinhold
    Gemnzeke, Jort F.
    Ons, Bart
    Van Hamme, Hugo
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1013 - 1017
  • [5] Unsupervised acoustic model training: comparing South African English and isiZulu
    Kleynhans, Neil
    de Wet, Febe
    Barnard, Etienne
    PROCEEDINGS OF THE 2015 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2015, : 136 - 141
  • [6] WEAK TOP-DOWN CONSTRAINTS FOR UNSUPERVISED ACOUSTIC MODEL TRAINING
    Jansen, Aren
    Thomas, Samuel
    Hermansky, Hynek
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8091 - 8095
  • [7] UNSUPERVISED ACOUSTIC AND LANGUAGE MODEL TRAINING WITH SMALL AMOUNTS OF LABELLED DATA
    Novotney, Scott
    Schwartz, Richard
    Ma, Jeff
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4297 - 4300
  • [8] Unsupervised versus Supervised Training of Acoustic Models
    Ma, Jeff
    Schwartz, Richard
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2374 - 2377
  • [9] Data Selection from Multiple ASR Systems' Hypotheses for Unsupervised Acoustic Model Training
    Li, Sheng
    Akita, Yuya
    Kawahara, Tatsuya
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5875 - 5879
  • [10] EFFICIENT MULTI-LINGUAL UNSUPERVISED ACOUSTIC MODEL TRAINING UNDER MISMATCH CONDITIONS
    Saiko, Masahiro
    Yamamoto, Hitoshi
    Isotani, Ryosuke
    Hori, Chiori
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 24 - 29