Cross-language acoustic model refinement forthe Indonesian language

被引:0
|
作者
Martin, T [1 ]
Sridharan, S [1 ]
机构
[1] Queensland Univ Technol, Brisbane, Qld 4001, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Porting ASR capabilities to many languages is hindered by a lack of transcribed acoustic data. Cross-language adaptation techniques seek to address this problem by substituting models trained in resource-rich source languages to recognise speech in resource-poor target languages. The differences in co-articulatory effects between the source and target languages, together with unwanted pronunciation and channel variation, result in recognition rates that are typically much worse then those achieved by well trained monolingual systems. In this paper, we present a technique which makes more effective use of limited adaptation data by structuring the state distributions to suit the co-articulatory occurrences in the target language. Additionally the proposed technique provides a more suitable method for synthesising unseen contexts. Evaluation of this technique is presented for a word recognition task using English and Spanish source language acoustic models trained using Switchboard and CallHome databases respectively. Using 25 minutes of Indonesian speech for target language adaptation data, this technique achieved an absolute improvement of 3.69% and 6.31% for English and Spanish respectively, when compared to traditional adaptation techniques. Using 90 minutes of adaptation data, an absolute improvement of 3.22% and 3.07% was achieved.
引用
收藏
页码:865 / 868
页数:4
相关论文
共 50 条
  • [1] SPEAKER ADAPTATION OF A MULTILINGUAL ACOUSTIC MODEL FOR CROSS-LANGUAGE SYNTHESIS
    Himawan, Ivan
    Aryal, Sandesh
    Ouyang, Iris
    Kang, Sam
    Lanchantin, Pierre
    King, Simon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7629 - 7633
  • [2] Cross-Language and Language-Specific Acoustic Correlates of Gemination in Berber and Japanese
    Bouarourou, Fayssal
    Koya, Tomoki
    Bouzidi, Said
    Vaxelaire, Beatrice
    Sock, Rudolph
    2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING (ICNLSP), 2018, : 160 - 165
  • [3] Cross-language and language-specific acoustic correlates of gemination in Berber and Japanese
    U.R. 1339 LILPA, Equipe de Recherche Parole et Cognition, Institut de Phonétique de Strasbourg , Strasbourg, France
    不详
    不详
    Int. Conf. Nat. Lang. Speech Process., ICNLSP, (1-6):
  • [4] An acoustic phonetic explanation for cross-language patterns in labialization
    Hogan, JT
    TWENTY-THIRD LACUS FORUM 1996, 1997, : 601 - 607
  • [5] A CROSS-LANGUAGE ACOUSTIC SPACE FOR VOCALIC PHONATION DISTINCTIONS
    Keating, Patricia
    Kuang, Jianjing
    Garellek, Marc
    Esposito, Christina M.
    Khan, Sameer ud Dowla
    LANGUAGE, 2023, 99 (02) : 351 - 389
  • [6] ZERO-SHOT PRONUNCIATION LEXICONS FOR CROSS-LANGUAGE ACOUSTIC MODEL TRANSFER
    Wiesner, Matthew
    Adams, Oliver
    Yarowsky, David
    Trmal, Jan
    Khudanpur, Sanjeev
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 1048 - 1054
  • [7] Language and cognition: A cross-language perspective
    Chen, HC
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2004, 39 (5-6) : 148 - 148
  • [8] Cross-Language Experiment
    Stastny, Jakub
    Sovka, Pavel
    RADIOENGINEERING, 2003, 12 (03) : 37 - 41
  • [9] Indonesian-English Transitive Translation for Cross-Language Information Retrieval
    Adriani, Mirna
    Hayurani, Herika
    Sari, Syandra
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 127 - 133
  • [10] CROSS-LANGUAGE PSYCHOLINGUISTICS
    CUTLER, A
    LINGUISTICS, 1985, 23 (05) : 659 - 667