Decision tree-based acoustic models for speech recognition

被引:0
|
作者
Masami Akamine
Jitendra Ajmera
机构
[1] Toshiba Corporate R&D Center,
[2] IBM Research Lab.,undefined
关键词
speech recognition; acoustic modeling; decision trees; probability estimation; likelihood computation;
D O I
暂无
中图分类号
学科分类号
摘要
This article proposes a new acoustic model using decision trees (DTs) as replacements for Gaussian mixture models (GMM) to compute the observation likelihoods for a given hidden Markov model state in a speech recognition system. DTs have a number of advantageous properties, such as that they do not impose restrictions on the number or types of features, and that they automatically perform feature selection. This article explores and exploits DTs for the purpose of large vocabulary speech recognition. Equal and decoding questions have newly been introduced into DTs to directly model gender- and context-dependent acoustic space. Experimental results for the 5k ARPA wall-street-journal task show that context information significantly improves the performance of DT-based acoustic models as expected. Context-dependent DT-based models are highly compact compared to conventional GMM-based acoustic models. This means that the proposed models have effective data-sharing across various context classes.
引用
收藏
相关论文
共 50 条
  • [31] Automatic Speech Emotion Recognition using Auditory Models with Binary Decision Tree and SVM
    Yuncu, Enes
    Hacihabiboglu, Huseyin
    Bozsahin, Cem
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 773 - 778
  • [32] Acoustic Analysis and Decision Tree-Based Shifting Hierarchical Approach for Prediction of Uyghur Prosodic Boundary
    Guljamal Mamateli
    Askar Hamdulla
    WuhanUniversityJournalofNaturalSciences, 2013, 18 (04) : 363 - 368
  • [33] SOME APPLICATIONS OF TREE-BASED MODELING TO SPEECH AND LANGUAGE
    RILEY, MD
    SPEECH AND NATURAL LANGUAGE, 1989, : 339 - 352
  • [34] Predicting transport mode choice preferences in a university district with decision tree-based models
    Diaz-Ramirez, Jenny
    Estrada-Garcia, Juan Alberto
    Figueroa-Sayago, Juliana
    CITY AND ENVIRONMENT INTERACTIONS, 2023, 20
  • [35] Efficient Decision Tree-Based Classification Models to Predict Safety Rating for Bridge Maintenance
    Hong, Jisu
    Jeon, Se-Jin
    JOURNAL OF INFRASTRUCTURE SYSTEMS, 2025, 31 (01)
  • [36] Conversion from Phoneme Based to Grapheme Based Acoustic Models for Speech Recognition
    Zgank, Andrej
    Kacic, Zdravko
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1587 - 1590
  • [37] Concurrent constraint programming and tree-based acoustic modelling
    Neugebauer, M
    LOGIC PROGRAMMING, PROCEEDINGS, 2004, 3132 : 467 - 468
  • [38] Continuous speech recognition based on general factor dependent acoustic models
    Suzuki, H
    Zen, H
    Nankaku, Y
    Miyajima, C
    Tokuda, K
    Kitamura, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 410 - 417
  • [39] Speech emotion recognition based on DNN-decision tree SVM model
    Sun, Linhui
    Zou, Bo
    Fu, Sheng
    Chen, Jia
    Wang, Fu
    SPEECH COMMUNICATION, 2019, 115 : 29 - 37
  • [40] Multilingual acoustic models for speech recognition and synthesis
    Kunzmann, S
    Fischer, V
    Gonzalez, J
    Emam, O
    Günther, C
    Janke, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 745 - 748