Decision tree-based acoustic models for speech recognition

被引:0
|
作者
Masami Akamine
Jitendra Ajmera
机构
[1] Toshiba Corporate R&D Center,
[2] IBM Research Lab.,undefined
关键词
speech recognition; acoustic modeling; decision trees; probability estimation; likelihood computation;
D O I
暂无
中图分类号
学科分类号
摘要
This article proposes a new acoustic model using decision trees (DTs) as replacements for Gaussian mixture models (GMM) to compute the observation likelihoods for a given hidden Markov model state in a speech recognition system. DTs have a number of advantageous properties, such as that they do not impose restrictions on the number or types of features, and that they automatically perform feature selection. This article explores and exploits DTs for the purpose of large vocabulary speech recognition. Equal and decoding questions have newly been introduced into DTs to directly model gender- and context-dependent acoustic space. Experimental results for the 5k ARPA wall-street-journal task show that context information significantly improves the performance of DT-based acoustic models as expected. Context-dependent DT-based models are highly compact compared to conventional GMM-based acoustic models. This means that the proposed models have effective data-sharing across various context classes.
引用
收藏
相关论文
共 50 条
  • [41] Decision tree based mandarin tone model and its application to speech recognition
    Cao, Y
    Deng, YG
    Zhang, H
    Huang, TY
    Xu, B
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1759 - 1762
  • [42] Dynamically configurable acoustic models for speech recognition
    Hwang, MY
    Huang, XD
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 669 - 672
  • [43] Compact Acoustic Models for Embedded Speech Recognition
    Levy, Christophe
    Linares, Georges
    Bonastre, Jean-Francois
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [44] Acoustic-to-Phrase Models for Speech Recognition
    Gaur, Yashesh
    Li, Jinyu
    Meng, Zhong
    Gong, Yifan
    INTERSPEECH 2019, 2019, : 2240 - 2244
  • [45] Compact Acoustic Models for Embedded Speech Recognition
    Christophe Lévy
    Georges Linarès
    Jean-François Bonastre
    EURASIP Journal on Audio, Speech, and Music Processing, 2009
  • [46] PyXAI: An XAI Library for Tree-Based Models
    Audemard, Gilles
    Lagniez, Jean-Marie
    Marquis, Pierre
    Szczepanski, Nicolas
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8601 - 8605
  • [47] ON MARGINAL FEATURE ATTRIBUTIONS OF TREE-BASED MODELS
    Filom, Khashayar
    Miroshnikov, Alexey
    Kotsiopoulos, Konstandinos
    Kannan, Arjun ravi
    FOUNDATIONS OF DATA SCIENCE, 2024, 6 (04): : 395 - 467
  • [48] The benefits of tree-based models for stock selection
    Zhu, Min
    Philpotts, David
    Stevenson, Maxwell J.
    JOURNAL OF ASSET MANAGEMENT, 2012, 13 (06) : 437 - 448
  • [49] The benefits of tree-based models for stock selection
    Min Zhu
    David Philpotts
    Maxwell J Stevenson
    Journal of Asset Management, 2012, 13 (6) : 437 - 448
  • [50] Comparison of tree-based ensemble models for regression
    Park, Sangho
    Kim, Chanmin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2022, 29 (05) : 561 - 590