CONTEXT-DEPENDENT CONNECTIONIST PROBABILITY ESTIMATION IN A HYBRID HIDDEN MARKOV MODEL NEURAL-NET SPEECH RECOGNITION SYSTEM

被引:22
|
作者
FRANCO, H
COHEN, M
MORGAN, N
RUMELHART, D
ABRASH, V
机构
[1] INT COMP SCI INST,BERKELEY,CA 94704
[2] STANFORD UNIV,STANFORD,CA 94305
来源
COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 03期
关键词
D O I
10.1006/csla.1994.1010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a training method and a network architecture for estimating context-dependent observation probabilities in the framework of a hybrid hidden Markov model (HMM)/multi layer perceptron (MLP) speaker-independent continuous speech recognition system. The context-dependent modeling approach we present here computes the HMM context-dependent observation probabilities using a Bayesian factorization in terms of context-conditioned posterior phone probabilities which are computed with a set of MLPs, one for every relevant context. The proposed network architecture shares the input-to-hidden layer among the set of context dependent MLPs in order to reduce the number of independent parameters. Multiple states for phone models with different context dependence for each state are used to model the different context effects at the beginning and end of phonetic segments. A new training procedure that ''smooths'' networks with different degrees of context depedence is proposed to obtain a robust estimate of the context-dependent probabilities. We have used this new architecture to model generalized biphone phonetic contexts. Tests with the speaker-independent DARPA Resource Management database have shown average reductions in word error rates of 28% using a word-pair grammar, compared to our earlier context-independent HMM/MLP hybrid.
引用
收藏
页码:211 / 222
页数:12
相关论文
共 50 条
  • [41] REFINEMENTS OF REGRESSION-BASED CONTEXT-DEPENDENT MODELLING OF DEEP NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION
    Wang, Guangsen
    Sim, Khe Chai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [42] Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
    Dahl, George E.
    Yu, Dong
    Deng, Li
    Acero, Alex
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 30 - 42
  • [43] A study of speech recognition system based on the Hidden Markov Model with Gaussian-Mixture
    Ben Hazem, Zied
    Zouhir, Youssef
    Ouni, Kais
    2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
  • [44] Context-dependent tree-structured image classification using the QDA distortion measure and the hidden Markov model
    Ozonat, KM
    Yoon, SH
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 1887 - 1890
  • [45] Hybrid approaches to frontal view face recognition using the hidden Markov model and neural network
    Yoon, KS
    Ham, YK
    Park, RH
    PATTERN RECOGNITION, 1998, 31 (03) : 283 - 293
  • [46] Implementation and Optimization of a Speech Recognition System Based on Hidden Markov Model Using Genetic Algorithm
    Farsi, Hassan
    Saleh, Reza
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [47] A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
    Oura, Keiichiro
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (11): : 2693 - 2700
  • [48] Continuous action segmentation and recognition using hybrid convolutional neural network-hidden Markov model model
    Lei, Jun
    Li, Guohui
    Zhang, Jun
    Guo, Qiang
    Tu, Dan
    IET COMPUTER VISION, 2016, 10 (06) : 537 - 544
  • [49] Automatic phoneme recognition in Venezuelan continuous speech based on hidden Markov models and artificial neural networks hybrid systems
    Jabbour, Georges
    Maldonado, Jose Luciano
    CIENCIA E INGENIERIA, 2014, 35 (01): : 29 - 38
  • [50] A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
    Zi-Rui Wang
    Jun Du
    Wen-Chao Wang
    Jian-Fang Zhai
    Jin-Shui Hu
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 241 - 251