CONTEXT-DEPENDENT CONNECTIONIST PROBABILITY ESTIMATION IN A HYBRID HIDDEN MARKOV MODEL NEURAL-NET SPEECH RECOGNITION SYSTEM

Cited by: 22
Authors
FRANCO, H
COHEN, M
MORGAN, N
RUMELHART, D
ABRASH, V
Affiliations
[1] INT COMP SCI INST,BERKELEY,CA 94704
[2] STANFORD UNIV,STANFORD,CA 94305
Source
COMPUTER SPEECH AND LANGUAGE | 1994 / Vol. 8 / Issue 03
Keywords
DOI
10.1006/csla.1994.1010
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a training method and a network architecture for estimating context-dependent observation probabilities in the framework of a hybrid hidden Markov model (HMM)/multilayer perceptron (MLP) speaker-independent continuous speech recognition system. The context-dependent modeling approach we present here computes the HMM context-dependent observation probabilities using a Bayesian factorization in terms of context-conditioned posterior phone probabilities, which are computed with a set of MLPs, one for every relevant context. The proposed network architecture shares the input-to-hidden layer among the set of context-dependent MLPs in order to reduce the number of independent parameters. Multiple states for phone models, with different context dependence for each state, are used to model the different context effects at the beginning and end of phonetic segments. A new training procedure that "smooths" networks with different degrees of context dependence is proposed to obtain a robust estimate of the context-dependent probabilities. We have used this new architecture to model generalized biphone phonetic contexts. Tests with the speaker-independent DARPA Resource Management database have shown average reductions in word error rates of 28% using a word-pair grammar, compared to our earlier context-independent HMM/MLP hybrid.
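The Bayesian factorization mentioned in the abstract can be illustrated with a small sketch. It is not the authors' code. It assumes the standard Bayes-rule identity p(x | q, c) / p(x) = P(q | x, c) P(c | x) / (P(q | c) P(c)), where x is the acoustic observation, q a phone, and c a context. The function name, the argument names, and the use of a separate estimate of P(c | x) are hypothetical choices made for illustration; the abstract only states that the context-conditioned phone posteriors come from a set of MLPs.

```python
def scaled_cd_likelihoods(post_phone_given_ctx, post_ctx,
                          prior_phone_given_ctx, prior_ctx):
    """Return p(x | q, c) / p(x) for every (context, phone) pair.

    All names here are hypothetical, chosen for illustration only.
    post_phone_given_ctx[c][q]  : P(q | x, c), e.g. output of the net for context c
    post_ctx[c]                 : P(c | x), assumed to come from a separate estimator
    prior_phone_given_ctx[c][q] : P(q | c), e.g. relative frequencies in training data
    prior_ctx[c]                : P(c)
    """
    scaled = []
    for c, row in enumerate(post_phone_given_ctx):
        scaled.append([
            # Bayes rule: P(q, c | x) / P(q, c), with both terms factored over c.
            row[q] * post_ctx[c] / (prior_phone_given_ctx[c][q] * prior_ctx[c])
            for q in range(len(row))
        ])
    return scaled


# Toy numbers (made up): two contexts, two phones.
example = scaled_cd_likelihoods(
    post_phone_given_ctx=[[0.8, 0.2], [0.5, 0.5]],
    post_ctx=[0.6, 0.4],
    prior_phone_given_ctx=[[0.4, 0.6], [0.5, 0.5]],
    prior_ctx=[0.5, 0.5],
)
```

Because p(x) is the same for every state at a given frame, a scaled likelihood of this kind can stand in for the observation probability during decoding without changing which path scores best.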
Pages: 211 - 222
Page count: 12
Related papers
50 in total
  • [21] Speech recognition using hybrid hidden Markov model and NN classifier
    Kundu A.
    Bayya A.
    International Journal of Speech Technology, 1998, 2 (3) : 227 - 240
  • [22] Speech recognition algorithm based on neural network and hidden Markov model
    Jianhui Z.
    Hongbo G.
    Yuchao L.
    Bo C.
    Journal of China Universities of Posts and Telecommunications, 2018, 25 (04) : 28 - 37
  • [23] Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications
    Niu, Jianwei
    Xie, Lei
    Jia, Lei
    Hu, Na
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [24] FROM SENONES TO CHENONES: TIED CONTEXT-DEPENDENT GRAPHEMES FOR HYBRID SPEECH RECOGNITION
    Le, Duc
    Zhang, Xiaohui
    Zheng, Weiyi
    Fugen, Christian
    Zweig, Geoffrey
    Seltzer, Michael L.
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 457 - 464
  • [25] Automatic Speech Recognition: Comparisons Between Convolutional Neural Networks, Hidden Markov Model and Hybrid Architecture
    Santos, Lyndaines
    Moreira, Nicolas de Araujo
    Sampaio, Robson
    Lima, Raizielle
    Oliveira, Francisco Carlos Mattos Brito
    EXPERT SYSTEMS, 2025, 42 (05)
  • [26] CONTEXT-DEPENDENT LANDMINE DETECTION WITH GROUND-PENETRATING RADAR USING A HIDDEN MARKOV CONTEXT MODEL
    Ratto, Christopher
    Torrione, Peter
    Morton, Kenneth
    Collins, Leslie
    2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 4192 - 4195
  • [27] A Neural Network Hidden Markov Model Hybrid for cursive word recognition
    Knerr, S
    Augustin, E
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1518 - 1520
  • [28] Hybrid Deep Neural Network - Hidden Markov Model (DNN-HMM) Based Speech Emotion Recognition
    Li, Longfei
    Zhao, Yong
    Jiang, Dongmei
    Zhang, Yanning
    Wang, Fengna
    Gonzalez, Isabel
    Valentin, Enescu
    Sahli, Hichem
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 312 - 317
  • [29] Regression-Based Context-Dependent Modeling of Deep Neural Networks for Speech Recognition
    Wang, Guangsen
    Sim, Khe Chai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (11) : 1660 - 1669
  • [30] A neural network model of hidden markov model applied to the auditory periphery for speech processing and recognition
    Ye, DT
    Songhua
    Ying, LX
    Krishnan, SM
    PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 1371 - 1376