A General Artificial Neural Network Extension for HTK

Authors
Zhang, C. [1]
Woodland, P. C. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper describes the recently developed artificial neural network (ANN) modules in the HTK hidden Markov model toolkit, which enable ANN models with very general feed-forward architectures to be used for either acoustic modelling or feature extraction. The HTK ANN extension includes many recent ANN-based speech processing techniques, such as sequence training, model stacking, speaker adaptation, and parameterised activation functions. The implementation allows efficient training by supporting GPUs and various types of data cache. The ANN modules are fully integrated into the rest of the HTK toolkit, which allows existing GMM-HMM methods to be easily used in the ANN-HMM framework. Speech recognition results on a 300-hour DARPA BOLT conversational Mandarin task show that HTK can produce tandem and hybrid systems with state-of-the-art performance on this very challenging task. Furthermore, the flexibility of the implementation is illustrated using demo systems for a Wall Street Journal (WSJ) task. The HTK ANN extension is planned for release in HTK version 3.5.
Pages: 3581-3585
Number of pages: 5
Source: Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH