A Fast Learning Method for Multilayer Perceptrons in Automatic Speech Recognition Systems

Cited by: 4
Authors
Cai, Chenghao [1]
Xu, Yanyan [2]
Ke, Dengfeng [3]
Su, Kaile [4]
Affiliations
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Sch Informat Sci & Technol, 35 Qinghua Dong Rd, Beijing 100083, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[4] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4111, Australia
Funding
National Natural Science Foundation of China
DOI
10.1155/2015/797083
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
We propose a fast learning method for multilayer perceptrons (MLPs) on large vocabulary continuous speech recognition (LVCSR) tasks. A preadjusting strategy, based on separating the training data and on a dynamic learning rate governed by a cosine function, is used to increase the accuracy of a randomly initialized MLP. The weight matrices of the preadjusted MLP are then restructured by a method based on singular value decomposition (SVD), which reduces the dimensionality of the MLP. A backpropagation (BP) algorithm that fits the unfolded weight matrices is used to train the restructured MLP, reducing the time complexity of the learning process. Experimental results indicate that on LVCSR tasks, compared with the conventional learning method, this fast learning method achieves a speedup of around 2.0 times while improving both the cross-entropy loss and the frame accuracy. Moreover, it achieves a speedup of approximately 3.5 times with only a slight degradation in cross-entropy loss and frame accuracy. Because the method consumes less time and space than the conventional method, it is better suited to robots with limited hardware.
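The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, not the authors' implementation. It assumes that "restructuring by SVD" means replacing a weight matrix with a rank-k product of two smaller matrices, and it uses a generic cosine decay for the learning rate. The function names `svd_restructure` and `cosine_learning_rate`, the rank `k`, and the rate bounds are all hypothetical choices for illustration.

```python
import numpy as np


def svd_restructure(W, k):
    """Factor W (m x n) into A (m x k) and B (k x n) so that A @ B approximates W.

    Assumption: a plain truncated SVD; the paper's exact restructuring may differ.
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :k] * s[:k]  # scale the first k left singular vectors by their singular values
    B = Vt[:k, :]         # keep the first k right singular vectors
    return A, B


def cosine_learning_rate(step, total_steps, lr_max=0.1, lr_min=0.001):
    """Hypothetical cosine schedule that decays from lr_max to lr_min."""
    frac = step / max(1, total_steps)
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + np.cos(np.pi * frac))


# Illustrative layer size only; not taken from the paper.
rng = np.random.default_rng(0)
W = rng.standard_normal((512, 256))
A, B = svd_restructure(W, k=64)

params_before = W.size           # m * n
params_after = A.size + B.size   # k * (m + n)
rel_error = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(params_before, params_after, rel_error)
```

Under these assumptions a layer with m x n weights costs k(m + n) parameters after factorization, which is smaller than mn whenever k < mn / (m + n). That is one plausible source of the time and space savings the abstract reports.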
Pages: 7