LOW-RANK MATRIX FACTORIZATION FOR DEEP NEURAL NETWORK TRAINING WITH HIGH-DIMENSIONAL OUTPUT TARGETS

被引:0
|
作者
Sainath, Tara N. [1 ]
Kingsbury, Brian [1 ]
Sindhwani, Vikas [1 ]
Arisoy, Ebru [1 ]
Ramabhadran, Bhuvana [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Deep Neural Networks; Speech Recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
While Deep Neural Networks (DNNs) have achieved tremendous success for large vocabulary continuous speech recognition (LVCSR) tasks, training of these networks is slow. One reason is that DNNs are trained with a large number of training parameters (i.e., 10-50 million) Because networks are trained with a large number of output targets to achieve good performance, the majority of these parameters are in the final weight layer. In this paper, we propose a low-rank matrix factorization of the final weight layer. We apply this low-rank technique to DNNs for both acoustic modeling and language modeling. We show on three different LVCSR tasks ranging between 50-400 hrs, that a low-rank factorization reduces the number of parameters of the network by 30-50%. This results in roughly an equivalent reduction in training time, without a significant loss in final recognition accuracy, compared to a full-rank representation.
引用
收藏
页码:6655 / 6659
页数:5
相关论文
共 50 条
  • [21] Low-Rank Matrix Completion Using Graph Neural Network
    Luong Trung Nguyen
    Shim, Byonghyo
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 17 - 21
  • [22] Quaternion Matrix Factorization for Low-Rank Quaternion Matrix Completion
    Chen, Jiang-Feng
    Wang, Qing-Wen
    Song, Guang-Jing
    Li, Tao
    MATHEMATICS, 2023, 11 (09)
  • [23] Low-rank diffusion matrix estimation for high-dimensional time-changed Levy processes
    Belomestny, Denis
    Trabs, Mathias
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2018, 54 (03): : 1583 - 1621
  • [24] Low-Rank and Sparse Matrix Factorization for Scientific Paper Recommendation in Heterogeneous Network
    Dai, Tao
    Gao, Tianyu
    Zhu, Li
    Cai, Xiaoyan
    Pan, Shirui
    IEEE ACCESS, 2018, 6 : 59015 - 59030
  • [25] Low-Rank Deep Convolutional Neural Network for Multitask Learning
    Su, Fang
    Shang, Hai-Yang
    Wang, Jing-Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [26] An algorithm for low-rank matrix factorization and its applications
    Chen, Baiyu
    Yang, Zi
    Yang, Zhouwang
    NEUROCOMPUTING, 2018, 275 : 1012 - 1020
  • [27] Low-Rank Matrix Factorization With Adaptive Graph Regularizer
    Lu, Gui-Fu
    Wang, Yong
    Zou, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) : 2196 - 2205
  • [28] Multi-dataset Low-rank Matrix Factorization
    Valavi, Hossein
    Ramadge, Peter J.
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
  • [29] Low-rank matrix factorization with multiple Hypergraph regularizer
    Jin, Taisong
    Yu, Jun
    You, Jane
    Zeng, Kun
    Li, Cuihua
    Yu, Zhengtao
    PATTERN RECOGNITION, 2015, 48 (03) : 1011 - 1022
  • [30] Distributed Low-rank Matrix Factorization With Exact Consensus
    Zhu, Zhihui
    Li, Qiuwei
    Yang, Xinshuo
    Tang, Gongguo
    Wakin, Michael B.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32