IMPROVEMENTS TO FILTERBANK AND DELTA LEARNING WITHIN A DEEP NEURAL NETWORK FRAMEWORK

Cited by: 0
Authors
Sainath, Tara N. [1]
Kingsbury, Brian [1]
Mohamed, Abdel-rahman
Saon, George [1]
Ramabhadran, Bhuvana [1]
Affiliations
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
SPEECH RECOGNITION
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Many features used in speech recognition tasks are hand-crafted and are not always related to the objective at hand, that is, minimizing word error rate. Recently, we showed that replacing a perceptually motivated mel-filter bank with a filter bank layer that is learned jointly with the rest of a deep neural network was promising. In this paper, we extend filter learning to a speaker-adapted, state-of-the-art system. First, we incorporate delta learning into the filter learning framework. Second, we incorporate various speaker adaptation techniques, including VTLN warping and speaker identity features. On a 50-hour English Broadcast News task, we show that we can achieve a 5% relative improvement in word error rate (WER) using filter and delta learning, compared to having a fixed set of filters and deltas. Furthermore, after speaker adaptation, we find that filter and delta learning yields a 3% relative improvement in WER compared to a state-of-the-art CNN.
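For orientation, the "fixed set of filters and deltas" that the abstract uses as its baseline can be sketched as two linear operations: a mel filter bank applied to a power spectrum, and a regression over neighboring frames. The NumPy sketch below is an illustration under stated assumptions, not the authors' implementation: the function names, the filter count, the sample rate, and the delta window are all hypothetical choices. In the learned variant described in the abstract, the filter matrix and the delta coefficients would be trainable parameters updated jointly with the network instead of constants; that training loop is not shown here.

```python
import numpy as np


def mel_filterbank(n_filters, n_fft_bins, sample_rate):
    """Fixed triangular mel filters, shape (n_filters, n_fft_bins).

    Hypothetical helper: a learned filter bank layer could start from
    these weights and then update them by backpropagation.
    """
    def hz_to_mel(f):
        return 2595.0 * np.log10(1.0 + f / 700.0)

    def mel_to_hz(m):
        return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

    nyquist = sample_rate / 2.0
    # n_filters + 2 edge points, equally spaced on the mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(nyquist), n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    bin_freqs = np.linspace(0.0, nyquist, n_fft_bins)

    fb = np.zeros((n_filters, n_fft_bins))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (bin_freqs - left) / (center - left)
        falling = (right - bin_freqs) / (right - center)
        # Triangle: minimum of the two slopes, floored at zero.
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def log_filterbank_features(power_spectrum, filters, eps=1e-10):
    """Apply the filters to (T, n_fft_bins) power spectra and take the log."""
    return np.log(power_spectrum @ filters.T + eps)


def deltas(features, window=2):
    """Fixed regression deltas over +/- `window` frames, edge-padded.

    features: array of shape (T, D). Returns an array of the same shape.
    """
    n_frames = features.shape[0]
    padded = np.pad(features, ((window, window), (0, 0)), mode="edge")
    denom = 2.0 * sum(n * n for n in range(1, window + 1))
    out = np.zeros(features.shape, dtype=float)
    for n in range(1, window + 1):
        ahead = padded[window + n : window + n + n_frames]
        behind = padded[window - n : window - n + n_frames]
        out += n * (ahead - behind)
    return out / denom


# Toy usage with random stand-in "power spectra" (not real speech data).
rng = np.random.default_rng(0)
power = rng.random((20, 129))
filters = mel_filterbank(n_filters=8, n_fft_bins=129, sample_rate=16000)
static = log_filterbank_features(power, filters)
features = np.concatenate([static, deltas(static)], axis=1)
```

Because both steps are linear maps (followed by a log in the filter bank case), each can in principle be expressed as a network layer with weights initialized to the fixed values above, which is the general idea the abstract refers to as filter and delta learning.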
Pages: 5