IMPROVEMENTS TO FILTERBANK AND DELTA LEARNING WITHIN A DEEP NEURAL NETWORK FRAMEWORK

被引:0
|
作者
Sainath, Tara N. [1 ]
Kingsbury, Brian [1 ]
Mohamed, Abdel-rahman
Saon, George [1 ]
Ramabhadran, Bhuvana [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many features used in speech recognition tasks are hand-crafted and are not always related to the objective at hand, that is minimizing word error rate. Recently, we showed that replacing a perceptually motivated mel-filter bank with a filter bank layer that is learned jointly with the rest of a deep neural network was promising. In this paper, we extend filter learning to a speaker-adapted, state-of-the-art system. First, we incorporate delta learning into the filter learning framework. Second, we incorporate various speaker adaptation techniques, including VTLN warping and speaker identity features. On a 50-hour English Broadcast News task, we show that we can achieve a 5% relative improvement in word error rate (WER) using the filter and delta learning, compared to having a fixed set of filters and deltas. Furthermore, after speaker adaptation, we find that filter and delta learning allows for a 3% relative improvement in WER compared to a state-of-the-art CNN.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Obstructive sleep apnoea detection using convolutional neural network based deep learning framework
    Dey D.
    Chaudhuri S.
    Munshi S.
    Biomedical Engineering Letters, 2018, 8 (1) : 95 - 100
  • [42] An Optimal Framework for SDN Based on Deep Neural Network
    Abdallah, Abdallah
    Ishak, Mohamad Khairi
    Sani, Nor Samsiah
    Khan, Imran
    Albogamy, Fahad R.
    Amano, Hirofumi
    Mostafa, Samih M.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1125 - 1140
  • [43] Learning to Compensate: A Deep Neural Network Framework for 5G Power Amplifier Compensation
    Chen, Po-Yu
    Chen, Hao
    Tsai, Yi-Min
    Kuo, Hsien-Kai
    Huang, Hantao
    Chen, Hsin-Hung
    Yan, Sheng-Hong
    Ou, Wei-Lun
    Cheng, Chia-Ming
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [44] A novel adversarial learning framework in deep convolutional neural network for intelligent diagnosis of mechanical faults
    Han, Te
    Liu, Chao
    Yang, Wenguang
    Jiang, Dongxiang
    KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 474 - 487
  • [45] Implementation of Deep Learning Neural Network for Real-time Object Recognition in OpenCL Framework
    Luo, Yukui
    Li, Shuai
    Sun, Kuangyuan
    Renteria, Raul
    Choi, Ken
    PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017, : 298 - 299
  • [46] Bayesian Learning for Deep Neural Network Adaptation
    Xie, Xurong
    Liu, Xunying
    Lee, Tan
    Wang, Lan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2096 - 2110
  • [47] Deep Reinforcement Learning with the Random Neural Network
    Serrano, Will
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [48] Efficient Framework for Brain Tumour Classification using Hierarchical Deep Learning Neural Network Classifier
    Shajin, Francis H.
    Rajesh, P.
    Rao, Venu Kadur Nagoji
    Salini, P.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (03): : 750 - 757
  • [49] Deep Learning Binary Neural Network on an FPGA
    Zhou, Yuteng
    Redkar, Shrutika
    Huang, Xinming
    2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017, : 281 - 284
  • [50] RANDOM NEURAL NETWORK METHODS AND DEEP LEARNING
    Yin, Yonghua
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2021, 35 (01) : 6 - 36