Deep Q-network-based noise suppression for robust speech recognition

被引:1
|
作者
Park, Tae-Jun [1 ]
Chang, Joon-Hyuk [1 ]
机构
[1] Hanyang Univ, Dept Elect Engn, Seoul, South Korea
关键词
Deep Q-network; reinforcement learning; speech recognition; noise suppression; speech enhancement; deep neural network;
D O I
10.3906/elk-2011-144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study develops the deep Q-network (DQN)-based noise suppression for robust speech recognition purposes under ambient noise. We thus design a reinforcement algorithm that combines DQN training with a deep neural networks (DNN) to let reinforcement learning (RL) work for complex and high dimensional environments like speech recognition. For this, we elaborate on the DQN training to choose the best action that is the quantized noise suppression gain by the observation of noisy speech signal with the rewards of DQN including both the word error rate (WER) and objective speech quality measure. Experiments demonstrate that the proposed algorithm improves speech recognition in various noisy conditions while reducing the computational burden compared to the DNN-based noise suppression method.
引用
收藏
页码:2362 / 2373
页数:12
相关论文
共 50 条
  • [1] Deep Q-network-based noise suppression for robust speech recognition
    Park T.-J.
    Chang J.-H.
    Turkish Journal of Electrical Engineering and Computer Sciences, 2021, 25 (09) : 2362 - 2373
  • [2] Robust noise suppression methods in speech recognition
    Cui, Yi
    Zhang, Dong
    Shi, Liangping
    Chen, Liyuan
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14
  • [3] Binaural Deep Neural Network for Noise Robust Automatic Speech Recognition
    Jiang, Yi
    Zu, Yuan-Yuan
    INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND AUTOMATION (ICCEA 2014), 2014, : 512 - 517
  • [4] Deep Neural Network Based Speech Separation for Robust Speech Recognition
    Tu Yanhui
    Jun, Du
    Xu Yong
    Dai Lirong
    Chin-Hui, Lee
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 532 - 536
  • [5] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
    Fan, Hao-teng
    Lin, Pao-han
    Hung, Jeih-weih
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
  • [6] Heuristic-Deep Q-Network-based Network Slicing in LoRaWAN
    Mardi, Fatima Zahra
    Bagaa, Miloud
    Hadjadj-Aoul, Yassine
    Benamar, Nabil
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4731 - 4736
  • [7] Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition
    Tan, Tian
    Qian, Yanmin
    Hu, Hu
    Zhou, Ying
    Ding, Wen
    Yu, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1393 - 1405
  • [8] On a Deep Q-Network-based Approach for Active Queue Management
    AlWahab, Dhulfiqar A.
    Gombos, Gergo
    Laki, Sandor
    2021 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT (EUCNC/6G SUMMIT), 2021, : 371 - 376
  • [9] Deep Q-network-based traffic signal control models
    Park, Sangmin
    Han, Eum
    Park, Sungho
    Jeong, Harim
    Yun, Ilsoo
    PLOS ONE, 2021, 16 (09):
  • [10] Noise suppression based on auditory-like filters for robust speech recognition
    Zhao, JH
    Xie, X
    Kuang, JM
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 560 - 563