Deep Q-network-based noise suppression for robust speech recognition

被引:1
|
作者
Park, Tae-Jun [1 ]
Chang, Joon-Hyuk [1 ]
机构
[1] Hanyang Univ, Dept Elect Engn, Seoul, South Korea
关键词
Deep Q-network; reinforcement learning; speech recognition; noise suppression; speech enhancement; deep neural network;
D O I
10.3906/elk-2011-144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study develops the deep Q-network (DQN)-based noise suppression for robust speech recognition purposes under ambient noise. We thus design a reinforcement algorithm that combines DQN training with a deep neural networks (DNN) to let reinforcement learning (RL) work for complex and high dimensional environments like speech recognition. For this, we elaborate on the DQN training to choose the best action that is the quantized noise suppression gain by the observation of noisy speech signal with the rewards of DQN including both the word error rate (WER) and objective speech quality measure. Experiments demonstrate that the proposed algorithm improves speech recognition in various noisy conditions while reducing the computational burden compared to the DNN-based noise suppression method.
引用
收藏
页码:2362 / 2373
页数:12
相关论文
共 50 条
  • [41] Fuzzy Double Deep Q-Network-Based Gait Pattern Controller for Humanoid Robots
    Li, Tzuu-Hseng S.
    Kuo, Ping-Huan
    Chen, Lin-Han
    Hung, Chia-Ching
    Luan, Po-Chien
    Hsu, Hao-Ping
    Chang, Chien-Hsin
    Hsieh, Yi-Ting
    Lin, Wen-Hsun
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (01) : 147 - 161
  • [42] Minimum based noise suppression for improved automatic speech recognition
    Fernández, J
    Meyer, C
    Fischer, A
    PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 243 - 248
  • [43] Deep Q-Network-Based Route Scheduling for TNC Vehicles With Passengers' Location Differential Privacy
    Shi, Dian
    Ding, Jiahao
    Errapotu, Sai Mounika
    Yue, Hao
    Xu, Wenjun
    Zhou, Xiangwei
    Pan, Miao
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) : 7681 - 7692
  • [44] Noise robust HRRP sequence recognition based on a deep unfolded go decomposition network
    Liu, Mei
    Gao, Xunzhang
    Zhang, Zhiwei
    SIGNAL PROCESSING, 2025, 230
  • [45] Noise Adaptive Deep Belief Network For Robust Speech Features Extraction
    Abdollahi, Mohammadreza
    Nasersharif, Babak
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1491 - 1496
  • [46] Signal trajectory based noise compensation for robust speech recognition
    Yan, Zhi-Jie
    Zhou, Jian-Lai
    Soong, Frank
    Wang, Ren-Hua
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 335 - +
  • [47] Assessment of signal subspace based speech enhancement for noise robust speech recognition
    Hermus, K
    Wambacq, P
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 945 - 948
  • [48] Robust-Noise Speech Recognition Based on Laguerre Filter
    Zhang Xue-ying
    Huang Li-xia
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEM, 2008, : 881 - 885
  • [49] Incorporating a Generative Front-end Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition
    Kundu, Souvik
    Sim, Khe Chai
    Gales, Mark
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2359 - 2363
  • [50] Multi-agent deep Q-network-based metaheuristic algorithm for Nurse Rostering Problem
    Zhang, Xinzhi
    Yang, Yeming
    Zhu, Qingling
    Lin, Qiuzhen
    Chen, Weineng
    Li, Jianqiang
    Coello, Carlos A. Coello
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 87