Audio-Noise Power Spectral Density Estimation Using Long Short-Term Memory

被引:7
|
作者
Li, Xiaofei [1 ,2 ]
Leglaive, Simon [1 ,2 ]
Girin, Laurent [1 ,3 ]
Horaud, Radu [1 ,2 ]
机构
[1] Inria Grenoble Rhone Alpes, F-38330 Montbonnot St Martin, France
[2] Univ Grenoble Alpes, F-38400 St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble INP, GIPSA Lab, F-38400 St Martin Dheres, France
基金
欧洲研究理事会;
关键词
LSTM; noise PSD; speech enhancement; SPEECH ENHANCEMENT;
D O I
10.1109/LSP.2019.2911879
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a method using a long short-term memory (LSTM) network to estimate the noise power spectral density (PSD) of single-channel audio signals represented in the short-time Fourier transform (STFT) domain. An LSTM network common to all frequency bands is trained, which processes each frequency band individually by mapping the noisy STFT magnitude sequence to its corresponding noise PSD sequence. Unlike deep-learning-based speech-enhancement methods, which learn the full-band spectral structure of speech segments, the proposed method exploits the sub-band STFT magnitude evolution of noise with long time dependence, in the spirit of the unsupervised noise estimators described in the literature. Speaker- and speech-independent experiments with different types of noise show that the proposed method outperforms the unsupervised estimators, and it generalizes well to noise types that are not present in the training set.
引用
收藏
页码:918 / 922
页数:5
相关论文
共 50 条
  • [31] State Estimation for DC Microgrids using Modified Long Short-Term Memory Networks
    Adi, Faya Safirra
    Lee, Yee Jin
    Song, Hwachang
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [32] Remaining Useful Life Estimation Using Long Short-Term Memory Deep Learning
    Hsu, Che-Sheng
    Jiang, Jehn-Ruey
    PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 58 - 61
  • [33] Speech Dereverberation Using Long Short-Term Memory
    Mimura, Masato
    Sakai, Shinsuke
    Kawahara, Tatsuya
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2435 - 2439
  • [34] Text Classification Using Long Short-Term Memory
    Sari, Winda Kurnia
    Rini, Dian Palupi
    Malik, Reza Firsandaya
    2019 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS 2019), 2019, : 150 - 155
  • [35] Long Short-Term Memory Networks for Noise Robust Speech Recognition
    Woellmer, Martin
    Sun, Yang
    Eyben, Florian
    Schuller, Bjoern
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2966 - 2969
  • [36] Short-term power prediction for renewable energy using hybrid graph convolutional network and long short-term memory approach
    Liao, Wenlong
    -Jensen, Birgitte Bak
    Pillai, Jayakrishnan Radhakrishna
    Yang, Zhe
    Liu, Kuangpu
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 211
  • [37] Short-term power prediction for renewable energy using hybrid graph convolutional network and long short-term memory approach
    Liao, Wenlong
    Bak-Jensen, Birgitte
    Pillai, Jayakrishnan Radhakrishna
    Yang, Zhe
    Liu, Kuangpu
    arXiv, 2021,
  • [38] Short-term power prediction of photovoltaic power station based on long short-term memory-back-propagation
    Hua, Chi
    Zhu, Erxi
    Kuang, Liang
    Pi, Dechang
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2019, 15 (10)
  • [39] Refining Short-Term Power Load Forecasting: An Optimized Model with Long Short-Term Memory Network
    Hu S.
    Cai W.
    Liu J.
    Shi H.
    Yu J.
    Journal of Computing and Information Technology, 2023, 31 (03) : 151 - 166
  • [40] Behavioral Modeling of GaN Power Amplifiers Using Long Short-Term Memory Networks
    Chen, Peng
    Alsahali, Sattam
    Alt, Alexander
    Lees, Jonathan
    Tasker, Paul J.
    INTERNATIONAL WORKSHOP ON INTEGRATED NONLINEAR MICROWAVE AND MILLIMETRE-WAVE CIRCUITS (INMMIC), 2018,