A Low Complexity Long Short-Term Memory Based Voice Activity Detection

被引:0
|
作者
Yang, Ruiting [1 ]
Liu, Jie [1 ]
Deng, Xiang [1 ]
Zheng, Zhuochao [1 ]
机构
[1] Harman Int, Shenzhen, Peoples R China
关键词
Voice activity detection; long short-term memory; Gammatone cepstral coefficients; spectral features;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Voice Activity Detection (VAD) plays an important role in audio processing, but it is also a common challenge when a voice signal is corrupted with strong and transient noise. In this paper, an accurate and causal VAD module using a long short-term memory (LSTM) deep neural network is proposed. A set of features including Gammatone cepstral coefficients (GTCC) and selected spectral features are used. The low complex structure allows it can be easily implemented in speech processing algorithms and applications. With carefully pre-processing and labeling the collected training data in the classes of speech or non-speech and training on the LSTM net, experiments show the proposed VAD is able to distinguish speech from different types of noisy background effectively. Its robustness against changes including varying frame length, moving speech sources and speaking in different languages, are further investigated.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] VOICE CONVERSION USING DEEP BIDIRECTIONAL LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS
    Sun, Lifa
    Kang, Shiyin
    Li, Kun
    Meng, Helen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4869 - 4873
  • [32] Prediction of Short-term Load of Microgrid Based on Multivariable and Multistep Long Short-term Memory
    Li, Dashuang
    SENSORS AND MATERIALS, 2022, 34 (04) : 1275 - 1285
  • [33] Spam SMS Detection Based on Long Short-Term Memory and Recurrent Neural Network
    Alseid, Marya
    Nassif, Ali Bou
    AlShabi, Mohammad
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [34] SHIP DETECTION IN RADAR IMAGE SERIES BASED ON THE LONG SHORT-TERM MEMORY NETWORK
    Xu, Yi
    Sun, Bing
    Li, Chunsheng
    Chen, Jie
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1229 - 1232
  • [35] Intelligent intrusion detection based on federated learning aided long short-term memory
    Zhao, Ruijie
    Yin, Yue
    Shi, Yong
    Xue, Zhi
    PHYSICAL COMMUNICATION, 2020, 42
  • [36] ECG Characteristic Wave Detection Based on Deep Recursive Long Short-Term Memory
    Qi, Jin
    Shi, Peng
    Hu, Lin
    Zhang, Tao
    Xie, Shenghua
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (09) : 1920 - 1924
  • [37] Bed Exit Action Detection Based on Patient Posture with Long Short-Term Memory
    Inoue, Madoka
    Taguchi, Ryo
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 4390 - 4393
  • [38] Epileptic Seizure Detection Based on Stockwell Transform and Bidirectional Long Short-Term Memory
    Geng, Minxing
    Zhou, Weidong
    Liu, Guoyang
    Li, Chaosong
    Zhang, Yanli
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2020, 28 (03) : 573 - 580
  • [39] Exposing DeepFake Video Detection Based on Convolutional Long Short-Term Memory Network
    Zheng Bowen
    Xia Huawei
    Chen Ruidong
    Han Qiankun
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (24)
  • [40] Brain tumor detection: a long short-term memory (LSTM)-based learning model
    Javaria Amin
    Muhammad Sharif
    Mudassar Raza
    Tanzila Saba
    Rafiq Sial
    Shafqat Ali Shad
    Neural Computing and Applications, 2020, 32 : 15965 - 15973