A Low Complexity Long Short-Term Memory Based Voice Activity Detection

Cited by: 0
Authors
Yang, Ruiting [1]
Liu, Jie [1]
Deng, Xiang [1]
Zheng, Zhuochao [1]
Affiliations
[1] Harman Int, Shenzhen, Peoples R China
Keywords
Voice activity detection; long short-term memory; Gammatone cepstral coefficients; spectral features;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Voice activity detection (VAD) plays an important role in audio processing, but it remains challenging when the voice signal is corrupted by strong, transient noise. This paper proposes an accurate and causal VAD module based on a long short-term memory (LSTM) deep neural network. The input features comprise Gammatone cepstral coefficients (GTCC) and selected spectral features. The low-complexity structure allows the module to be implemented easily in speech processing algorithms and applications. The collected training data are carefully pre-processed and labeled as speech or non-speech and then used to train the LSTM network; experiments show that the proposed VAD effectively distinguishes speech from different types of noisy backgrounds. Its robustness to changes such as varying frame length, moving speech sources, and different spoken languages is further investigated.
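The causal, frame-by-frame operation described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the class name `TinyLSTMVAD`, the layer sizes, the random (untrained) weights, and the 0.5 threshold are all assumptions chosen for illustration, and the per-frame feature vectors stand in for GTCC and spectral features that would be computed elsewhere. The point is only that each decision depends on the current frame and the carried LSTM state, never on future frames.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMVAD:
    """Hypothetical single-layer LSTM with a sigmoid output, run one frame at a time."""

    def __init__(self, n_features, n_hidden, seed=0):
        rng = random.Random(seed)
        n_in = n_features + n_hidden
        # One weight row per hidden unit for each gate:
        # i = input gate, f = forget gate, g = candidate cell, o = output gate.
        # Weights are random placeholders; a real model would be trained.
        self.w = {gate: [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                         for _ in range(n_hidden)] for gate in "ifgo"}
        self.b = {gate: [0.0] * n_hidden for gate in "ifgo"}
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b_out = 0.0
        self.n_hidden = n_hidden
        self.reset()

    def reset(self):
        """Clear the recurrent state before a new audio stream."""
        self.h = [0.0] * self.n_hidden
        self.c = [0.0] * self.n_hidden

    def _gate(self, name, x, activation):
        return [activation(sum(w * v for w, v in zip(row, x)) + bias)
                for row, bias in zip(self.w[name], self.b[name])]

    def step(self, frame_features):
        """Consume one frame's features and return a speech probability."""
        x = list(frame_features) + self.h  # current input plus previous hidden state
        i = self._gate("i", x, sigmoid)
        f = self._gate("f", x, sigmoid)
        g = self._gate("g", x, math.tanh)
        o = self._gate("o", x, sigmoid)
        self.c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, self.c, i, g)]
        self.h = [ok * math.tanh(ck) for ok, ck in zip(o, self.c)]
        logit = sum(w * h for w, h in zip(self.w_out, self.h)) + self.b_out
        return sigmoid(logit)


def run_vad(model, frames, threshold=0.5):
    """Return per-frame speech probabilities and thresholded speech flags."""
    model.reset()
    probs = [model.step(frame) for frame in frames]
    return probs, [p >= threshold for p in probs]
```

Because `step` only reads the current frame and the stored state, replacing a later frame leaves all earlier outputs unchanged, which is what makes such a detector usable in streaming applications.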
Pages: 6
Related papers
50 items in total
  • [41] PMU bad data detection method based on long short-term memory network
    Yang Z.
    Liu H.
    Bi T.
    Yang Q.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2020, 48 (07): 1-9
  • [42] Collective Anomaly Detection Based on Long Short-Term Memory Recurrent Neural Networks
    Bontemps, Loic
    Van Loi Cao
    McDermott, James
    Nhien-An Le-Khac
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2016, 2016, 10018: 141-152
  • [43] A Deep Long Short-Term Memory based classifier for Wireless Intrusion Detection System
    Kasongo, Sydney Mambwe
    Sun, Yanxia
    ICT EXPRESS, 2020, 6 (02): 98-103
  • [44] Fault detection in automated production systems based on a long short-term memory autoencoder
    Windmann, Stefan
    Westerhold, Tim
    AT-AUTOMATISIERUNGSTECHNIK, 2024, 72 (01): 47-58
  • [45] Device Anomaly Detection Algorithm Based on Enhanced Long Short-Term Memory Network
    罗辛
    陈静
    袁德鑫
    杨涛
    Journal of Donghua University (English Edition), 2023, 40 (05): 548-559
  • [46] Electricity price default detection model based on long short-term memory network
    Zhang Jing
    Chen Yan
    Yan Furong
    Wan Quan
    Guo Hongbo
    Liu Junling
    Zhang Mingzhu
    Tan Yuxuan
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ELECTRONICS AND ELECTRICAL ENGINEERING, AUTEEE, 2024: 593-597
  • [47] Adversarial learning for Mirai botnet detection based on long short-term memory and XGBoost
    Vajrobol V.
    Gupta B.B.
    Gaurav A.
    Chuang H.-M.
    International Journal of Cognitive Computing in Engineering, 2024, 5: 153-160
  • [48] Long Short-term Memory Network Based Fatigue Detection with Sequential Mouth Feature
    Fei, Yanling
    Li, Bin
    Wang, Heng
    Tian, Lianfang
    2020 INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS), 2020: 17-22
  • [49] Brain tumor detection: a long short-term memory (LSTM)-based learning model
    Amin, Javaria
    Sharif, Muhammad
    Raza, Mudassar
    Saba, Tanzila
    Sial, Rafiq
    Shad, Shafqat Ali
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20): 15965-15973
  • [50] Short-term Load Forecasting with Distributed Long Short-Term Memory
    Dong, Yi
    Chen, Yang
    Zhao, Xingyu
    Huang, Xiaowei
    2023 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE, ISGT, 2023