Noise Robust Voice Activity Detection Using Features Extracted From the Time-Domain Autocorrelation Function

被引:0
|
作者
Ghaemmaghami, Houman [1 ]
Baker, Brendan [1 ]
Vogt, Robbie [1 ]
Sridharan, Sridha [1 ]
机构
[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld 4001, Australia
关键词
voice activity detection; high noise; autocorrelation; zero-crossing rate; time-domain analysis; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a method of voice activity detection (VAD) for high noise scenarios, using a noise robust voiced speech detection feature. The developed method is based on the fusion of two systems. The first system utilises the maximum peak of the normalised time-domain autocorrelation function (MaxPeak). The second system uses a novel combination of cross-correlation and zero-crossing rate of the normalised autocorrelation to approximate a measure of signal pitch and periodicity (CrossCorr) that is hypothesised to be noise robust. The score outputs by the two systems are then merged using weighted sum fusion to create the proposed autocorrelation zero-crossing rate (AZR) VAD. Accuracy of AZR was compared to state-of-the-art and standardised VAD methods and was shown to outperform the best performing system with an average relative improvement of 24.8% in half-total error rate (HTER) on the QUT-NOISE-TIMIT database created using real recordings from high-noise environments.
引用
收藏
页码:3118 / 3121
页数:4
相关论文
共 50 条
  • [21] Detection of an invisible needle in ultrasound using a probabilistic SVM and time-domain features
    Beigi, Parmida
    Rohling, Robert
    Salcudean, Tim
    Lessoway, Victoria A.
    Ng, Gary C.
    ULTRASONICS, 2017, 78 : 18 - 22
  • [22] A robust voice activity detection based on noise eigenspace projection
    Ying, Dongwen
    Shi, Yu
    Soong, Frank
    Dang, Jianwu
    Lu, Xugang
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 76 - +
  • [23] Noise robust model-based Voice Activity Detection
    de la Torre, Angel
    Ramirez, Javier
    Benitez, Carmen
    Segura, Jose C.
    Garcia, Luz
    Rubio, Antonio J.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1954 - 1957
  • [24] On training targets for noise-robust voice activity detection
    Braun, Sebastian
    Tashev, Ivan
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 421 - 425
  • [25] Robust voice activity detection algorithm for estimating noise spectrum
    Woo, KH
    Yang, TY
    Park, KJ
    Lee, C
    ELECTRONICS LETTERS, 2000, 36 (02) : 180 - 181
  • [26] Speech Waveform Compression Using Robust Adaptive Voice Activity Detection for Nonstationary Noise
    Syed, Waheeduddin Q.
    Wu, Hsiao-Chun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008,
  • [27] ADA-VAD: UNPAIRED ADVERSARIAL DOMAIN ADAPTATION FOR NOISE-ROBUST VOICE ACTIVITY DETECTION
    Kim, Taesoo
    Chang, Jiho
    Ko, Jong Hwan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7327 - 7331
  • [28] Movement intention detection from SEMG signals using time-domain features and discriminant analysis classifiers
    Herle, S.
    2018 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2018,
  • [29] A voice activity detection algorithm using deep learning in time–frequency domain
    Samira Mavaddati
    Neural Computing and Applications, 2025, 37 (4) : 2581 - 2595
  • [30] Extracting time-domain Green's function estimates from ambient seismic noise
    Sabra, KG
    Gerstoft, P
    Roux, P
    Kuperman, WA
    Fehler, MC
    GEOPHYSICAL RESEARCH LETTERS, 2005, 32 (03) : 1 - 5