Noise Robust Voice Activity Detection Using Features Extracted From the Time-Domain Autocorrelation Function

Cited by: 0
Authors
Ghaemmaghami, Houman [1 ]
Baker, Brendan [1 ]
Vogt, Robbie [1 ]
Sridharan, Sridha [1 ]
Affiliations
[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld 4001, Australia
Keywords
voice activity detection; high noise; autocorrelation; zero-crossing rate; time-domain analysis; SPEECH;
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
This paper presents a method of voice activity detection (VAD) for high-noise scenarios, using a noise-robust voiced speech detection feature. The developed method is based on the fusion of two systems. The first system utilises the maximum peak of the normalised time-domain autocorrelation function (MaxPeak). The second system uses a novel combination of cross-correlation and zero-crossing rate of the normalised autocorrelation to approximate a measure of signal pitch and periodicity (CrossCorr) that is hypothesised to be noise robust. The score outputs of the two systems are then merged using weighted sum fusion to create the proposed autocorrelation zero-crossing rate (AZR) VAD. The accuracy of AZR was compared to state-of-the-art and standardised VAD methods, and AZR was shown to outperform the best performing system with an average relative improvement of 24.8% in half-total error rate (HTER) on the QUT-NOISE-TIMIT database, which was created using real recordings from high-noise environments.
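The ingredients named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify how CrossCorr combines cross-correlation with the zero-crossing rate, so only a plain zero-crossing rate of the normalised autocorrelation is shown as a stand-in. All function names, the pitch-lag range, and the fusion weight are assumptions made for illustration. The HTER helper uses the usual definition, the mean of the miss rate and the false-alarm rate.

```python
import numpy as np


def normalised_autocorr(frame):
    """Normalised time-domain autocorrelation for lags 0..N-1 (r[0] == 1)."""
    x = np.asarray(frame, dtype=float)
    x = x - x.mean()                        # remove DC offset
    full = np.correlate(x, x, mode="full")  # lags -(N-1)..(N-1)
    r = full[len(x) - 1:]                   # keep non-negative lags
    if r[0] <= 0.0:                         # silent frame: avoid divide-by-zero
        return np.zeros_like(r)
    return r / r[0]


def max_peak(r, min_lag, max_lag):
    """Largest autocorrelation value inside an assumed pitch-lag range."""
    return float(np.max(r[min_lag:max_lag + 1]))


def zero_crossing_rate(r):
    """Fraction of adjacent samples of r whose signs differ."""
    signs = np.signbit(r)
    return float(np.mean(signs[1:] != signs[:-1]))


def weighted_sum_fusion(score_a, score_b, weight=0.5):
    """Weighted sum of two scores; the weight of 0.5 is an assumed default."""
    return weight * score_a + (1.0 - weight) * score_b


def hter(miss_rate, false_alarm_rate):
    """Half-total error rate: mean of miss and false-alarm rates."""
    return 0.5 * (miss_rate + false_alarm_rate)


if __name__ == "__main__":
    fs = 16000                               # assumed sampling rate in Hz
    n = np.arange(512)                       # one 32 ms frame
    voiced_like = np.sin(2 * np.pi * 200 * n / fs)
    noise = np.random.default_rng(0).standard_normal(512)

    # Assumed pitch range of 60-400 Hz, converted to lags in samples.
    min_lag, max_lag = fs // 400, fs // 60

    for name, frame in (("periodic", voiced_like), ("noise", noise)):
        r = normalised_autocorr(frame)
        peak = max_peak(r, min_lag, max_lag)
        zcr = zero_crossing_rate(r)
        # Treating (1 - zcr) as a periodicity score is an assumption.
        fused = weighted_sum_fusion(peak, 1.0 - zcr)
        print(name, round(peak, 3), round(zcr, 3), round(fused, 3))
```

In this sketch a periodic frame yields a high autocorrelation peak and few zero crossings, while a white-noise frame yields a low peak and many crossings, which is the intuition behind using both quantities as voicing evidence. A real system would tune the fusion weight and decision threshold on development data.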
Pages: 3118 - 3121
Page count: 4
Related papers
50 in total
  • [31] Artificial neural network based epileptic detection using time-domain and frequency-domain features
    Srinivasan V.
    Eswaran C.
    Sriraam A.N.
    Journal of Medical Systems, 2005, 29 (6) : 647 - 660
  • [32] The robust identification of exchange from T2-T2 time-domain features
    Song, Ruobing
    Song, Yi-Qiao
    Vembusubramanian, Muthusamy
    Paulsen, Jeffrey L.
    JOURNAL OF MAGNETIC RESONANCE, 2016, 265 : 164 - 171
  • [33] Discriminative Time-Domain Features for Activity Recognition on a Mobile Phone
    Buber, Ebubekir
    Guvensan, Amac M.
    2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [34] Noise robust voice activity detection using joint phase and magnitude based feature enhancement
    Khomdet Phapatanaburi
    Longbiao Wang
    Zeyan Oo
    Weifeng Li
    Seiichi Nakagawa
    Masahiro Iwahashi
    Journal of Ambient Intelligence and Humanized Computing, 2017, 8 : 845 - 859
  • [35] A noise-robust voice activity detection algorithm using wavelets and support vector machines
    Chen, Shi-Huang
    Chen, Shih-Hao
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 447 - 450
  • [36] Noise Robust Speech Recognition Using Parallel Model Compensation and Voice Activity Detection Methods
    Hizlisoy, Serhat
    Tufekci, Zekeriya
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [37] Noise robust voice activity detection using joint phase and magnitude based feature enhancement
    Phapatanaburi, Khomdet
    Wang, Longbiao
    Oo, Zeyan
    Li, Weifeng
    Nakagawa, Seiichi
    Iwahashi, Masahiro
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (06) : 845 - 859
  • [38] Noise Robust Voice Activity Detection Based on Switching Kalman Filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 965 - 968
  • [39] A technique for noise robust voice activity detection under uncontrolled environment
    Nagaraja, B.G.
    Thimmaraja Yadava, G.
    Kabballi, Prashanth
    Raghudathesh, G.P.
    Multimedia Tools and Applications, 2024,
  • [40] Noise robust voice activity detection based on switching Kalman filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 467 - 477