Voice activity detection algorithm based on long-term pitch information

被引:5
|
作者
Yang, Xu-Kui [1 ,2 ]
He, Liang [3 ]
Qu, Dan [1 ]
Zhang, Wei-Qiang [3 ]
机构
[1] Zhengzhou Informat Sci & Technol Inst, Zhengzhou, Peoples R China
[2] State Key Lab Integrated Serv Networks, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Voice activity detection; Non-stationary noise; Long-term pitch envelop; Long-term pitch divergence; NOISE;
D O I
10.1186/s13636-016-0092-y
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new voice activity detection algorithm based on long-term pitch divergence is presented. The long-term pitch divergence not only decomposes speech signals with a bionic decomposition but also makes full use of long-term information. It is more discriminative comparing with other feature sets, such as long-term spectral divergence. Experimental results show that among six analyzed algorithms, the proposed algorithm is the best one with the highest non-speech hit rate and a reasonably high speech hit rate.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Voice activity detection algorithm based on long-term pitch information
    Xu-Kui Yang
    Liang He
    Dan Qu
    Wei-Qiang Zhang
    EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [2] Adaptive Voice Activity Detection Based on Long-Term Information
    Yang X.-K.
    Qu D.
    Zhang W.-L.
    Yan H.-G.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2018, 46 (04): : 878 - 885
  • [3] Long-term speech information based threshold for voice activity detection in massive microphone network
    Zhu, Mengyao
    Wu, Xiukun
    Lu, Zhihua
    Wang, Tao
    Zhu, Xiaoqiang
    DIGITAL SIGNAL PROCESSING, 2019, 94 : 156 - 164
  • [4] Efficient voice activity detection algorithms using long-term speech information
    Ramírez, J
    Segura, JC
    Benítez, C
    de la Torre, A
    Rubio, A
    SPEECH COMMUNICATION, 2004, 42 (3-4) : 271 - 287
  • [5] Efficient voice activity detection algorithm using long-term spectral flatness measure
    Yanna Ma
    Akinori Nishihara
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [6] Efficient voice activity detection algorithm using long-term spectral flatness measure
    Ma, Yanna
    Nishihara, Akinori
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [7] Erratum to: Efficient voice activity detection algorithm using long-term spectral flatness measure
    Yanna Ma
    Akinori Nishihara
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [8] Robust Voice Activity Detection Using Long-Term Signal Variability
    Ghosh, Prasanta Kumar
    Tsiartas, Andreas
    Narayanan, Shrikanth
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 600 - 613
  • [9] Voice activity detection with noise reduction and long-term spectral divergence estimation
    Ramírez, J
    Segura, JC
    Benítez, C
    de la Torre, A
    Rubio, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING SIGNAL PROCESSING THEORY AND METHODS, 2004, : 1093 - 1096
  • [10] LONG-TERM AUTO-CORRELATION STATISTICS BASED VOICE ACTIVITY DETECTION FOR STRONG NOISY SPEECH
    Shi, Wei
    Zou, Yuexian
    Liu, Yi
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 100 - 104