Voice activity detection algorithm based on long-term pitch information

Cited by: 5
Authors
Yang, Xu-Kui [1, 2]
He, Liang [3]
Qu, Dan [1]
Zhang, Wei-Qiang [3]
Affiliations
[1] Zhengzhou Informat Sci & Technol Inst, Zhengzhou, Peoples R China
[2] State Key Lab Integrated Serv Networks, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Source
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2016
Funding
National Natural Science Foundation of China
Keywords
Voice activity detection; Non-stationary noise; Long-term pitch envelope; Long-term pitch divergence; NOISE
DOI
10.1186/s13636-016-0092-y
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
A new voice activity detection algorithm based on long-term pitch divergence is presented. The long-term pitch divergence not only decomposes speech signals with a bionic decomposition but also makes full use of long-term information. It is more discriminative than other feature sets, such as long-term spectral divergence. Experimental results show that, among the six algorithms analyzed, the proposed algorithm performs best, with the highest non-speech hit rate and a reasonably high speech hit rate.
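The abstract reports results as a non-speech hit rate and a speech hit rate but does not define them. The sketch below assumes the commonly used frame-level definitions: the fraction of reference non-speech frames labeled non-speech, and the fraction of reference speech frames labeled speech. The function name `hit_rates` and the 0/1 label encoding are illustrative assumptions, not taken from the paper.

```python
from typing import Sequence, Tuple


def hit_rates(reference: Sequence[int], decision: Sequence[int]) -> Tuple[float, float]:
    """Return (non_speech_hit_rate, speech_hit_rate) for frame-level labels.

    Assumed encoding: 1 = speech frame, 0 = non-speech frame.
    `reference` holds the ground-truth labels, `decision` the detector output.
    """
    if len(reference) != len(decision):
        raise ValueError("reference and decision must have the same length")

    speech_total = sum(1 for r in reference if r == 1)
    non_speech_total = len(reference) - speech_total
    if speech_total == 0 or non_speech_total == 0:
        raise ValueError("reference must contain both speech and non-speech frames")

    # A hit is a frame whose decision matches the reference label.
    speech_hits = sum(1 for r, d in zip(reference, decision) if r == 1 and d == 1)
    non_speech_hits = sum(1 for r, d in zip(reference, decision) if r == 0 and d == 0)

    return non_speech_hits / non_speech_total, speech_hits / speech_total


# Toy example: one non-speech frame (index 1) is wrongly marked as speech.
reference = [0, 0, 0, 1, 1, 1, 1, 0]
decision = [0, 1, 0, 1, 1, 1, 1, 0]
non_speech_hr, speech_hr = hit_rates(reference, decision)
print(non_speech_hr, speech_hr)  # 0.75 1.0
```

Reporting the two rates separately matters because a detector that labels every frame as speech reaches a perfect speech hit rate while its non-speech hit rate falls to zero.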
Pages: 9