Noise-Robust Voice Activity Detector Based On Four States-Based HMM

被引:1
|
作者
Zhou, Bin [1 ]
Liu, Jing [1 ]
Pei, Zheng [1 ]
机构
[1] Xihua Univ, Ctr Radio Adm & Technol Dev, Chengdu, Peoples R China
关键词
Voice activity detection; k-means clustering; left-right hidden Markov model; low signal-to-noise ratio;
D O I
10.4028/www.scientific.net/AMM.411-414.743
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Voice activity detection (VAD) is more and more essential in the noisy environments to provide an accuracy performance in the speech recognition. In this paper, we provide a method based on left-right hidden Markov model (HMM) to identify the start and end of the speech. The method builds two models of non-speech and speech instead of existed two states, formally, each model could include several states, we also analysis other features, such as pitch index, pitch magnitude and fractal dimension of speech and non-speech.. We compare the VAD results with the proposed algorithm and two states HMM. Experiments show that the proposed method make a better performance than two state HMMs in VAD, especially in the low signal-to-noise ratio (SNR) environment.
引用
收藏
页码:743 / 748
页数:6
相关论文
共 50 条
  • [21] Noise estimation using negentropy based voice-activity detector
    Prasad, R
    Saruwatari, H
    Shikano, K
    2004 47TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, CONFERENCE PROCEEDINGS, 2004, : 149 - 152
  • [22] A noise-robust voice activity detection algorithm using wavelets and support vector machines
    Chen, Shi-Huang
    Chen, Shih-Hao
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 447 - 450
  • [23] A noise-robust estimator of volatility based on interquantile ranges
    Yeh J.-H.
    Wang J.-N.
    Kuan C.-M.
    Review of Quantitative Finance and Accounting, 2014, 43 (4) : 751 - 779
  • [24] Noise-Robust Gaussian Distribution Based Imbalanced Oversampling
    Shao, Xuetao
    Yan, Yuanting
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488 : 221 - 234
  • [25] Noise-Robust Feature Extraction Based on Forward Masking
    Chiou, Sheng-Chiuan
    Chen, Chia-Ping
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1243 - 1246
  • [26] Noise Robust Voice Activity Detection Based on Switching Kalman Filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 965 - 968
  • [27] Noise robust voice activity detection based on switching Kalman filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 467 - 477
  • [28] A noise-robust voice conversion method with controllable background sounds
    Chen, Lele
    Zhang, Xiongwei
    Li, Yihao
    Sun, Meng
    Chen, Weiwei
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3981 - 3994
  • [29] A robust polynomial regression-based voice activity detector for speaker verification
    Disken, Gokay
    Tufekci, Zekeriya
    Cevik, Ulus
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
  • [30] A robust polynomial regression-based voice activity detector for speaker verification
    Gökay Dişken
    Zekeriya Tüfekci
    Ulus Çevik
    EURASIP Journal on Audio, Speech, and Music Processing, 2017