Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency

Authors
Chuangsuwanich, Ekapol [1 ]
Glass, James [1 ]
Affiliation
[1] MIT Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
Keywords
voice activity detection; modulation frequency; harmonicity; human-robot interaction; speech
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The task of robustly detecting distant speech in low-SNR environments for automatic speech recognition (ASR) is examined using a two-stage approach based on two distinguishing features of speech: harmonicity and modulation frequency (MF). A modified harmonicity metric is used as a gating function for a set of parallel classifiers that incorporate MFs computed on different frequency bands. Performance is evaluated both on frame-level discriminative power and on system-level ASR results for a real-world robotic forklift task. Compared with previously proposed features such as relative spectral entropy, and with classification strategies involving MFs, the combined approach generalizes well across different kinds of dynamic noise conditions and significantly reduces the false alarm rate at low speech miss rate settings. The overall ASR results also improve significantly over the ETSI AMR-VAD2, and the number of false alarms is reduced by a factor of two.
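The two-stage structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the modified harmonicity metric, the frequency bands, or the classifiers. As loudly labeled assumptions, the sketch below uses the peak of the normalized autocorrelation as the harmonicity measure, a single full-band energy envelope instead of multiple bands, and one threshold on the 2-16 Hz modulation energy ratio in place of the parallel classifiers. All function names, parameters, and thresholds are hypothetical.

```python
import numpy as np

def harmonicity(frame, sr, fmin=80.0, fmax=400.0):
    """Assumed stand-in metric: peak of the normalized autocorrelation
    within a plausible pitch-lag range (not the paper's modified metric)."""
    frame = frame - frame.mean()
    energy = np.dot(frame, frame)
    if energy == 0.0:
        return 0.0
    # Keep non-negative lags only; ac[0] equals the frame energy.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo = int(sr / fmax)
    hi = min(int(sr / fmin), len(frame) - 1)
    return float(ac[lo:hi + 1].max() / energy)

def modulation_ratio(envelope, frame_rate, band=(2.0, 16.0)):
    """Fraction of the non-DC envelope spectrum energy inside a modulation band.
    The 2-16 Hz band is an illustrative assumption."""
    env = envelope - envelope.mean()
    spec = np.abs(np.fft.rfft(env)) ** 2
    freqs = np.fft.rfftfreq(len(env), d=1.0 / frame_rate)
    total = spec[1:].sum()
    if total == 0.0:
        return 0.0
    mask = (freqs >= band[0]) & (freqs <= band[1])
    return float(spec[mask].sum() / total)

def two_stage_vad(signal, sr, frame_len=512, hop=160, h_thresh=0.5, m_thresh=0.5):
    """Stage 1: per-frame harmonicity gate.
    Stage 2: a single modulation-ratio threshold over the whole segment,
    standing in for the parallel MF classifiers mentioned in the abstract."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = [signal[i * hop:i * hop + frame_len] for i in range(n_frames)]
    gate = np.array([harmonicity(f, sr) > h_thresh for f in frames])
    envelope = np.array([np.sqrt(np.mean(f ** 2)) for f in frames])
    speech_like = modulation_ratio(envelope, sr / hop) > m_thresh
    return gate & speech_like
```

With these assumed settings, a 200 Hz tone amplitude-modulated at 4 Hz passes both stages, while white noise is rejected by the harmonicity gate before the modulation stage is consulted. The thresholds would need tuning on real data.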
Pages: 2656-2659
Number of pages: 4