A pre-processing method for improvement of vowel onset point detection under noisy conditions

Authors
Saha, P. [1]
Laskar, R. H. [1]
Laskar, A. [1]
Affiliations
[1] Natl Inst Technol, Speech & Image Proc Lab, Silchar 788010, Assam, India
Keywords
Vowel onset point; Excitation source; Spectral peak; Modulation spectrum; Perceptual filter; Speaker verification
DOI
10.1016/j.specom.2016.04.004
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The vowel onset point (VOP) is the instant at which the vowel region begins in a speech signal. VOPs play a vital role in speech processing applications such as syllable detection, speaker verification, duration modification, and language identification. Several algorithms exist for detecting VOPs in a speech signal. The present work uses as its baseline the algorithm that combines evidence extracted from the excitation source, spectral peaks, and modulation spectrum. The baseline system performs well on clean speech, but its performance degrades under noisy conditions, where a larger number of spurious VOPs are detected. According to the available literature, this degradation is due to spectral broadening of the speech in noisy environments. This paper proposes a pre-processing technique, applied on top of the baseline system, to reduce this spectral broadening effect of noise. Noisy speech is first passed through the pre-processing algorithm, and the pre-processed speech is then passed to the baseline system to detect the VOPs. Experiments were carried out on clean speech and on speech under different noise conditions. The results show an improvement of 16-21% in the removal of spurious VOPs over the baseline system under different noisy speech conditions. Further, the proposed method was compared with two of the best-performing VOP detection techniques and was found to give superior performance in terms of identification accuracy and identification rate. © 2016 Elsevier B.V. All rights reserved.
Pages: 71-83
Number of pages: 13
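
The abstract reports results in terms of spurious VOPs and identification rate but does not define how they are counted. The following is a minimal sketch of one plausible scoring scheme, not the authors' evaluation code. It assumes that a detected VOP counts as a hit when it lies within a tolerance window of an unmatched reference VOP, and that every unmatched detection is spurious. The function names, the 40 ms default tolerance, and the relative-reduction formula are all assumptions made for illustration.

```python
from typing import List, Tuple


def score_vops(reference: List[float],
               detected: List[float],
               tolerance: float = 0.040) -> Tuple[int, int, int]:
    """Return (hits, missed, spurious) for VOP times given in seconds.

    Assumption: each reference VOP can be matched by at most one detected
    VOP, chosen as the closest unused detection within `tolerance`.
    """
    ref = sorted(reference)
    det = sorted(detected)
    used = [False] * len(det)
    hits = 0
    for r in ref:
        best = None
        for i, d in enumerate(det):
            if used[i] or abs(d - r) > tolerance:
                continue
            if best is None or abs(d - r) < abs(det[best] - r):
                best = i
        if best is not None:
            used[best] = True
            hits += 1
    missed = len(ref) - hits        # reference VOPs with no detection nearby
    spurious = len(det) - hits      # detections that match no reference VOP
    return hits, missed, spurious


def spurious_reduction(before: int, after: int) -> float:
    """Relative reduction in spurious VOPs, in percent (assumed definition)."""
    if before == 0:
        return 0.0
    return 100.0 * (before - after) / before


if __name__ == "__main__":
    # Hypothetical VOP times, not data from the paper.
    reference = [0.50, 1.20, 2.00]
    detected = [0.52, 0.90, 1.19, 2.30]
    print(score_vops(reference, detected))
    print(spurious_reduction(25, 20))
```

Under this scheme, the same scoring would be run once on the baseline output and once on the pre-processed output for each noise condition, and the two spurious counts compared.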