Group Delay based Methods for Detection and Recognition of Whispered Speech

被引:0
|
作者
Vedvyasan, Kishore [1 ]
Nathwani, Karan [1 ]
Hegde, Rajesh M. [1 ]
机构
[1] IIT Kanpur, Dept Elect Engn, Kanpur, India
关键词
Group Delay Function; MVDR; NGD; LP and Whisper Detection; FEATURES;
D O I
10.1109/ICPR56361.2022.9956404
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present study demonstrates the effectiveness of the group delay function for detection and recognition of whispered speech. The group delay function in its spectral form is able to differentiate phonated from the whispered speech. In particular, the mean height-bandwidth product of the formants in the lower and mid frequency regions of the short term spectrum of speech is used herein for whisper speech detection. The mean height-bandwidth vector is obtained herein across all frames and is further smoothed using a moving average filter. The smoothed temporal version on this vector is able to detect the phonated to whisper change points. Towards this end, the current study investigates the cepstral, linear prediction (LP), minimum variance distortionless response (MVDR) and numerator of the group delay based smoothing techniques for whisper change point detection. Experiments on whispered speech detection are performed on the CHAINS database. Experimental results are compared to various methods for whispered speech detection available in literature.
引用
收藏
页码:499 / 505
页数:7
相关论文
共 50 条
  • [1] Group Delay Based Methods for Recognition of Distant talking Speech
    Mandala, Rohan
    Shukla, Mrityunjaya
    Hegde, Rajesh
    2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010, : 1702 - 1706
  • [2] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
    Mathur, Arpit
    Reddy, Shankar M.
    Hegde, Rajesh M.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [3] Significance of parametric spectral ratio methods in detection and recognition of whispered speech
    Arpit Mathur
    Shankar M Reddy
    Rajesh M Hegde
    EURASIP Journal on Advances in Signal Processing, 2012
  • [4] Whispered Speech Detection Using Fusion of Group-Delay-Based Subband Modulation Spectrum and Correntropy Features
    Wang, Jinfang
    Shang, Yongqiang
    Jiang, Shuangshuang
    Gowda, Dhananjaya
    Lv, Ke
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1042 - 1046
  • [5] HTK-Based Recognition of Whispered Speech
    Galic, Jovan
    Jovicic, Slobodan T.
    Grozdic, Dorde
    Markovic, Branko
    SPEECH AND COMPUTER, 2014, 8773 : 251 - 258
  • [6] Emotion Recognition and Spoof Detection from Whispered Speech
    Sivan, Dawn
    Gopakumar, C.
    2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 1091 - 1095
  • [7] Analysis and recognition of whispered speech
    Ito, T
    Takeda, K
    Itakura, F
    SPEECH COMMUNICATION, 2005, 45 (02) : 139 - 152
  • [8] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 389 - 392
  • [9] Study on the Emotion Recognition of Whispered Speech
    Jin, Yun
    Zhao, Yan
    Huang, Chengwei
    Zhao, Li
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 242 - 246
  • [10] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 429 - 432