WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION

被引:21
|
作者
Farooq, O. [1 ]
Datta, S. [2 ]
Shrotriya, M. C. [1 ]
机构
[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India
[2] Loughborough Univ Technol, Dept Elect Engn, Loughborough LE11 3TU, Leics, England
关键词
Feature extraction; Hindi speech; phoneme recognition; wavelet transform; SPEECH; SYSTEM;
D O I
10.1142/S0219691310003845
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes the use of wavelet transform-based feature extraction technique for Hindi speech recognition application. The new proposed features take into account temporal as well as frequency band energy variations for the task of Hindi phoneme recognition. The recognition performance achieved by the proposed features is compared with the standard MFCC and 24-band admissible wavelet packet-based features using a linear discriminant function based classifier. To evaluate robustness of these features, the NOISEX database is used to add different types of noise into phonemes to achieve signal-to-noise ratios in the range of 20 dB to -5 dB. The recognition results show that under noisy background the proposed technique always achieves a better performance over MFCC-based features.
引用
收藏
页码:847 / 859
页数:13
相关论文
共 50 条
  • [21] Mel Sub-Band Filtering and Compression for Robust Speech Recognition
    Nasersharif, Babak
    Akbari, Ahmad
    Homayounpour, Mohammad Mehdi
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 105 - +
  • [22] Sub-band speech recognition
    Primor, D
    Furst-Yust, M
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 10 - 12
  • [23] Robust speaker recognition based on filtering in autocorrelation domain and sub-band feature recombination
    Kim, Sungtak
    Ji, Miyoung
    Kim, Hoirin
    PATTERN RECOGNITION LETTERS, 2010, 31 (07) : 593 - 599
  • [24] Sub-Band Based Attention for Robust Polyp Segmentation
    Fang, Xianyong
    Shi, Yuqing
    Guo, Qingqing
    Wang, Linbo
    Liu, Zhengyi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 736 - 744
  • [25] Sub-band weighted projection measure for sub-band speech recognition in noise
    Nasersharif, B.
    Akbari, A.
    ELECTRONICS LETTERS, 2006, 42 (14) : 829 - 831
  • [26] Novel Gammatone Filterbank Based Spectro-Temporal Features for Robust Phoneme Recognition
    Nagpal, Ankit
    Patil, Hemant A.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 342 - 350
  • [27] Modeling sub-band correlation for noise-robust speech recognition
    McAuley, J
    Ming, J
    Hanna, P
    Stewart, D
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 1017 - 1020
  • [28] Phone recognition in critical bands using sub-band temporal modulations
    Li, Feipeng
    Mallidi, Sri Harish
    Hermansky, Hynek
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1814 - 1817
  • [29] Robust speech detection based on phoneme recognition features
    Mihelic, France
    Zibert, Janez
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 455 - 462
  • [30] Recognition of Cough Using Features Improved by Sub-band Energy Transformation
    Zhu, Chunmei
    Tian, Lianfang
    Li, Xiangyang
    Mo, Hongqiang
    Zheng, Zeguang
    PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 251 - 255