An application of discriminative feature extraction lo filter-bank-based speech recognition

被引:55
|
作者
Biem, A [1 ]
Katagiri, S [1 ]
McDermott, E [1 ]
Juang, BH [1 ]
机构
[1] ATR, Human Informat Proc Res Labs, Kyoto 61902, Japan
来源
关键词
feature extraction; filter-bank; generalized probabilistic descent; minimum classification error; pattern recognition; speech recognition;
D O I
10.1109/89.902277
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A pattern recognizer is usually a modular system which consists of a feature extractor module and a classifier module. Traditionally, these two modules have been designed separately, which may not result in an optimal recognition accuracy. To alleviate this fundamental problem, the authors have developed a design method, named Discriminative Feature Extraction (DFE), that enables one to design the overall recognizer, i.e., both the feature extractor and the classifier, in a manner consistent with the objective of minimizing recognition errors. This paper investigates the application of this method to designing a speech recognizer that consists of a filter-bank feature extractor and a multi-prototype distance classifier. Carefully investigated experiments demonstrate that DFE achieves the design of a better recognizer and provides an innovative recognition-oriented analysis of the filter-bank, as an alternative to conventional analysis based on psychoacoustic expertise or heuristics.
引用
收藏
页码:96 / 110
页数:15
相关论文
共 50 条
  • [21] Adaptive Wavelet Packet Filter-Bank Based Acoustic Feature for Speech Emotion Recognition
    Li, Yue
    Zhang, Guobao
    Huang, Yongming
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 359 - 366
  • [22] The application of the additive model in the feature extraction of speech recognition
    Xi, WB
    Fang, L
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 753 - 756
  • [23] Iris recognition with tunable filter bank based feature
    Barpanda, Soubhagya Sankar
    Sa, Pankaj K.
    Marques, Oge
    Majhi, Banshidhar
    Bakshi, Sambit
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (06) : 7637 - 7674
  • [24] Iris recognition with tunable filter bank based feature
    Soubhagya Sankar Barpanda
    Pankaj K. Sa
    Oge Marques
    Banshidhar Majhi
    Sambit Bakshi
    Multimedia Tools and Applications, 2018, 77 : 7637 - 7674
  • [25] Cepstrum-based filter-bank design using discriminative feature extraction training at various levels
    Biem, A
    Katagiri, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1503 - 1506
  • [26] Speech Emotion Recognition with Discriminative Feature Learning
    Zhou, Huan
    Liu, Kai
    INTERSPEECH 2020, 2020, : 4094 - 4097
  • [27] Discriminative Feature Learning for Speech Emotion Recognition
    Zhang, Yuying
    Zou, Yuexian
    Peng, Junyi
    Luo, Danqing
    Huang, Dongyan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 198 - 210
  • [28] Generalized Discriminative Feature Transformation for Speech Recognition
    Hsiao, Roger
    Schultz, Tanja
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 672 - 675
  • [29] FEATURE-EXTRACTION USING A MATRIX COEFFICIENT FILTER FOR SPEECH RECOGNITION
    KATAGISHI, K
    SINGER, H
    AIKAWA, K
    SAGAYAMA, S
    SPEECH COMMUNICATION, 1993, 13 (3-4) : 297 - 306
  • [30] Particle Filter Based Object Tracking with Discriminative Feature Extraction and Fusion
    Shen, Yao
    Guturu, Parthasarathy
    Damarla, Thyagaraju
    Buckles, Bill P.
    ADVANCES IN VISUAL COMPUTING, PT II, PROCEEDINGS, 2008, 5359 : 246 - +