Distinctive phonetic feature extraction for robust speech recognition

被引:0
|
作者
Fukuda, T [1 ]
Yamamoto, W [1 ]
Nitta, T [1 ]
机构
[1] Toyohashi Univ Technol, Grad Sch Engn, Tempa Ku, Toyohashi, Aichi, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes an attempt to extract distinctive phonetic features (DPFs) that represent articulatory gestures in linguistic theory by using a multi-layer neural network (MLN) and to apply the DPFs to noise-robust speech recognition. In the DPF extraction stage, after converting a speech signal to acoustic features composed of local features (LFs), an MLN with 33 output units corresponding to context-dependent DPFs of 11 DPFs, 11 preceding context DPFs, and 11 following context DPFs maps the Us to DPFs. The proposed DPF parameters without MFCC were firstly evaluated in comparison with a standard parameter set of MFCC and dynamic features on a word recognition task using clean speech and the result showed the same performance as that of the standard set. Noise robustness of these parameters was then tested with four types of additive noise and the proposed DPF parameters outperformed the standard set except one additive noise type.
引用
收藏
页码:25 / 28
页数:4
相关论文
共 50 条
  • [1] Orthogonalized distinctive phonetic feature extraction for noise-robust automatic speech recognition
    Fukuda, T
    Nitta, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1110 - 1118
  • [2] Canonicalization of feature parameters for robust speech recognition based on distinctive phonetic feature (DPF) vectors
    Huda, Mohammad Nurul
    Ghulam, Muhammad
    Fukuda, Takashi
    Katsurada, Kouichi
    Nitta, Tsuneo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 488 - 498
  • [3] Canonicalization of feature parameters for robust speech recognition based on distinctive phonetic feature (DPF) vectors
    Nurul Huda, Mohammad
    Ghulam, Muhammad
    Fukuda, Takashi
    Katsurada, Kouichi
    Nitta, Tsuneo
    IEICE Transactions on Information and Systems, 2008, E91-D (03) : 488 - 498
  • [4] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [5] Geometrical feature extraction for robust speech recognition
    Li, Xiaokun
    Kwan, Chiman
    2005 39TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2005, : 558 - 562
  • [6] Perceptual speech processing and phonetic feature mapping for robust vowel recognition
    Bu, LK
    Chiueh, TD
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 105 - 114
  • [7] MVDR based feature extraction for robust speech recognition
    Dharanipragada, S
    Rao, BD
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 309 - 312
  • [8] Modified feature extraction methods in robust speech recognition
    Rajnoha, Josef
    Pollak, Petr
    2007 17TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, VOLS 1 AND 2, 2007, : 337 - +
  • [9] Discriminative temporal feature extraction for robust speech recognition
    Shen, JL
    ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600
  • [10] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91