Multi-band automatic speech recognition

被引:8
|
作者
Cerisara, C [1 ]
Fohr, D [1 ]
机构
[1] LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
来源
COMPUTER SPEECH AND LANGUAGE | 2001年 / 15卷 / 02期
关键词
D O I
10.1006/csla.2001.0163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new architecture for automatic speech recognition systems which is characterized by the division of the spectral domain of the speech signal into several independent frequency bands. This model is based on the psyche-acoustic work of Fletcher (1953) who proposed a similar principle for the human auditory system. Jont B. Alien published a paper in 1994 in which he summarized the work of Fletcher and also proposed to adapt the multi-band paradigm to automatic speech recognition (ASR) (Allen, 1994). Many researchers have then studied this principle and built such ASR systems. The goal of this paper is to analyse some of the most important issues in the design of a multi-band ASR system in order to determine which architecture it should have in which environment. Two other major problems are then considered: how to train multi-band systems and how to use them for continuous ASR. (C) 2001 Academic Press.
引用
收藏
页码:151 / 174
页数:24
相关论文
共 50 条
  • [1] Dynamic Bayesian networks for multi-band automatic speech recognition
    Daoudi, K
    Fohr, D
    Antoine, C
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3): : 263 - 285
  • [2] Asynchrony in multi-band speech recognition
    Cerisara, C
    Fohr, D
    Haton, JP
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1121 - 1124
  • [3] A recombination model for multi-band speech recognition
    Cerisara, C
    Haton, JP
    Mari, JF
    Fohr, D
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 717 - 720
  • [4] Multi-band speech recognition in noisy environments
    Okawa, S
    Bocchieri, E
    Potamianos, A
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 641 - 644
  • [5] Continuous multi-band speech recognition using Bayesian networks
    Daoudi, K
    Fohr, D
    Antoine, C
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 41 - 44
  • [6] Robust Speech Recognition Based on Multi-band Spectral Subtraction
    Wan, Yi-Long
    Zhang, Tian-Qi
    Wang, Zhi-Chao
    Jin, Jing
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 36 - 40
  • [7] An efficient multi-band spectral subtraction method for robust speech recognition
    Safayani, M.
    Sameti, H.
    Babaali, B.
    Shalmani, M. T. Manzuri
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 748 - 751
  • [8] A Weight Estimation Method Using LDA for Multi-Band Speech Recognition
    Iwano, Koji
    Kojima, Kaname
    Furui, Sadaoki
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2534 - 2537
  • [9] Optimized Spectral Floor in Multi-band Spectral Subtraction for Dysarthric Speech Recognition
    Chadha, Ankita N.
    Zaveri, Mukesh A.
    Sarvaiya, Jignesh N.
    2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 245 - 250
  • [10] Examining the combination of multi-band processing and channel dropout for robust speech recognition
    Kovacs, Gyorgy
    Toth, Laszlo
    Van Compernolle, Dirk
    Liwicki, Marcus
    INTERSPEECH 2019, 2019, : 421 - 425