Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments

被引:0
|
作者
Miki, K
Nishiura, T
Nakamura, S
Shikano, K
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
[2] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
关键词
hands-free; microphone array; HMM decomposition and composition; noisy and echo environment; speech recognition;
D O I
10.1002/ecjb.10068
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Handling background noise or echo (reverberation) etc. is very important for having an automated robot etc. recognize remote speech in a real environment. As effective schemes for handling this problem, noise reducing schemes such as model adaptation schemes including HMM decomposition and composition or microphone array (beam-former) signal processing, spectral subtraction, etc. have been proposed. In particular, a model adaptation scheme is very effective for speech recognition in a noisy environment and its recognition performance increases in proportion to the signal-to-noise ratio (SNR). In this paper, improving the recognition performance in a low-SNR environment by receiving speech at a high SNR using a: microphone array before HMM decomposition and composition is attempted. The results of speech recognition experiments conducted in a noisy environment in an acoustic laboratory show an improvement in the recognition rate of about 25% by the proposed method for the case in which the SNR in a single microphone is 0 dB, As compared with the cases of using microphone array signal processing, HMM decomposition and composition. alone. In addition, the proposed method shows recognition performance comparable to the case of using cepstrum mean normalization and spectral subtraction performed with an optimal coefficient given to the speech after microphone array processing. (C) 2002 Wiley Periodicals, Inc.
引用
收藏
页码:13 / 22
页数:10
相关论文
共 50 条
  • [41] IMPROVED SPEECH RECOGNITION IN NOISY ENVIRONMENTS BY USING A THROAT MICROPHONE FOR ACCURATE VOICING DETECTION
    Dekens, Tomas
    Verhelst, Werner
    Capman, Francois
    Beaugendre, Frederic
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1978 - 1982
  • [42] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [43] A Predefined Command Recognition System Using a Ceiling Microphone Array in Noisy Housing Environments
    Sasaki, Yoko
    Kagami, Satoshi
    Mizoguchi, Hiroshi
    Enomoto, Tadashi
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 2178 - +
  • [44] Estimation of speech recognition performance in noisy and reverberant environments using PESQ score and acoustic parameters
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [45] Strategies for distant speech recognition in reverberant environments
    Delcroix, Marc
    Yoshioka, Takuya
    Ogawa, Atsunori
    Kubo, Yotaro
    Fujimoto, Masakiyo
    Ito, Nobutaka
    Kinoshita, Keisuke
    Espi, Miquel
    Araki, Shoko
    Hori, Takaaki
    Nakatani, Tomohiro
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [46] Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions
    Suman, Maloji
    Khan, Habibulla
    Latha, M. Madhavi
    Kumari, Devarakonda Aruna
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 379 - +
  • [47] Survey on Approaches to Speech Recognition in Reverberant Environments
    Yoshioka, Takuya
    Sehr, Armin
    Delcroix, Marc
    Kinoshita, Keisuke
    Maas, Roland
    Nakatani, Tomohiro
    Kellermann, Walter
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [48] Distributed Acoustic Source Tracking in Noisy and Reverberant Environments With Distributed Microphone Networks
    Zhang, Qiaoling
    Zhang, Weiwei
    Feng, Jie
    Tang, Roubing
    IEEE ACCESS, 2020, 8 : 9913 - 9927
  • [49] Beamforming microphone arrays for speech acquisition in noisy environments
    Fischer, S
    Simmer, KU
    SPEECH COMMUNICATION, 1996, 20 (3-4) : 215 - 227
  • [50] Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments
    Bae, Ara
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 51 - 55