Spectrum-entropy based beam-former with speaker tracking for hands-free continuous speech recognition in noise

被引:0
|
作者
George, N [1 ]
Evangelos, D [1 ]
机构
[1] Univ Patras, Dept Elect & Comp Engn, Patras 26500, Greece
关键词
D O I
10.1109/ICDSP.2002.1027881
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In hands-free speech recognition of moving speakers, the time interval where the source position can be assumed stationary varies. It is very common for the speaker, to move rapidly within the data window exploited. In such cases the conventional fixed-window direction of arrival (DOA) estimation may lead to poor tracking performance. In this paper we present a novel speech beam-former for moving speakers in noisy environments. The localization algorithm extracts a set of candidate DOA of the signal sources using array signal processing methods in the frequency domain. A minimum variance (MV) beam-former identifies the speech signal DOA in the direction where the signal's spectrum entropy is minimized. The same localization algorithm is used to detect the closest direction to the initial estimation using a smaller window. The proposed method is evaluated using a phoneme recognition system and noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25-0 dB SNR, show significant improvement in the recognition rate of moving speakers especially in very low SNRs.
引用
收藏
页码:251 / 254
页数:4
相关论文
共 23 条
  • [1] Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy
    George, N
    Evangelos, D
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 889 - 892
  • [2] Speaker tracking for hands-free continuous speech recognition in noise based on a spectrum-entropy beamforming method
    Nokas, George
    Dermatas, Evangelos
    IEICE Transactions on Information and Systems, 2003, E86-D (04) : 755 - 758
  • [3] Speaker tracking for hands-free continuous speech recognition in noise based on a spectrum-entropy beamforming method
    Nokas, G
    Dermatas, E
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (04): : 755 - 758
  • [4] Continuous speech recognition in noise using a spectrum-entropy beam-former
    Department of Electrical and Computer Engineering, University of Patras, Patras 26500, Hellas, Greece
    Int J Rob Autom, 2007, 2 (103-110):
  • [5] Continuous speech recognition in noise using a spectrum-entropy beam-former
    Nokas, G.
    Dermatas, E.
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2007, 22 (02): : 103 - 111
  • [6] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y.
    Godfrey, John J.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 297 - 300
  • [7] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y
    Godfrey, JJ
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 297 - 300
  • [8] Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition
    Ohashi, Y
    Nishikawa, T
    Saruwatari, H
    Lee, A
    Shikano, K
    2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 533 - 537
  • [9] Frame-synchronous noise compensation for hands-free speech recognition in car environments
    Chien, JT
    Lin, MS
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (06): : 508 - 515
  • [10] Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface
    Even, Jani
    Ishi, Carlos
    Saruwatari, Hiroshi
    Hagita, Norihiro
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 977 - 980