A robust speech recognition system for communication robots in noisy environments

被引:18
|
作者
Ishi, Carlos Toshinori [1 ]
Matsuda, Shigeki [2 ,3 ]
Kanda, Takayuki [1 ]
Jitsuhiro, Takatoshi [3 ]
Ishiguro, Hiroshi [1 ,4 ]
Nakamura, Satoshi [2 ,3 ]
Hagita, Norihiro [1 ]
机构
[1] Adv Telecommun Res Inst Int, Intelligent Robot & Commun Labs, Kyoto 6190288, Japan
[2] Natl Inst Informat & Commun Technol, Koganei, Tokyo 1848795, Japan
[3] Adv Telecommun Res Inst Int, Spoken Language Commun Res Labs, Kyoto 6190288, Japan
[4] Osaka Univ, Dept Adapt Machine Syst, Suita, Osaka 5650871, Japan
关键词
acoustic noise; children speech; communication robots; robustness; speech recognition;
D O I
10.1109/TRO.2008.919305
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The application range of communication robots could be widely expanded by the use of automatic speech recognition (ASR) systems with improved robustness for noise and for speakers of different ages. In past researches, several modules have been proposed and evaluated for improving the robustness of ASR systems in noisy environments. However, this performance might be degraded when applied to robots, due to problems caused by distant speech and the robot's own noise. In this paper, we implemented the individual modules in a humanoid robot, and evaluated the ASR performance in a real-world noisy environment for adults' and children's speech. The performance of each module was verified by adding different levels of real environment noise recorded in a cafeteria. Experimental results indicated that our ASR system could achieve over 80% word accuracy in 70-dBA noise. Further evaluation of adult speech recorded in a real noisy environment resulted in 73% word accuracy.
引用
收藏
页码:759 / 763
页数:5
相关论文
共 50 条
  • [1] Robust speech recognition system for communication robots in real environments
    Ishi, Carlos Toshinori
    Matsuda, Shigeki
    Kanda, Takayuki
    Jitsuhiro, Takatoshi
    Ishiguro, Hiroshi
    Nakamura, Satoshi
    Hagita, Norihiro
    2006 6TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, VOLS 1 AND 2, 2006, : 340 - +
  • [2] Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
    Kingsbury, B
    Saon, G
    Mangu, L
    Padmanabhan, M
    Sarikaya, R
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 53 - 56
  • [3] Linearized distortion model for robust speech recognition in noisy environments
    He, Yong-Jun
    Han, Ji-Qing
    Tongxin Xuebao/Journal on Communications, 2010, 31 (09): : 8 - 14
  • [4] A robust feature extraction for automatic speech recognition in noisy environments
    Lima, C
    Almeida, LB
    Monteiro, JL
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 540 - 543
  • [5] Robust Feature Extraction Methods for Speech Recognition in Noisy Environments
    Mukheolkar, Ajinkya Sunil
    Alex, John Sahaya Rani
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 295 - 299
  • [6] A robust endpoint detection of speech for noisy environments with application to automatic speech recognition
    Bou-Ghazale, SE
    Assaleh, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3808 - 3811
  • [7] Auditory model for robust speech recognition in real world noisy environments
    Kim, DS
    Lee, SY
    Kil, RM
    Zhu, XL
    ELECTRONICS LETTERS, 1997, 33 (01) : 12 - 13
  • [8] Blind source extraction for robust speech recognition in multisource noisy environments
    Nesta, Francesco
    Matassoni, Marco
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 703 - 725
  • [9] ROBUST SPEECH RECOGNITION UNDER NOISY ENVIRONMENTS USING ASYMMETRIC TAPERS
    Alam, Md Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1638 - 1642
  • [10] Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments
    Kim, Hyun-Don
    Kim, Jinsung
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    ADVANCED ROBOTICS, 2009, 23 (15) : 2093 - 2111