Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition

Authors
Takeda, Ryu [1 ]
Nakadai, Kazuhiro [2 ]
Komatani, Kazunori [1 ]
Ogata, Tetsuya [1 ]
Okuno, Hiroshi G. [1 ]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[2] Honda Res Inst, Wako, Saitama 3510114, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper describes a new semi-blind source separation (semi-BSS) technique based on independent component analysis (ICA) that enhances a target source of interest and suppresses other known interference sources. Semi-BSS is necessary for double-talk-free robot audition systems, which can obtain known sound source signals such as the robot's own speech, music, or TV sound through a line-in or a ubiquitous network. Unlike conventional semi-BSS with ICA, we use a time-frequency domain convolution model to describe sound reflections, which yields a new mixing process for ICA. In other words, we treat sounds reflected within a given delay as different from the original, and ICA then separates these reflections as additional interference sources. The model removes the frame-size limitation of frequency-domain ICA, so ICA can separate the known sources even in a highly reverberant environment. Experimental results show that our method outperforms conventional semi-BSS using ICA in simulated normal and highly reverberant environments.
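The time-frequency convolution idea in the abstract can be illustrated with a minimal sketch. It models one frequency bin of the observation as the target plus a short sequence of delayed frames of the known source, then removes the known source. The tap estimate here uses ordinary least squares as a simple stand-in and is not the ICA update used in the paper; all names (`delayed_frames`, `cancel_known_source`, `num_taps`) and the synthetic data are assumptions made for illustration.

```python
import numpy as np

def delayed_frames(s, num_taps):
    """Stack s[f], s[f-1], ..., s[f-num_taps+1] as columns (zeros before frame 0)."""
    n = len(s)
    cols = np.zeros((n, num_taps), dtype=complex)
    for m in range(num_taps):
        cols[m:, m] = s[: n - m]
    return cols

def cancel_known_source(x, s, num_taps):
    """Estimate per-bin taps by least squares (a stand-in for ICA) and subtract."""
    S = delayed_frames(s, num_taps)
    h, *_ = np.linalg.lstsq(S, x, rcond=None)
    return x - S @ h, h

# Hypothetical data for a single frequency bin of an STFT.
rng = np.random.default_rng(0)
num_frames, num_taps = 400, 4
known = rng.standard_normal(num_frames) + 1j * rng.standard_normal(num_frames)
target = 0.5 * (rng.standard_normal(num_frames) + 1j * rng.standard_normal(num_frames))
true_h = np.array([1.0, 0.6j, -0.3, 0.1 + 0.1j])  # direct path plus delayed reflections

# Observation: target plus the known source convolved across frames.
observed = target + delayed_frames(known, num_taps) @ true_h

enhanced, est_h = cancel_known_source(observed, known, num_taps)
residual_before = np.mean(np.abs(observed - target) ** 2)
residual_after = np.mean(np.abs(enhanced - target) ** 2)
```

Because each delayed frame of the known source is treated as its own interference term, the number of taps, rather than the STFT frame length, determines how much reverberation the model can cover.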
Pages: 1763 / +
Page count: 2
Related papers
50 in total
  • [31] Adaptive ICA for separation of complex signals with known source distributions in time-varying channels
    Ranganathan, R.
    Yang, T. T.
    Mikhael, W. B.
    ELECTRONICS LETTERS, 2007, 43 (15) : 838 - 840
  • [32] Integration of Sound Source Localization and Separation to Improve Dialogue Management on a Robot
    Frechette, Maxime
    Letourneau, Dominic
    Valin, Jean-Marc
    Michaud, Francois
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 2358 - 2363
  • [33] Pitch-Cluster-Map Based Daily Sound Recognition for Mobile Robot Audition
    Sasaki, Yoko
    Kaneyoshi, Masahito
    Kagami, Satoshi
    Mizoguchi, Hiroshi
    Enomoto, Tadashi
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2010, 22 (03) : 402 - 410
  • [34] ICA Based Informed Source Separation for Digitally Watermarked Audio Signals
    Sharanya, R.
    Sugumar, D.
    Sujithra, T. L.
    Bose, Susan Mary
    Koshy, Divya Mary
    INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 477 - 480
  • [35] Blind separation of speech signals based on a lattice-ICA geometric procedure
    Rodríguez-Alvarez, M
    Rojas, F
    Puntonet, CG
    Mansour, A
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 424 - 428
  • [36] Blind source separation for moving speech signals using blockwise ICA and residual crosstalk subtraction
    Mukai, R
    Sawada, H
    Araki, S
    Makino, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2004, E87A (08) : 1941 - 1948
  • [37] Fast-convergence algorithm for ICA-based blind source separation using array signal processing
    Saruwatari, H
    Kawamura, T
    Shikano, K
    PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2001, : 91 - 94
  • [38] Enhanced robot speech recognition based on microphone array source separation and missing feature theory
    Yamamoto, S
    Valin, JM
    Nakadai, K
    Rouat, J
    Michaud, F
    Ogata, T
    Okuno, HG
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 1477 - 1482
  • [39] Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization
    Davila-Chacon, Jorge
    Liu, Jindong
    Wermter, Stefan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (01) : 138 - 150
  • [40] Fast-convergence algorithm for ICA-based blind source separation using array signal processing
    Saruwatari, H
    Kawamura, T
    Shikano, K
    2001 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING PROCEEDINGS, 2001, : 464 - 467