Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition

Cited by: 0

Authors:

Takeda, Ryu [1]

Nakadai, Kazuhiro [2]

Komatani, Kazunori [1]

Ogata, Tetsuya [1]

Okuno, Hiroshi G. [1]

Affiliations:

[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan

[2] Honda Res Inst, Wako, Saitama 3510114, Japan

Source:

2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9 | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper describes a new semi-blind source separation (semi-BSS) technique based on independent component analysis (ICA) that enhances a target source of interest and suppresses known interference sources. Semi-BSS is necessary for double-talk-free robot audition systems because it lets them exploit known sound source signals, such as the robot's own speech, music, or TV sound, obtained through a line-in or a ubiquitous network. Unlike conventional semi-BSS with ICA, we use a time-frequency domain convolution model to describe sound reflections, which gives a new mixing process for ICA. In other words, we treat sounds reflected after some delay as different from the original sound, and ICA then separates these reflections as additional interference sources. The model removes the frame size limitation of frequency-domain ICA, so ICA can separate the known sources even in a highly reverberant environment. Experimental results show that our method outperformed conventional semi-BSS using ICA in simulated normal and highly reverberant environments.
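The central idea of the abstract, treating delayed reflections of a known source as extra interference terms across several time-frequency frames, can be illustrated with a minimal sketch. It is not the authors' method: the ICA-based update is swapped for a plain normalized LMS (NLMS) adaptive filter, only a single frequency bin is simulated, and every name and value (`simulate_bin`, `cancel_known_source`, the tap values, the step size) is a hypothetical choice made for illustration.

```python
import random


def simulate_bin(num_frames, taps, seed=0):
    """Simulate one frequency bin: a weak target plus a reverberated known source.

    taps[m] is the (assumed) transfer coefficient applied to the known source
    delayed by m frames, i.e. a convolution along the frame axis.
    """
    rng = random.Random(seed)

    def cgauss(scale):
        return complex(rng.gauss(0.0, scale), rng.gauss(0.0, scale))

    known = [cgauss(1.0) for _ in range(num_frames)]
    target = [cgauss(0.1) for _ in range(num_frames)]
    observed = []
    for f in range(num_frames):
        echo = sum(taps[m] * known[f - m] for m in range(len(taps)) if f - m >= 0)
        observed.append(target[f] + echo)
    return known, target, observed


def cancel_known_source(observed, known, num_taps, step=0.5, eps=1e-8):
    """Suppress the known source with a multi-frame NLMS filter (not ICA)."""
    weights = [0j] * num_taps
    enhanced = []
    for f, x in enumerate(observed):
        # Current and delayed frames of the known source for this bin.
        past = [known[f - m] if f - m >= 0 else 0j for m in range(num_taps)]
        estimate = sum(w * s for w, s in zip(weights, past))
        err = x - estimate  # what remains after removing the estimated echo
        norm = sum(abs(s) ** 2 for s in past) + eps
        weights = [w + step * err * s.conjugate() / norm
                   for w, s in zip(weights, past)]
        enhanced.append(err)
    return enhanced, weights


def mean_power(values):
    return sum(abs(v) ** 2 for v in values) / len(values)


if __name__ == "__main__":
    taps = [0.8, 0.4 + 0.2j, 0.1]  # hypothetical direct path plus two reflections
    known, target, observed = simulate_bin(2000, taps, seed=1)
    enhanced, weights = cancel_known_source(observed, known, num_taps=len(taps))
    print("interference power before:",
          mean_power([o - t for o, t in zip(observed, target)]))
    print("interference power after: ",
          mean_power([e - t for e, t in zip(enhanced[1000:], target[1000:])]))
```

Using more than one tap per bin is what lets the filter account for reflections that outlast a single analysis frame; with `num_taps=1` the sketch falls back to an instantaneous per-bin mixing model, which cannot represent those delayed terms.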

Citation

Pages: 1763 / +

Page count: 2

