Robot ego-noise suppression with labanotation-template subtraction

Cited by: 2
Authors
Jaroslavceva, Jekaterina [1]
Wake, Naoki [1]
Sasabuchi, Kazuhiro [1]
Ikeuchi, Katsushi [1]
Affiliation
[1] Microsoft, Appl Robot Res, Redmond, WA 98052 USA
Keywords
ego-noise; labanotation; automatic speech recognition; human-robot interaction
DOI
10.1002/tee.23523
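Chinese Library Classification (CLC) code
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract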
In this study, we aim to improve automatic-speech-recognition (ASR) accuracy in the presence of robot ego-noise, toward better human-robot interaction. Although several noise reduction methods have been proposed to increase ASR accuracy or signal-to-noise ratio (SNR) by predicting ego-noise through short-time motion-template subtraction or a neural network, these methods showed poor performance in some practical use cases, such as attenuating long-term motion-associated ego-noise. Based on the motion-template subtraction method, we address the problem of creating ego-noise templates associated with a wide variety of robot motions. For representing robot motions, we employ a dance notation referred to as Labanotation. The rationales behind our approach are: (i) Labanotation allows quantizing infinite motion patterns using a finite number of Labanotation combinations; (ii) Labanotation-based motion description is hardware-independent; and (iii) long-time noise templates facilitate the localization of noise templates in a speech-with-noise signal compared to short-time templates. The effectiveness of the Labanotation-template subtraction (LTS) method was tested for five commercial ASRs in terms of ASR accuracy, SNR, and source-to-distortion ratio. We show that LTS leads to reasonable performance, comparable to the other methods. The contributions of this study are (i) proposing the use of Labanotation to reasonably collect noise templates and (ii) demonstrating the practical effectiveness of LTS, along with examples of Labanotations for household actions. © 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
Pages: 407-415
Number of pages: 9
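The abstract describes two steps: localizing a long noise template in a speech-with-noise signal, and subtracting it. The following is a minimal sketch of that general idea, not the authors' LTS implementation. It assumes a single-channel signal, a pre-recorded time-domain template for one motion, cross-correlation for localization, and magnitude-domain spectral subtraction. The function names (`locate_template`, `subtract_template`) and the parameters `frame`, `hop`, and `floor` are hypothetical choices made for illustration.

```python
import numpy as np

def locate_template(noisy, template):
    """Return the sample offset at which the template best aligns with the signal."""
    # Sliding dot product; index k scores noisy[k:k+len(template)] against template.
    corr = np.correlate(noisy, template, mode="valid")
    return int(np.argmax(corr))

def subtract_template(noisy, template, frame=256, hop=128, floor=0.05):
    """Subtract the template's magnitude spectrum from the noisy signal, frame by frame.

    Assumed values: frame/hop sizes and the spectral floor are illustrative only.
    Returns the cleaned signal and the estimated template offset.
    """
    offset = locate_template(noisy, template)

    # Place the template at the estimated position; zero elsewhere.
    aligned = np.zeros_like(noisy)
    aligned[offset:offset + len(template)] = template

    window = np.hanning(frame)
    out = np.zeros_like(noisy)
    norm = np.zeros_like(noisy)

    for start in range(0, len(noisy) - frame + 1, hop):
        seg = slice(start, start + frame)
        spec = np.fft.rfft(noisy[seg] * window)
        noise_mag = np.abs(np.fft.rfft(aligned[seg] * window))
        # Magnitude subtraction with a floor to avoid negative magnitudes.
        mag = np.maximum(np.abs(spec) - noise_mag, floor * np.abs(spec))
        # Reuse the noisy phase, as is common in spectral subtraction.
        clean = np.fft.irfft(mag * np.exp(1j * np.angle(spec)), n=frame)
        # Weighted overlap-add.
        out[seg] += clean * window
        norm[seg] += window ** 2

    return out / np.maximum(norm, 1e-8), offset
```

In a Labanotation-keyed setup, one such template would presumably be stored per encoded motion and looked up when that motion is executed; that lookup is outside this sketch. A longer template gives the cross-correlation more samples to match against, which is consistent with rationale (iii) in the abstract.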