Robot ego-noise suppression with labanotation-template subtraction

被引:2
|
作者
Jaroslavceva, Jekaterina [1 ]
Wake, Naoki [1 ]
Sasabuchi, Kazuhiro [1 ]
Ikeuchi, Katsushi [1 ]
机构
[1] Microsoft, Appl Robot Res, Redmond, WA 98052 USA
关键词
ego-noise; labanotation; automatic speech recognition; human-robot interaction;
D O I
10.1002/tee.23523
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, we aim to improve automatic-speech-recognition (ASR) accuracy in the presence of robot ego-noise toward a better human-robot interaction. Although several noise reduction methods have been proposed to increase ASR accuracy or signal-to-noise ratio (SNR) by predicting ego-noises through a short-time motion-template subtraction or a neural network, these methods showed poor performance in some practical use cases, such as attenuating long-term motion-associated ego-noise. Based on the motion-template subtraction method, we address the problem of creating ego-noise templates associated with a wide variety of robot motions. For representing robot motions, we employ a dance notation referred to as Labanotation. The rationales behind our approach are: (i) Labanotation allows quantizing infinite motion patterns using a finite number of Labanotation combinations; (ii) Labanotation-based motion description is hardware-independent; and (iii) long-time noise templates facilitate the localization of noise templates in a speech-with-noise signal compared to short-time templates. The effectiveness of the Labanotation-template subtraction (LTS) method was tested for five commercial ASRs in terms of ASR accuracy, SNR, and source-to-distortion ratio. We show that LTS leads to a reasonable performance, comparable to the other methods. The contribution of this study is (i) to propose to use Labanotation to reasonably collect noise templates, (ii) to demonstrate the practical effectiveness of LTS as well as examples of Labanotations for household actions. (c) 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:407 / 415
页数:9
相关论文
共 50 条
  • [1] Ego Noise Suppression of a Robot Using Template Subtraction
    Ince, Goekhan
    Nakadai, Kazuhiro
    Rodemann, Tobias
    Hasegawa, Yuji
    Tsujino, Hiroshi
    Imura, Jun-ichi
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 199 - 204
  • [2] Predictive Models for Robot Ego-Noise Learning and Imitation
    Villalpando, Antonio Pico
    Schillaci, Guido
    Hafner, Verena V.
    2018 JOINT IEEE 8TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2018, : 263 - 268
  • [3] JOINT EGO-NOISE SUPPRESSION AND KEYWORD SPOTTING ON SWEEPING ROBOTS
    Na, Yueyue
    Wang, Ziteng
    Wang, Liang
    Fu, Qiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7547 - 7551
  • [4] DETECTING ACOUSTIC REFLECTORS USING A ROBOT'S EGO-NOISE
    Saqib, Usama
    Deleforge, Antoine
    Jensen, Jesper Rindom
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 466 - 470
  • [5] Enhancing listening capability of humanoid robot by reduction of stationary ego-noise
    Wake, Naoki
    Fukumoto, Masaaki
    Takahashi, Hirokazu
    Ikeuchi, Katsushi
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2019, 14 (12) : 1815 - 1822
  • [6] Robust Ego Noise Suppression of a Robot
    Ince, Gokhan
    Nakadai, Kazuhiro
    Rodemann, Tobias
    Tsujino, Hiroshi
    Imura, Jun-Ichi
    TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT I, PROCEEDINGS, 2010, 6096 : 62 - +
  • [7] INFORMED EGO-NOISE SUPPRESSION USING MOTOR DATA-DRIVEN DICTIONARIES
    Schmidt, Alexander
    Kellermann, Walter
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 116 - 120
  • [8] Motor data-regularized nonnegative matrix factorization for ego-noise suppression
    Schmidt, Alexander
    Brendel, Andreas
    Haubner, Thomas
    Kellermann, Walter
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)
  • [9] Motor data-regularized nonnegative matrix factorization for ego-noise suppression
    Alexander Schmidt
    Andreas Brendel
    Thomas Haubner
    Walter Kellermann
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [10] Active Ego-noise Control Based on Metamaterial in Small-size Robot
    Gu, Xihan
    Chen, Yun
    Wu, Xiaofeng
    2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 859 - 862