Robot ego-noise suppression with labanotation-template subtraction

被引:2
|
作者
Jaroslavceva, Jekaterina [1 ]
Wake, Naoki [1 ]
Sasabuchi, Kazuhiro [1 ]
Ikeuchi, Katsushi [1 ]
机构
[1] Microsoft, Appl Robot Res, Redmond, WA 98052 USA
关键词
ego-noise; labanotation; automatic speech recognition; human-robot interaction;
D O I
10.1002/tee.23523
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, we aim to improve automatic-speech-recognition (ASR) accuracy in the presence of robot ego-noise toward a better human-robot interaction. Although several noise reduction methods have been proposed to increase ASR accuracy or signal-to-noise ratio (SNR) by predicting ego-noises through a short-time motion-template subtraction or a neural network, these methods showed poor performance in some practical use cases, such as attenuating long-term motion-associated ego-noise. Based on the motion-template subtraction method, we address the problem of creating ego-noise templates associated with a wide variety of robot motions. For representing robot motions, we employ a dance notation referred to as Labanotation. The rationales behind our approach are: (i) Labanotation allows quantizing infinite motion patterns using a finite number of Labanotation combinations; (ii) Labanotation-based motion description is hardware-independent; and (iii) long-time noise templates facilitate the localization of noise templates in a speech-with-noise signal compared to short-time templates. The effectiveness of the Labanotation-template subtraction (LTS) method was tested for five commercial ASRs in terms of ASR accuracy, SNR, and source-to-distortion ratio. We show that LTS leads to a reasonable performance, comparable to the other methods. The contribution of this study is (i) to propose to use Labanotation to reasonably collect noise templates, (ii) to demonstrate the practical effectiveness of LTS as well as examples of Labanotations for household actions. (c) 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:407 / 415
页数:9
相关论文
共 50 条
  • [21] A Blind Source Separation Framework for Ego-Noise Reduction on Multi-Rotor Drones
    Wang, Lin
    Cavallaro, Andrea
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 2523 - 2537
  • [22] Defect detection with ego-noise reduction based on multimodal information in UAV hammering inspection
    Shoda, Koki
    Kasahara, Jun Younes Louhi
    Asama, Hajime
    An, Qi
    Yamashita, Atsushi
    ADVANCED ROBOTICS, 2024, 38 (17) : 1218 - 1230
  • [23] EGO-NOISE REDUCTION FOR A HOSE-SHAPED RESCUE ROBOT USING DETERMINED RANK-1 MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION
    Takakusaki, Moe
    Kitamura, Daichi
    Ono, Nobutaka
    Yamada, Takeshi
    Makino, Shoji
    Saruwatari, Hiroshi
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [24] Microphone-Array Ego-Noise Reduction Algorithms for Auditory Micro Aerial Vehicles
    Wang, Lin
    Cavallaro, Andrea
    IEEE SENSORS JOURNAL, 2017, 17 (08) : 2447 - 2455
  • [25] Incremental Learning for Ego Noise Estimation of a Robot
    Ince, Goekhan
    Nakadai, Kazuhiro
    Rodemann, Tobias
    Imura, Jun-ichi
    Nakamura, Keisuke
    Nakajima, Hirofumi
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 131 - 136
  • [26] A Hybrid Framework for Ego Noise Cancellation of a Robot
    Ince, Goekhan
    Nakadai, Kazuhiro
    Rodemann, Tobias
    Hasegawa, Yuji
    Tsujino, Hiroshi
    Imura, Jun-ichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 3623 - 3628
  • [27] SPHERICAL HARMONIC DIAGONAL UNLOADING BEAMFORMING WITH EGO-NOISE REDUCTION FOR DOA ESTIMATION FROM AUTONOMOUS SYSTEMS
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 216 - 220
  • [28] Drone Ego-Noise Cancellation for Improved Speech Capture using Deep Convolutional Autoencoder Assisted Multistage Beamforming
    Song, Yanjue
    Kindt, Stijn
    Madhu, Nilesh
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022,
  • [29] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [30] Ego noise cancellation of a robot using missing feature masks
    Gökhan Ince
    Kazuhiro Nakadai
    Tobias Rodemann
    Hiroshi Tsujino
    Jun-ichi Imura
    Applied Intelligence, 2011, 34 : 360 - 371