Robot ego-noise suppression with labanotation-template subtraction

被引:2
|
作者
Jaroslavceva, Jekaterina [1 ]
Wake, Naoki [1 ]
Sasabuchi, Kazuhiro [1 ]
Ikeuchi, Katsushi [1 ]
机构
[1] Microsoft, Appl Robot Res, Redmond, WA 98052 USA
关键词
ego-noise; labanotation; automatic speech recognition; human-robot interaction;
D O I
10.1002/tee.23523
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, we aim to improve automatic-speech-recognition (ASR) accuracy in the presence of robot ego-noise toward a better human-robot interaction. Although several noise reduction methods have been proposed to increase ASR accuracy or signal-to-noise ratio (SNR) by predicting ego-noises through a short-time motion-template subtraction or a neural network, these methods showed poor performance in some practical use cases, such as attenuating long-term motion-associated ego-noise. Based on the motion-template subtraction method, we address the problem of creating ego-noise templates associated with a wide variety of robot motions. For representing robot motions, we employ a dance notation referred to as Labanotation. The rationales behind our approach are: (i) Labanotation allows quantizing infinite motion patterns using a finite number of Labanotation combinations; (ii) Labanotation-based motion description is hardware-independent; and (iii) long-time noise templates facilitate the localization of noise templates in a speech-with-noise signal compared to short-time templates. The effectiveness of the Labanotation-template subtraction (LTS) method was tested for five commercial ASRs in terms of ASR accuracy, SNR, and source-to-distortion ratio. We show that LTS leads to a reasonable performance, comparable to the other methods. The contribution of this study is (i) to propose to use Labanotation to reasonably collect noise templates, (ii) to demonstrate the practical effectiveness of LTS as well as examples of Labanotations for household actions. (c) 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:407 / 415
页数:9
相关论文
共 50 条
  • [11] A NOVEL EGO-NOISE SUPPRESSION ALGORITHM FOR ACOUSTIC SIGNAL ENHANCEMENT IN AUTONOMOUS SYSTEMS
    Schmidt, Alexander
    Loellmann, Heinrich W.
    Kellermann, Walter
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6583 - 6587
  • [12] How Do I Sound Like? Forward Models for Robot Ego-Noise Prediction
    Pico, Antonio
    Schillaci, Guido
    Hafner, Verena V.
    Lara, Bruno
    2016 JOINT IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2016, : 246 - 251
  • [13] Ego-Noise Predictions for Echolocation in Wheeled Robots
    Villalpando, Antonio Pico
    Schillaci, Guido
    Hafner, Verena V.
    Guzman, Bruno Lara
    ALIFE 2019: THE 2019 CONFERENCE ON ARTIFICIAL LIFE, 2019, : 567 - 573
  • [14] DRONAR: OBSTACLE ECHOLOCATION USING DRONE EGO-NOISE
    Nilsson, Henrik
    Rydell, Joakim
    Kullberg, Anton
    Hendeby, Gustaf
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 184 - 188
  • [15] Ego-noise reduction of a mobile robot using noise spatial covariance matrix learning and minimum variance distortionless response
    Lagace, Pierre-Olivier
    Ferland, Francois
    Grondin, Francois
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3533 - 3538
  • [16] Body Representations for Robot Ego-Noise Modelling and Prediction. Towards the Development of a Sense of Agency in Artificial Agents
    Schillaci, Guido
    Ritter, Claas N.
    Hafner, Verena V.
    Lara, Bruno
    ALIFE 2016, THE FIFTEENTH INTERNATIONAL CONFERENCE ON THE SYNTHESIS AND SIMULATION OF LIVING SYSTEMS, 2016, : 390 - 397
  • [17] A Robust Auditory System for Ego-noise Suppression Based on Active Acoustic Metamaterials in Small-size Robots
    Gu, Xihan
    Chen, Yun
    Wu, Xiaofeng
    Wang, Yuan
    2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017, : 607 - 610
  • [18] Ear in the sky: ego-noise reduction for auditory micro aerial vehicles
    Wang, Lin
    Cavallaro, Andrea
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2016, : 152 - 158
  • [19] On the application of SEGAN for the attenuation of the ego-noise in the speech sound source localization problem
    Spadini, Tito
    Aldeia, Guilherme Seidyo Imai
    Barreto, Guilherme
    Alves, Kaleb
    Ferreira, Henrique
    Suyama, Ricardo
    Nose-Filho, Kenji
    2019 WORKSHOP ON COMMUNICATION NETWORKS AND POWER SYSTEMS (WCNPS), 2019,
  • [20] Ego-Noise Reduction Using a Motor Data-Guided Multichannel Dictionary
    Schmidt, Alexander
    Deleforge, Antoine
    Kellermann, Walter
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 1281 - 1286