Speech-based Human-Robot Interaction Robust to Acoustic Reflections in Real Environment

Cited by: 0
Authors
Gomez, Randy [1]
Inoue, Koji
Nakamura, Keisuke [1]
Mizumoto, Takeshi [1]
Nakadai, Kazuhiro [1]
Affiliations
[1] Honda Res Inst Japan Ltd Co, Wako, Saitama, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Acoustic reflection inside an enclosed environment is detrimental to human-robot interaction. Reflections may manifest as phantom sources emanating from unknown directions. As a result, a single speaker may falsely appear as multiple speakers to the robot audition system, impeding the robot's ability to correctly associate a speech command with the actual speaker. Moreover, reflected speech smears the original speech signal through reverberation, which degrades speech recognition and understanding performance. Conventional robot audition schemes that rely purely on acoustic and spatial information are very sensitive to acoustic reflection, which ultimately leads to failure in human-robot interaction. We propose a method for human-robot interaction that is robust to the effects of acoustic reflection. First, visual information is utilized: a head-tracking scheme is employed to reinforce the acoustic information with the visual presence of a prospective user. Second, we employ a model-based sound event identification scheme and scrutinize whether the acoustic information is likely to be speech or non-speech. Using all the gathered information, we create a simple rule construct to effectively discriminate the original source (the actual speaker) from phantom sources (reflections). The source identified as a phantom (reflection) is then used to estimate the unwanted smearing for effective suppression via speech enhancement. Experiments are conducted in a human-robot interaction setting, in which the proposed method outperforms the conventional method.
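The abstract does not spell out the rule construct, so the following is only an illustrative sketch of how visual presence and a speech/non-speech decision might be combined to separate an actual speaker from a phantom source. Every name here (SourceCandidate, classify_sources, angle_tol_deg, speech_threshold), the azimuth-only geometry, and the threshold values are assumptions made for illustration and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SourceCandidate:
    """One localized sound source (hypothetical structure, not from the paper)."""
    azimuth_deg: float        # estimated direction of arrival of the sound
    speech_likelihood: float  # assumed score in [0, 1] from a speech/non-speech model


def angular_diff(a_deg: float, b_deg: float) -> float:
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a_deg - b_deg) % 360.0
    return min(d, 360.0 - d)


def classify_sources(
    candidates: List[SourceCandidate],
    head_azimuths_deg: List[float],
    angle_tol_deg: float = 10.0,    # assumed tolerance, chosen arbitrarily
    speech_threshold: float = 0.5,  # assumed threshold, chosen arbitrarily
) -> List[str]:
    """Label each candidate as 'speaker', 'phantom', or 'non-speech'.

    Assumed rule: speech arriving from a direction where a head is tracked
    is the actual speaker; speech arriving from a direction with no tracked
    head is treated as a reflection (phantom); everything else is non-speech.
    """
    labels = []
    for c in candidates:
        has_head = any(
            angular_diff(c.azimuth_deg, h) <= angle_tol_deg
            for h in head_azimuths_deg
        )
        is_speech = c.speech_likelihood >= speech_threshold
        if is_speech and has_head:
            labels.append("speaker")
        elif is_speech:
            labels.append("phantom")
        else:
            labels.append("non-speech")
    return labels


if __name__ == "__main__":
    candidates = [
        SourceCandidate(azimuth_deg=30.0, speech_likelihood=0.9),
        SourceCandidate(azimuth_deg=-120.0, speech_likelihood=0.8),
        SourceCandidate(azimuth_deg=90.0, speech_likelihood=0.1),
    ]
    print(classify_sources(candidates, head_azimuths_deg=[28.0]))
```

In this sketch the candidates labeled as phantom would be the ones passed on to a separate enhancement stage to estimate the smearing, which is not modeled here.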
Pages: 1367-1373
Page count: 7
Related papers
50 in total
  • [41] Speech to Head Gesture Mapping in Multimodal Human-Robot Interaction
    Aly, Amir
    Tapus, Adriana
    SERVICE ORIENTATION IN HOLONIC AND MULTI-AGENT MANUFACTURING CONTROL, 2012, 402 : 183 - 196
  • [42] Mutual assistance between speech and vision for human-robot interaction
    Burger, Brice
    Lerasle, Frederic
    Ferrane, Isabelle
    Clodic, Aurelie
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 4011 - +
  • [43] A Training Tool for Speech Driven Human-Robot Interaction Applications
    Hudson, Christopher
    Bethel, Cindy L.
    Carruth, Daniel W.
    Pleva, Matus
    Juhar, Jozef
    Ondas, Stanislav
    2017 15TH IEEE INTERNATIONAL CONFERENCE ON EMERGING ELEARNING TECHNOLOGIES AND APPLICATIONS (ICETA 2017), 2017, : 167 - 172
  • [44] Emotion Recognition From Speech to Improve Human-robot Interaction
    Zhu, Changrui
    Ahamd, Wasim
    IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 370 - 375
  • [45] An application of speech/speaker recognition system for human-robot interaction
    Jo, Hyun
    Kim, Gyeongho
    Park, Youngjin
    2007 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-6, 2007, : 757 - 760
  • [46] A Robot Navigation Method Based on Human-Robot Interaction for 3D Environment Mapping
    Zhao, Lijun
    Li, Xiaoyu
    Sun, Zhenye
    Wang, Ke
    Yang, Chenguang
    2017 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (RCAR), 2017, : 409 - 414
  • [47] Minimal representation of speech signals for generation of emotion speech and human-robot interaction
    Lee, Heyoung
    Bien, Z. Zenn
    2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 137 - +
  • [48] Human-Robot Interaction
    Jia, Yunyi
    Zhang, Biao
    Li, Miao
    King, Brady
    Meghdari, Ali
    JOURNAL OF ROBOTICS, 2018, 2018
  • [49] A Real-Time Vision-Based Framework for Human-Robot Interaction
    Lam, Meng Chun
    Prabuwono, Anton Satria
    Arshad, Haslina
    Chan, Chee Seng
    VISUAL INFORMATICS: SUSTAINING RESEARCH AND INNOVATIONS, PT I, 2011, 7066 : 257 - +
  • [50] Human-Robot Interaction
    Sidobre, Daniel
    Broquere, Xavier
    Mainprice, Jim
    Burattini, Ernesto
    Finzi, Alberto
    Rossi, Silvia
    Staffa, Mariacarla
    ADVANCED BIMANUAL MANIPULATION: RESULTS FROM THE DEXMART PROJECT, 2012, 80 : 123 - +