A Novel Multimodal Situated Spoken Dialog System for Human Robot Communication in Emergency Evacuation

被引:1
|
作者
Paul, Sheuli [1 ]
Sintek, Michael [2 ]
Silaghi, Marius [3 ]
Kepuska, Veton [3 ]
Robertson, Liam [4 ]
机构
[1] DRDC, Ottawa, ON, Canada
[2] DFKI GmbH, Kaiserslautern, Germany
[3] Florida Inst Technol, Melbourne, FL 32901 USA
[4] DND, Ottawa, ON, Canada
关键词
Axios" word used as WUW; MSSDS; MA; HRI; MDP;
D O I
10.1109/ICMLA55696.2022.00255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the need for multimodal autonomous human-robot interactive systems serving complex situations, a noise-robust and contextaware situated multimodal spoken dialogue system (MSSDS) in emergency evacuation missions is presented. The MSSDS system is composed of: (1) a context and noise robust Wake-Up-Word (WUW) (a) to initiate the dialogue, and (b) to detect the context switching by explicit use of WUW during emergency evacuation mission steps, (2) multi-turn userinitiated interactive text-and-spoken dialog communication system, and (3) an interactive voice and text interface for human-robot communication designed based on dialogues used in real-world emergency situations. In an emergency environment, speech is mixed with different noises, and therefore communication using speech in such an environment is challenging. We handle the noise by using Team Connect Ceiling (TCC) beam-forming microphone arrays. Innovative and useful applications of spoken dialogue systems, presented as proof of concept, constitute another contribution. Numerous digital assistants, keyword spotting, and wake-up-words-based technologies have already been developed, but these are mainly used indoors. Our objective is to support communication in complex environments, e.g., indoors and outdoors, in human and machine teaming, via a wake-up-word-based multimodal interactive system. The development of the real-world application to communicate with the robot using multimodalities in complex situations based on the presented approach is in progress while the presented simulated approach is reflecting parts of this development. Numerous machine learning technologies and toolkits have been applied in this ongoing development process. The novelty of WUW-based MSSDS is discussed in this paper. Our Markov Decision Process (MDP) evaluation shows that the WUW-based MSSDS performs better.
引用
收藏
页码:1660 / 1665
页数:6
相关论文
共 50 条
  • [31] Rapid Simulation-Driven Reinforcement Learning of Multimodal Dialog Strategies in Human-Robot Interaction
    Prommer, Thomas
    Holzapfel, Hartwig
    Waibel, Alex
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1918 - 1921
  • [32] A Horizontal Approach to Communication for Human-Robot Joint Action: Towards Situated and Sustainable Robotics
    Belhassein, Kathleen
    Fernandez Castro, Victor
    Mayima, Amandine
    CULTURALLY SUSTAINABLE SOCIAL ROBOTICS, 2020, 335 : 204 - 214
  • [33] The development and evaluation of Robot Light Skin: A novel robot signalling system to improve communication in industrial human-robot collaboration
    Tang, Gilbert
    Webb, Phil
    Thrower, John
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2019, 56 : 85 - 94
  • [34] Multimodal Communication for Human-Friendly Robot Partners in Informationally Structured Space
    Kubota, Naoyuki
    Toda, Yuichiro
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1142 - 1151
  • [35] Enhancing Safe Human-Robot Collaboration through Natural Multimodal Communication
    Maurtua, Inaki
    Fernandez, Izaskun
    Kildal, Johan
    Susperregi, Loreto
    Tellaeche, Alberto
    Ibarguren, Aitor
    2016 IEEE 21ST INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2016,
  • [36] A novel visual interface for human-robot communication
    Zelinsky, A
    Heinzmann, J
    ADVANCED ROBOTICS, 1998, 11 (08) : 827 - 852
  • [37] Evaluation of Unimodal and Multimodal Communication Cues for Attracting Attention in Human–Robot Interaction
    Elena Torta
    Jim van Heumen
    Francesco Piunti
    Luca Romeo
    Raymond Cuijpers
    International Journal of Social Robotics, 2015, 7 : 89 - 96
  • [38] The impact of human-robot multimodal communication on mental workload, usability preference, and expectations of robot behavior
    Abich, Julian
    Barber, Daniel J.
    JOURNAL ON MULTIMODAL USER INTERFACES, 2017, 11 (02) : 211 - 225
  • [39] Improvement of Speech Recognition Performance for Spoken-Oriented Robot Dialog System Using End-fire Array
    Sawada, Hiroshi
    Even, Jani
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    Takatani, Tomoya
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 970 - 975
  • [40] A System to Generate Robot Emotional Reaction for Robot-Human Communication
    Olgun, Zehra Nur
    Chae, YuJung
    Kim, ChangHwan
    2018 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2018, : 383 - 387