A Novel Multimodal Situated Spoken Dialog System for Human Robot Communication in Emergency Evacuation

被引:1
|
作者
Paul, Sheuli [1 ]
Sintek, Michael [2 ]
Silaghi, Marius [3 ]
Kepuska, Veton [3 ]
Robertson, Liam [4 ]
机构
[1] DRDC, Ottawa, ON, Canada
[2] DFKI GmbH, Kaiserslautern, Germany
[3] Florida Inst Technol, Melbourne, FL 32901 USA
[4] DND, Ottawa, ON, Canada
关键词
Axios" word used as WUW; MSSDS; MA; HRI; MDP;
D O I
10.1109/ICMLA55696.2022.00255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the need for multimodal autonomous human-robot interactive systems serving complex situations, a noise-robust and contextaware situated multimodal spoken dialogue system (MSSDS) in emergency evacuation missions is presented. The MSSDS system is composed of: (1) a context and noise robust Wake-Up-Word (WUW) (a) to initiate the dialogue, and (b) to detect the context switching by explicit use of WUW during emergency evacuation mission steps, (2) multi-turn userinitiated interactive text-and-spoken dialog communication system, and (3) an interactive voice and text interface for human-robot communication designed based on dialogues used in real-world emergency situations. In an emergency environment, speech is mixed with different noises, and therefore communication using speech in such an environment is challenging. We handle the noise by using Team Connect Ceiling (TCC) beam-forming microphone arrays. Innovative and useful applications of spoken dialogue systems, presented as proof of concept, constitute another contribution. Numerous digital assistants, keyword spotting, and wake-up-words-based technologies have already been developed, but these are mainly used indoors. Our objective is to support communication in complex environments, e.g., indoors and outdoors, in human and machine teaming, via a wake-up-word-based multimodal interactive system. The development of the real-world application to communicate with the robot using multimodalities in complex situations based on the presented approach is in progress while the presented simulated approach is reflecting parts of this development. Numerous machine learning technologies and toolkits have been applied in this ongoing development process. The novelty of WUW-based MSSDS is discussed in this paper. Our Markov Decision Process (MDP) evaluation shows that the WUW-based MSSDS performs better.
引用
收藏
页码:1660 / 1665
页数:6
相关论文
共 50 条
  • [41] A Laser Projection System for Robot Intention Communication and Human Robot Interaction
    Wengefeld, Tim
    Hoechemer, Dominik
    Lewandowski, Benjamin
    Koehler, Mona
    Beer, Manuel
    Gross, Horst-Michael
    2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 259 - 265
  • [42] Human-Humanoid Robot Interaction System Based on Spoken Dialogue and Vision
    Mu, Yanhua
    Yin, YiXin
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 6, 2010, : 328 - 332
  • [43] PARLOMA - A Novel Human-Robot Interaction System for Deaf-Blind Remote Communication
    Russo, Ludovico Orlando
    Farulla, Giuseppe Airo
    Pianu, Daniele
    Salgarella, Alice Rita
    Controzzi, Marco
    Cipriani, Christian
    Oddo, Calogero Maria
    Geraci, Carlo
    Rosa, Stefano
    Indaco, Marco
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2015, 12
  • [44] Human-Robot Communication System for an Isolated Environment
    Diddeniya, Isanka
    Wanniarachchi, Indika
    Gunasinghe, Hansi
    Premachandra, Chinthaka
    Kawanaka, Hiroharu
    IEEE ACCESS, 2022, 10 : 63258 - 63269
  • [45] A Data-Driven Paradigm to Understand Multimodal Communication in Human-Human and Human-Robot Interaction
    Yu, Chen
    Smith, Thomas G.
    Hidaka, Shohei
    Scheutz, Matthias
    Smith, Linda B.
    ADVANCES IN INTELLIGENT DATA ANALYSIS IX, PROCEEDINGS, 2010, 6065 : 232 - 244
  • [46] Evaluation of Unimodal and Multimodal Communication Cues for Attracting Attention in Human-Robot Interaction
    Torta, Elena
    van Heumen, Jim
    Piunti, Francesco
    Romeo, Luca
    Cuijpers, Raymond
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2015, 7 (01) : 89 - 96
  • [47] A Multimodal Emotion Detection System during Human-Robot Interaction
    Alonso-Martin, Fernando
    Malfaz, Maria
    Sequeira, Joao
    Gorostiza, Javier F.
    Salichs, Miguel A.
    SENSORS, 2013, 13 (11) : 15549 - 15581
  • [48] HandTalker:: A multimodal dialog system using sign language and 3-D virtual human
    Gao, W
    Ma, JY
    Shan, SG
    Chen, XL
    Zheng, W
    Zhang, HM
    Yan, J
    Wu, JQ
    ADVANCES IN MULTIMODAL INTERFACES - ICMI 2000, PROCEEDINGS, 2000, 1948 : 564 - 571
  • [49] Telecommunicator: A novel robot system for human communications
    Tsumaki, Y
    Fujita, Y
    Kasai, A
    Sato, C
    Nenchev, DN
    Uchiyama, M
    IEEE ROMAN 2002, PROCEEDINGS, 2002, : 35 - 40
  • [50] Spoken language interaction with model uncertainty: an adaptive human-robot interaction system
    Doshi, Finale
    Roy, Nicholas
    CONNECTION SCIENCE, 2008, 20 (04) : 299 - 318