A Novel Multimodal Situated Spoken Dialog System for Human Robot Communication in Emergency Evacuation

被引：1

作者：

Paul, Sheuli ^{[1
]}

Sintek, Michael ^{[2
]}

Silaghi, Marius ^{[3
]}

Kepuska, Veton ^{[3
]}

Robertson, Liam ^{[4
]}

机构：

[1] DRDC, Ottawa, ON, Canada

[2] DFKI GmbH, Kaiserslautern, Germany

[3] Florida Inst Technol, Melbourne, FL 32901 USA

[4] DND, Ottawa, ON, Canada

来源：

2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA | 2022年

关键词：

Axios" word used as WUW; MSSDS; MA; HRI; MDP;

D O I：

10.1109/ICMLA55696.2022.00255

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given the need for multimodal autonomous human-robot interactive systems serving complex situations, a noise-robust and contextaware situated multimodal spoken dialogue system (MSSDS) in emergency evacuation missions is presented. The MSSDS system is composed of: (1) a context and noise robust Wake-Up-Word (WUW) (a) to initiate the dialogue, and (b) to detect the context switching by explicit use of WUW during emergency evacuation mission steps, (2) multi-turn userinitiated interactive text-and-spoken dialog communication system, and (3) an interactive voice and text interface for human-robot communication designed based on dialogues used in real-world emergency situations. In an emergency environment, speech is mixed with different noises, and therefore communication using speech in such an environment is challenging. We handle the noise by using Team Connect Ceiling (TCC) beam-forming microphone arrays. Innovative and useful applications of spoken dialogue systems, presented as proof of concept, constitute another contribution. Numerous digital assistants, keyword spotting, and wake-up-words-based technologies have already been developed, but these are mainly used indoors. Our objective is to support communication in complex environments, e.g., indoors and outdoors, in human and machine teaming, via a wake-up-word-based multimodal interactive system. The development of the real-world application to communicate with the robot using multimodalities in complex situations based on the presented approach is in progress while the presented simulated approach is reflecting parts of this development. Numerous machine learning technologies and toolkits have been applied in this ongoing development process. The novelty of WUW-based MSSDS is discussed in this paper. Our Markov Decision Process (MDP) evaluation shows that the WUW-based MSSDS performs better.

引用

页码：1660 / 1665

页数：6

共 50 条

[41] A Laser Projection System for Robot Intention Communication and Human Robot Interaction
Wengefeld, Tim
Hoechemer, Dominik
Lewandowski, Benjamin
Koehler, Mona
Beer, Manuel
Gross, Horst-Michael
2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 259 - 265
[42] Human-Humanoid Robot Interaction System Based on Spoken Dialogue and Vision
Mu, Yanhua
Yin, YiXin
PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 6, 2010, : 328 - 332
[43] PARLOMA - A Novel Human-Robot Interaction System for Deaf-Blind Remote Communication
Russo, Ludovico Orlando
Farulla, Giuseppe Airo
Pianu, Daniele
Salgarella, Alice Rita
Controzzi, Marco
Cipriani, Christian
Oddo, Calogero Maria
Geraci, Carlo
Rosa, Stefano
Indaco, Marco
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2015, 12
[44] Human-Robot Communication System for an Isolated Environment
Diddeniya, Isanka
Wanniarachchi, Indika
Gunasinghe, Hansi
Premachandra, Chinthaka
Kawanaka, Hiroharu
IEEE ACCESS, 2022, 10 : 63258 - 63269
[45] A Data-Driven Paradigm to Understand Multimodal Communication in Human-Human and Human-Robot Interaction
Yu, Chen
Smith, Thomas G.
Hidaka, Shohei
Scheutz, Matthias
Smith, Linda B.
ADVANCES IN INTELLIGENT DATA ANALYSIS IX, PROCEEDINGS, 2010, 6065 : 232 - 244
[46] Evaluation of Unimodal and Multimodal Communication Cues for Attracting Attention in Human-Robot Interaction
Torta, Elena
van Heumen, Jim
Piunti, Francesco
Romeo, Luca
Cuijpers, Raymond
INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2015, 7 (01) : 89 - 96
[47] A Multimodal Emotion Detection System during Human-Robot Interaction
Alonso-Martin, Fernando
Malfaz, Maria
Sequeira, Joao
Gorostiza, Javier F.
Salichs, Miguel A.
SENSORS, 2013, 13 (11) : 15549 - 15581
[48] HandTalker:: A multimodal dialog system using sign language and 3-D virtual human
Gao, W
Ma, JY
Shan, SG
Chen, XL
Zheng, W
Zhang, HM
Yan, J
Wu, JQ
ADVANCES IN MULTIMODAL INTERFACES - ICMI 2000, PROCEEDINGS, 2000, 1948 : 564 - 571
[49] Telecommunicator: A novel robot system for human communications
Tsumaki, Y
Fujita, Y
Kasai, A
Sato, C
Nenchev, DN
Uchiyama, M
IEEE ROMAN 2002, PROCEEDINGS, 2002, : 35 - 40
[50] Spoken language interaction with model uncertainty: an adaptive human-robot interaction system
Doshi, Finale
Roy, Nicholas
CONNECTION SCIENCE, 2008, 20 (04) : 299 - 318

← 1 2 3 4 5 →