Eliciting Multimodal Gesture plus Speech Interactions in a Multi-Object Augmented Reality Environment

被引:4
|
作者
Zhou, Xiaoyan [1 ]
Williams, Adam S. [1 ]
Ortega, Francisco R. [1 ]
机构
[1] Colorado State Univ, Ft Collins, CO USA
基金
美国国家科学基金会;
关键词
elicitation; multimodal interaction; augmented reality; gesture and speech interaction; multi-object AR environment;
D O I
10.1145/3562939.3565637
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As augmented reality (AR) technology and hardware become more mature and affordable, researchers have been exploring more intuitive and discoverable interaction techniques for immersive environments. This paper investigates multimodal interaction for 3D object manipulation in a multi-object AR environment. To identify the user-defined gestures, we conducted an elicitation study involving 24 participants and 22 referents using an augmented reality headset. It yielded 528 proposals and generated a winning gesture set with 25 gestures after binning and ranking all gesture proposals. We found that for the same task, the same gesture was preferred for both one and two-object manipulation, although both hands were used in the two-object scenario. We present the gestures and speech results, and the differences compared to similar studies in a single object AR environment. The study also explored the association between speech expressions and gesture stroke during object manipulation, which could improve the recognizer efficiency in augmented reality headsets.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Sculpting in Augmented Reality Redefining Digital Crafts Through Multimodal Interactions
    Scotto, Fabio
    VIRTUAL, AUGMENTED AND MIXED REALITY: APPLICATIONS IN EDUCATION, AVIATION AND INDUSTRY, PT II, 2022, 13318 : 91 - 107
  • [42] SenseShapes: Using statistical geometry for object selection in a multimodal augmented reality system
    Olwal, A
    Benko, H
    Feiner, S
    SECOND IEEE AND ACM INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, PROCEEDINGS, 2003, : 300 - 301
  • [43] Generation of augmented video sequences combining behavioral animation and multi-object tracking
    Baiget, Pau
    Fernandez, Carles
    Roca, Xavier
    Gonzalez, Jordi
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2009, 20 (04) : 473 - 489
  • [44] Not channels but composite signals: Speech, gesture, diagrams and object demonstrations are integrated in multimodal explanations
    Engle, RA
    PROCEEDINGS OF THE TWENTIETH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1998, : 321 - 326
  • [45] Learning shared template representation with augmented feature for multi-object pose estimation
    Luo, Qifeng
    Xu, Ting -Bing
    Liu, Fulin
    Li, Tianren
    Wei, Zhenzhong
    NEURAL NETWORKS, 2024, 176
  • [46] YoLite plus : a lightweight multi-object detection approach in traffic scenarios
    You Shuai
    Chen Zhiyu
    Liu Shangdong
    Wu Mengxue
    Tian Feng
    Ji Yimu
    8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2020 & 2021): DEVELOPING GLOBAL DIGITAL ECONOMY AFTER COVID-19, 2022, 199 : 346 - 353
  • [47] MF-Net: A Multimodal Fusion Model for Fast Multi-Object Tracking
    Tian, Shirui
    Duan, Mingxing
    Deng, Jiayan
    Luo, Huizhang
    Hu, Yikun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (08) : 10948 - 10962
  • [48] Boosting Aerial Object Detection Performance via Virtual Reality Data and Multi-Object Training
    Koutsoubis, Nikolas
    Naddeo, Kyle
    Williams, Garrett
    Lecakes, George, Jr.
    Ditzler, Gregory
    Bouaynaya, Nidhal C.
    Kiel, Thomas
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [49] Real time virtualized real object manipulation in an augmented reality environment
    Nini, B
    Batouche, M
    BRAIN, VISION, AND ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3704 : 477 - 486
  • [50] Sensory Extension of a Tangible Object for Physical User Interactions in Augmented Reality
    Doering, Dagny C.
    Horst, Robin
    Rau, Linda
    Doerner, Ralf
    HUCAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 2: HUCAPP, 2021, : 153 - 160