Attention-Based Variational Autoencoder Models for Human-Human Interaction Recognition via Generation

Cited by: 1
Authors
Banerjee, Bonny [1, 2]
Baruah, Murchana [1, 2]
Affiliations
[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA
[2] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA
Keywords
embodied AI agent; intent prediction; human-human interaction recognition; human-human interaction generation; attention; perception; proprioception; multimodal; variational autoencoder; recurrent neural network (RNN); long short-term memory (LSTM); FRAMEWORK; EMERGENCE; FUSION
DOI
10.3390/s24123922
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
The remarkable human ability to predict others' intent during physical interactions develops at a very early age and is crucial for development. Intent prediction, defined as the simultaneous recognition and generation of human-human interactions, has many applications such as in assistive robotics, human-robot interaction, video and robotic surveillance, and autonomous driving. However, models for solving the problem are scarce. This paper proposes two attention-based agent models to predict the intent of interacting 3D skeletons by sampling them via a sequence of glimpses. The novelty of these agent models is that they are inherently multimodal, consisting of perceptual and proprioceptive pathways. The action (attention) is driven by the agent's generation error, and not by reinforcement. At each sampling instant, the agent completes the partially observed skeletal motion and infers the interaction class. It learns where and what to sample by minimizing the generation and classification errors. Extensive evaluation of our models is carried out on benchmark datasets and in comparison to a state-of-the-art model for intent prediction, which reveals that classification and generation accuracies of one of the proposed models are comparable to those of the state of the art even though our model contains fewer trainable parameters. The insights gained from our model designs can inform the development of efficient agents, the future of artificial intelligence (AI).
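The abstract states that attention is driven by the agent's generation error rather than by reinforcement. A minimal sketch of that idea follows, assuming (this is not confirmed by the abstract) that the error is measured per joint as the Euclidean distance between generated and observed 3D positions and that the next glimpse covers the joints with the largest error. The function name `select_glimpse`, the parameter `k`, and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np


def select_glimpse(predicted, observed, k=1):
    """Pick the k joints with the largest generation error (illustrative only).

    predicted, observed: arrays of shape (num_joints, 3) with 3D joint positions.
    Returns (indices of the k highest-error joints, per-joint errors).
    """
    # Per-joint Euclidean distance between generated and observed positions.
    errors = np.linalg.norm(predicted - observed, axis=1)
    # Sort ascending, reverse to get largest first, keep the first k indices.
    top = np.argsort(errors)[::-1][:k]
    return top, errors


# Toy skeleton with four joints; the agent's generation is all zeros here.
predicted = np.zeros((4, 3))
observed = np.array([
    [0.0, 0.0, 0.0],
    [0.1, 0.0, 0.0],
    [0.0, 2.0, 0.0],
    [0.0, 0.0, 0.5],
])

glimpse, errors = select_glimpse(predicted, observed, k=2)
print(glimpse)  # indices of the two joints the sketch would sample next
```

In this sketch the selection rule is a plain top-k over per-joint error; the models in the paper learn where and what to sample jointly with generation and classification, which this toy example does not attempt to reproduce.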
Pages: 35
Related papers
50 in total
  • [31] STA-HAR: A Spatiotemporal Attention-Based Framework for Human Activity Recognition
    Khaliluzzaman, Md.
    Furquan, Md.
    Khan, Mohammod Sazid Zaman
    Hoque, Md. Jiabul
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2024, 2024
  • [32] TAHAR: A Transferable Attention-Based Adversarial Network for Human Activity Recognition with RFID
    Chen, Dinghao
    Yang, Lvqing
    Cao, Hua
    Wang, Qingkai
    Dong, Wensheng
    Yu, Bo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 247 - 259
  • [33] Attention-Based Deep Learning Framework for Human Activity Recognition With User Adaptation
    Buffelli, Davide
    Vandin, Fabio
    IEEE SENSORS JOURNAL, 2021, 21 (12) : 13474 - 13483
  • [34] Attention-based vector quantisation variational autoencoder for colour-patterned fabrics defect detection
    Zhang, Hongwei
    Qiao, Guanhua
    Liu, Shuting
    Lyu, Yuting
    Yao, Le
    Ge, Zhiqiang
    COLORATION TECHNOLOGY, 2023, 139 (03) : 223 - 238
  • [35] RUL Prediction Using a Fusion of Attention-Based Convolutional Variational AutoEncoder and Ensemble Learning Classifier
    Remadna, Ikram
    Terrissa, Labib Sadek
    Al Masry, Zeina
    Zerhouni, Noureddine
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (01) : 106 - 124
  • [36] Socially-Aware Navigation Planner Using Models of Human-Human Interaction
    Sebastian, Meera
    Banisetty, Santosh Balajee
    Feil-Seifer, David
    2017 26TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2017, : 405 - 410
  • [37] Predicting Human Mobility via Variational Attention
    Gao, Qiang
    Zhou, Fan
    Trajcevski, Goce
    Zhang, Kunpeng
    Zhong, Ting
    Zhang, Fengli
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2750 - 2756
  • [38] On Attention Models for Human Activity Recognition
    Murahari, Vishvak S.
    Plotz, Thomas
    ISWC'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2018, : 100 - 103
  • [39] Attention-Augmented Convolutional Autoencoder for Radar-Based Human Activity Recognition
    Campbell, Christopher
    Ahmad, Fauzia
    2020 IEEE INTERNATIONAL RADAR CONFERENCE (RADAR), 2020, : 990 - 995
  • [40] Hierarchical Self Attention Based Autoencoder for Open-Set Human Activity Recognition
    Tonmoy, M. Tanjid Hasan
    Mahmud, Saif
    Rahman, A. K. M. Mahbubur
    Amin, M. Ashraful
    Ali, Amin Ahsan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 351 - 363