Attention-Based Variational Autoencoder Models for Human-Human Interaction Recognition via Generation

被引:1
|
作者
Banerjee, Bonny [1 ,2 ]
Baruah, Murchana [1 ,2 ]
机构
[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA
[2] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA
关键词
embodied AI agent; intent prediction; human-human interaction recognition; human-human interaction generation; attention; perception; proprioception; multimodal; variational autoencoder; recurrent neural network (RNN); long-short term memory (LSTM); FRAMEWORK; EMERGENCE; FUSION;
D O I
10.3390/s24123922
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The remarkable human ability to predict others' intent during physical interactions develops at a very early age and is crucial for development. Intent prediction, defined as the simultaneous recognition and generation of human-human interactions, has many applications such as in assistive robotics, human-robot interaction, video and robotic surveillance, and autonomous driving. However, models for solving the problem are scarce. This paper proposes two attention-based agent models to predict the intent of interacting 3D skeletons by sampling them via a sequence of glimpses. The novelty of these agent models is that they are inherently multimodal, consisting of perceptual and proprioceptive pathways. The action (attention) is driven by the agent's generation error, and not by reinforcement. At each sampling instant, the agent completes the partially observed skeletal motion and infers the interaction class. It learns where and what to sample by minimizing the generation and classification errors. Extensive evaluation of our models is carried out on benchmark datasets and in comparison to a state-of-the-art model for intent prediction, which reveals that classification and generation accuracies of one of the proposed models are comparable to those of the state of the art even though our model contains fewer trainable parameters. The insights gained from our model designs can inform the development of efficient agents, the future of artificial intelligence (AI).
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Modeling human-human interaction with attention-based high-order GCN for trajectory prediction
    Fang, Yanyan
    Jin, Zhiyu
    Cui, Zhenhua
    Yang, Qiaowen
    Xie, Tianyi
    Hu, Bo
    VISUAL COMPUTER, 2022, 38 (07): : 2257 - 2269
  • [2] Adversarial Attention-Based Variational Graph Autoencoder
    Weng, Ziqiang
    Zhang, Weiyu
    Dou, Wei
    IEEE ACCESS, 2020, 8 : 152637 - 152645
  • [3] Lattice generation in attention-based speech recognition models
    Zapotoczny, Michal
    Pietrzak, Piotr
    Lancucki, Adrian
    Chorowski, Jan
    INTERSPEECH 2019, 2019, : 2225 - 2229
  • [4] Speech Emotion Recognition via Generation using an Attention-based Variational Recurrent Neural Network
    Baruah, Murchana
    Banerjee, Bonny
    INTERSPEECH 2022, 2022, : 4710 - 4714
  • [5] CSI-IANet: An Inception Attention Network for Human-Human Interaction Recognition Based on CSI Signal
    Kabir, M. Humayun
    Rahman, M. Hafizur
    Shin, Wonjae
    IEEE ACCESS, 2021, 9 : 166624 - 166638
  • [6] Recognition of Human-Human Interaction using CWDTW
    Subetha, T.
    Chitrakala, S.
    PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016), 2016,
  • [7] HUMAN-HUMAN INTERACTION RECOGNITION BASED ON SPATIAL AND MOTION TREND FEATURE
    Liu, Bangli
    Cai, Haibin
    Ji, Xiaofei
    Liu, Honghai
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4547 - 4551
  • [8] Human-human interaction recognition based on ultra-wideband radar
    Liu, Haiping
    Yang, Ruixia
    Yang, Yang
    Hou, Chunping
    Hu, Zhiqi
    Jiang, Tianli
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (06) : 1181 - 1188
  • [9] Significance of handcrafted features in human activity recognition with attention-based RNN models
    Abraham, Sonia
    James, Rekha K.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (10) : 1151 - 1163
  • [10] Channel Attention-Based Approach with Autoencoder Network for Human Action Recognition in Low-Resolution Frames
    Dastbaravardeh, Elaheh
    Askarpour, Somayeh
    Saberi Anari, Maryam
    Rezaee, Khosro
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024