Attention-Based Variational Autoencoder Models for Human-Human Interaction Recognition via Generation

被引：1

作者：

Banerjee, Bonny ^{[1
,2
]}

Baruah, Murchana ^{[1
,2
]}

机构：

[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA

[2] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA

来源：

SENSORS | 2024年 / 24卷 / 12期

关键词：

embodied AI agent; intent prediction; human-human interaction recognition; human-human interaction generation; attention; perception; proprioception; multimodal; variational autoencoder; recurrent neural network (RNN); long-short term memory (LSTM); FRAMEWORK; EMERGENCE; FUSION;

D O I：

10.3390/s24123922

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The remarkable human ability to predict others' intent during physical interactions develops at a very early age and is crucial for development. Intent prediction, defined as the simultaneous recognition and generation of human-human interactions, has many applications such as in assistive robotics, human-robot interaction, video and robotic surveillance, and autonomous driving. However, models for solving the problem are scarce. This paper proposes two attention-based agent models to predict the intent of interacting 3D skeletons by sampling them via a sequence of glimpses. The novelty of these agent models is that they are inherently multimodal, consisting of perceptual and proprioceptive pathways. The action (attention) is driven by the agent's generation error, and not by reinforcement. At each sampling instant, the agent completes the partially observed skeletal motion and infers the interaction class. It learns where and what to sample by minimizing the generation and classification errors. Extensive evaluation of our models is carried out on benchmark datasets and in comparison to a state-of-the-art model for intent prediction, which reveals that classification and generation accuracies of one of the proposed models are comparable to those of the state of the art even though our model contains fewer trainable parameters. The insights gained from our model designs can inform the development of efficient agents, the future of artificial intelligence (AI).

引用

页数：35

共 50 条

[1] Modeling human-human interaction with attention-based high-order GCN for trajectory prediction
Fang, Yanyan
Jin, Zhiyu
Cui, Zhenhua
Yang, Qiaowen
Xie, Tianyi
Hu, Bo
VISUAL COMPUTER, 2022, 38 (07): : 2257 - 2269
[2] Adversarial Attention-Based Variational Graph Autoencoder
Weng, Ziqiang
Zhang, Weiyu
Dou, Wei
IEEE ACCESS, 2020, 8 : 152637 - 152645
[3] Lattice generation in attention-based speech recognition models
Zapotoczny, Michal
Pietrzak, Piotr
Lancucki, Adrian
Chorowski, Jan
INTERSPEECH 2019, 2019, : 2225 - 2229
[4] Speech Emotion Recognition via Generation using an Attention-based Variational Recurrent Neural Network
Baruah, Murchana
Banerjee, Bonny
INTERSPEECH 2022, 2022, : 4710 - 4714
[5] CSI-IANet: An Inception Attention Network for Human-Human Interaction Recognition Based on CSI Signal
Kabir, M. Humayun
Rahman, M. Hafizur
Shin, Wonjae
IEEE ACCESS, 2021, 9 : 166624 - 166638
[6] Recognition of Human-Human Interaction using CWDTW
Subetha, T.
Chitrakala, S.
PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016), 2016,
[7] HUMAN-HUMAN INTERACTION RECOGNITION BASED ON SPATIAL AND MOTION TREND FEATURE
Liu, Bangli
Cai, Haibin
Ji, Xiaofei
Liu, Honghai
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4547 - 4551
[8] Human-human interaction recognition based on ultra-wideband radar
Liu, Haiping
Yang, Ruixia
Yang, Yang
Hou, Chunping
Hu, Zhiqi
Jiang, Tianli
SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (06) : 1181 - 1188
[9] Significance of handcrafted features in human activity recognition with attention-based RNN models
Abraham, Sonia
James, Rekha K.
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (10) : 1151 - 1163
[10] Channel Attention-Based Approach with Autoencoder Network for Human Action Recognition in Low-Resolution Frames
Dastbaravardeh, Elaheh
Askarpour, Somayeh
Saberi Anari, Maryam
Rezaee, Khosro
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024

← 1 2 3 4 5 →