Attention-Based Variational Autoencoder Models for Human-Human Interaction Recognition via Generation

被引：1

作者：

Banerjee, Bonny ^{[1
,2
]}

Baruah, Murchana ^{[1
,2
]}

机构：

[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA

[2] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA

来源：

SENSORS | 2024年 / 24卷 / 12期

关键词：

embodied AI agent; intent prediction; human-human interaction recognition; human-human interaction generation; attention; perception; proprioception; multimodal; variational autoencoder; recurrent neural network (RNN); long-short term memory (LSTM); FRAMEWORK; EMERGENCE; FUSION;

D O I：

10.3390/s24123922

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The remarkable human ability to predict others' intent during physical interactions develops at a very early age and is crucial for development. Intent prediction, defined as the simultaneous recognition and generation of human-human interactions, has many applications such as in assistive robotics, human-robot interaction, video and robotic surveillance, and autonomous driving. However, models for solving the problem are scarce. This paper proposes two attention-based agent models to predict the intent of interacting 3D skeletons by sampling them via a sequence of glimpses. The novelty of these agent models is that they are inherently multimodal, consisting of perceptual and proprioceptive pathways. The action (attention) is driven by the agent's generation error, and not by reinforcement. At each sampling instant, the agent completes the partially observed skeletal motion and infers the interaction class. It learns where and what to sample by minimizing the generation and classification errors. Extensive evaluation of our models is carried out on benchmark datasets and in comparison to a state-of-the-art model for intent prediction, which reveals that classification and generation accuracies of one of the proposed models are comparable to those of the state of the art even though our model contains fewer trainable parameters. The insights gained from our model designs can inform the development of efficient agents, the future of artificial intelligence (AI).

引用

页数：35

共 50 条

[31] STA-HAR: A Spatiotemporal Attention-Based Framework for Human Activity Recognition
Khaliluzzaman, Md.
Furquan, Md.
Khan, Mohammod Sazid Zaman
Hoque, Md. Jiabul
APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2024, 2024
[32] TAHAR: A Transferable Attention-Based Adversarial Network for Human Activity Recognition with RFID
Chen, Dinghao
Yang, Lvqing
Cao, Hua
Wang, Qingkai
Dong, Wensheng
Yu, Bo
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 247 - 259
[33] Attention-Based Deep Learning Framework for Human Activity Recognition With User Adaptation
Buffelli, Davide
Vandin, Fabio
IEEE SENSORS JOURNAL, 2021, 21 (12) : 13474 - 13483
[34] Attention-based vector quantisation variational autoencoder for colour-patterned fabrics defect detection
Zhang, Hongwei
Qiao, Guanhua
Liu, Shuting
Lyu, Yuting
Yao, Le
Ge, Zhiqiang
COLORATION TECHNOLOGY, 2023, 139 (03) : 223 - 238
[35] RUL Prediction Using a Fusion of Attention-Based Convolutional Variational AutoEncoder and Ensemble Learning Classifier
Remadna, Ikram
Terrissa, Labib Sadek
Al Masry, Zeina
Zerhouni, Noureddine
IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (01) : 106 - 124
[36] Socially-Aware Navigation Planner Using Models of Human-Human Interaction
Sebastian, Meera
Banisetty, Santosh Balajee
Feil-Seifer, David
2017 26TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2017, : 405 - 410
[37] Predicting Human Mobility via Variational Attention
Gao, Qiang
Zhou, Fan
Trajcevski, Goce
Zhang, Kunpeng
Zhong, Ting
Zhang, Fengli
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2750 - 2756
[38] On Attention Models for Human Activity Recognition
Murahari, Vishvak S.
Plotz, Thomas
ISWC'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2018, : 100 - 103
[39] Attention-Augmented Convolutional Autoencoder for Radar-Based Human Activity Recognition
Campbell, Christopher
Ahmad, Fauzia
2020 IEEE INTERNATIONAL RADAR CONFERENCE (RADAR), 2020, : 990 - 995
[40] Hierarchical Self Attention Based Autoencoder for Open-Set Human Activity Recognition
Tonmoy, M. Tanjid Hasan
Mahmud, Saif
Rahman, A. K. M. Mahbubur
Amin, M. Ashraful
Ali, Amin Ahsan
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 351 - 363

← 1 2 3 4 5 →