DDC3N: Doppler-Driven Convolutional 3D Network for Human Action Recognition

被引：2

作者：

Toshpulatov, Mukhiddin ^{[1
]}

Lee, Wookey ^{[1
]}

Lee, Suan ^{[2
]}

Yoon, Hoyoung ^{[3
]}

Kang, U. Kang ^{[3
]}

机构：

[1] Inha Univ, Biomed Sci & Engn, Incheon 22212, South Korea

[2] Semyung Univ, Sch Comp Sci, Jecheon 27136, South Korea

[3] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul 08826, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

3D pose estimation; discriminator; deep neural network; deep learning; generator; mesh estimation; metadata; skeleton; top-down approach; motion embedding; optical flow map; channel-wise; spatiotemporal; doppler; dataset; action recognition; 2D;

D O I：

10.1109/ACCESS.2024.3422428

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In deep learning (DL)-based human action recognition (HAR), considerable strides have been undertaken. Nevertheless, the precise classification of sports athletes' actions still needs to be completed. Primarily attributable to the exigency for exhaustive datasets about sports athletes' actions and the enduring quandaries imposed by variable camera perspectives, mercurial lighting conditions, and occlusions. This investigative endeavor thoroughly examines extant HAR datasets, furnishing a yardstick for gauging the efficacy of cutting-edge methodologies. In light of the paucity of accessible datasets delineating athlete actions, we have taken a proactive stance, endeavoring to curate two meticulously datasets tailored explicitly for sports athletes, subsequently scrutinizing their consequential impact on performance enhancement. While the superiority of 3D convolutional neural networks (3DCNN) over graph convolutional networks (GCN) in HAR is evident, it must be acknowledged that they entail a considerable computational overhead, particularly when confronted with voluminous datasets. Our inquiry introduces innovative methodologies and a more resource-efficient remedy for HAR, thereby alleviating the computational strain on the 3DCNN architecture. Consequently, it proffers a multifaceted approach towards augmenting HAR within the purview of surveillance cameras, bridging lacunae, surmounting computational impediments, and effectuating significant strides in the accuracy and efficacy of HAR frameworks.

引用

页码：93546 / 93567

页数：22

共 50 条

[31] 3D convolutional neural network for object recognition: a review
Singh, Rahul Dev
Mittal, Ajay
Bhatia, Rajesh K.
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 15951 - 15995
[32] 3D Convolutional Spiking Neural Network for Human Action Recognition Using Modulating STDP With Global Error Feedback
Nawarathne, Thoshara
Leung, Henry
18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
[33] Action recognition with motion map 3D network
Sun, Yuchao
Wu, Xinxiao
Yu, Wennan
Yu, Feiwu
NEUROCOMPUTING, 2018, 297 : 33 - 39
[34] 3D Contextual Transformer & Double Inception Network for Human Action Recognition
Liu, Enqi
Hirota, Kaoru
Liu, Chang
Dai, Yaping
Proceedings of the 35th Chinese Control and Decision Conference, CCDC 2023, 2023, : 1795 - 1800
[35] 3D Contextual Transformer & Double Inception Network for Human Action Recognition
Liu, Enqi
Hirota, Kaoru
Liu, Chang
Dai, Yaping
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1795 - 1800
[36] An efficient 3D convolutional neural network with informative 3D volumes for human activity recognition using wearable sensors‏
Saeedeh Zebhi
Multimedia Tools and Applications, 2024, 83 : 42233 - 42256
[37] An efficient 3D convolutional neural network with informative 3D volumes for human activity recognition using wearable sensors
Zebhi, Saeedeh
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 42233 - 42256
[38] Trajectory-Pooled 3D Convolutional Descriptors for Action Recognition
Lu, Xiusheng
Yao, Hongxun
Sun, Xiaoshuai
Zhang, Shengping
Zhang, Yanhao
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 247 - 257
[39] Feature Fusion for Human Action Recognition based on Classical Descriptors and 3D convolutional networks
Qin, Yang
Mo, Lingfei
Xie, Benyi
2017 ELEVENTH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST), 2017, : 487 - 491
[40] 3D GLOH Features for Human Action Recognition
Abdulmunem, Ashwan
Lai, Yu-Kun
Sun, Xianfang
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 805 - 810

← 1 2 3 4 5 →