DDC3N: Doppler-Driven Convolutional 3D Network for Human Action Recognition

Cited by: 2
Authors
Toshpulatov, Mukhiddin [1]
Lee, Wookey [1]
Lee, Suan [2]
Yoon, Hoyoung [3]
Kang, U. Kang [3]
Affiliations
[1] Inha Univ, Biomed Sci & Engn, Incheon 22212, South Korea
[2] Semyung Univ, Sch Comp Sci, Jecheon 27136, South Korea
[3] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul 08826, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
3D pose estimation; discriminator; deep neural network; deep learning; generator; mesh estimation; metadata; skeleton; top-down approach; motion embedding; optical flow map; channel-wise; spatiotemporal; doppler; dataset; action recognition; 2D
DOI
10.1109/ACCESS.2024.3422428
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Deep learning (DL)-based human action recognition (HAR) has made considerable strides. Nevertheless, the precise classification of sports athletes' actions remains an open problem, primarily because exhaustive datasets of athletes' actions are lacking and because variable camera perspectives, changing lighting conditions, and occlusions pose persistent difficulties. This study thoroughly examines existing HAR datasets, providing a benchmark for gauging the efficacy of state-of-the-art methods. Given the scarcity of accessible datasets covering athlete actions, we curate two meticulously prepared datasets tailored explicitly to sports athletes and examine their impact on performance. While 3D convolutional neural networks (3DCNN) outperform graph convolutional networks (GCN) in HAR, they entail considerable computational overhead, particularly on large datasets. We introduce new methods and a more resource-efficient solution for HAR that alleviates the computational strain on the 3DCNN architecture. The result is a multifaceted approach to improving HAR for surveillance cameras that bridges dataset gaps, overcomes computational obstacles, and achieves significant gains in the accuracy and efficiency of HAR frameworks.
Pages: 93546 - 93567
Page count: 22