Automated efficient traffic gesture recognition using swin transformer-based multi-input deep network with radar images

被引:0
|
作者
Firat, Huseyin [1 ]
Uzen, Huseyin [2 ]
Atila, Orhan [3 ]
Sengur, Abdulkadir [4 ]
机构
[1] Dicle Univ, Fac Engn, Dept Comp Engn, Diyarbakir, Turkiye
[2] Bingol Univ, Fac Engn & Architecture, Dept Comp Engn, Bingol, Turkiye
[3] Firat Univ, Technol Fac, Elect Elect Engn Dept, Elazig, Turkiye
[4] Firat Univ, Fac Technol, Dept Elect & Elect Engn, Elazig, Turkiye
关键词
Deep learning; Radar images; Swin transformers; Traffic hand gesture;
D O I
10.1007/s11760-024-03664-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Radar-based artificial intelligence (AI) applications have gained significant attention recently, spanning from fall detection to gesture recognition. The growing interest in this field has led to a shift towards deep convolutional networks, and transformers have emerged to address limitations in convolutional neural network methods, becoming increasingly popular in the AI community. In this paper, we present a novel hybrid approach for radar-based traffic hand gesture classification using transformers. Traffic hand gesture recognition (HGR) holds importance in AI applications, and our proposed three-phase approach addresses the efficiency and effectiveness of traffic HGR. In the initial phase, feature vectors are extracted from input radar images using the pre-trained DenseNet-121 model. These features are then consolidated by concatenating them to gather information from diverse radar sensors, followed by a patch extraction operation. The concatenated features from all inputs are processed in the Swin transformer block to facilitate further HGR. The classification stage involves sequential application of global average pooling, Dense, and Softmax layers. To assess the effectiveness of our method on ULM university radar dataset, we employ various performance metrics, including accuracy, precision, recall, and F1-score, achieving an average accuracy score of 90.54%. We compare this score with existing approaches to demonstrate the competitiveness of our proposed method.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Transformer-based multi-source images instance segmentation network for composite materials
    Ke Y.
    Fu Y.
    Zhou W.
    Zhu W.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2023, 52 (02):
  • [32] Multi-Stream Single Network: Efficient Compressed Video Action Recognition With a Single Multi-Input Multi-Output Network
    Terao, Hayato
    Noguchi, Wataru
    Iizuka, Hiroyuki
    Yamamoto, Masahito
    IEEE ACCESS, 2024, 12 : 20983 - 20997
  • [33] Demo: Efficient Convolutional Neural Network for FMCW Radar Based Hand Gesture Recognition
    Cai, Xiaodong
    Ma, Jingyi
    Liu, Wei
    Han, Hemin
    Ma, Lili
    UBICOMP/ISWC'19 ADJUNCT: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2019 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2019, : 17 - 20
  • [34] M-Swin: Transformer-Based Multiscale Feature Fusion Change Detection Network Within Cropland for Remote Sensing Images
    Pan, Jun
    Bai, Yuchuan
    Shu, Qidi
    Zhang, Zhuoer
    Hu, Jiarui
    Wang, Mi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [35] Progressive Guidance Categorization Using Transformer-Based Deep Neural Network Architecture
    Aurpa, Tanjim Taharat
    Ahmed, Md Shoaib
    Sadik, Rifat
    Anwar, Sabbir
    Adnan, Md Abdul Mazid
    Anwar, Md Musfique
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 344 - 353
  • [36] Locational marginal price forecasting using Transformer-based deep learning network
    Liao, Shengyi
    Wang, Zhuo
    Luo, Yao
    Liang, Haiyan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8457 - 8462
  • [37] Multi-input 1-dimensional deep belief network: action and activity recognition as case study
    Ali Mohammad Nickfarjam
    Hossein Ebrahimpour-Komleh
    Multimedia Tools and Applications, 2019, 78 : 17739 - 17761
  • [38] Improved Swin Transformer-Based Semantic Segmentation of Postearthquake Dense Buildings in Urban Areas Using Remote Sensing Images
    Cui, Liangyi
    Jing, Xin
    Wang, Yu
    Huan, Yixuan
    Xu, Yang
    Zhang, Qiangqiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 369 - 385
  • [39] Multi-input 1-dimensional deep belief network: action and activity recognition as case study
    Nickfarjam, Ali Mohammad
    Ebrahimpour-Komleh, Hossein
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 17739 - 17761
  • [40] Radar-Based Gesture Recognition System using Spiking Neural Network
    Arsalan, Muhammad
    Santra, Avik
    Chmurski, Mateusz
    El-Masry, Moamen
    Mauro, Gianfranco
    Issakov, Vadim
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,