Radar gait recognition using Dual-branch Swin Transformer with Asymmetric Attention Fusion

被引:0
|
作者
He, Wentao [1 ,2 ]
Ren, Jianfeng [2 ,3 ]
Bai, Ruibin [2 ,3 ]
Jiang, Xudong [4 ]
机构
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315211, Peoples R China
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Digital Port Technol Lab, 199 Taikang East Rd, Ningbo 315100, Peoples R China
[3] Univ Nottingham Ningbo China, Nottingham Ningbo China Beacons Excellence Res & I, 199 Taikang East Rd, Ningbo 315100, Peoples R China
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, 50 Nanyang Ave, Singapore City 639798, Singapore
基金
中国国家自然科学基金;
关键词
Micro-Doppler signature; Radar gait recognition; Spectrogram; Cadence velocity diagram; Asymmetric Attention Fusion; IDENTIFICATION; NETWORKS; IMAGE;
D O I
10.1016/j.patcog.2024.111101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based gait recognition suffers from potential privacy issues and performance degradation due to dim environments, partial occlusions, or camera view changes. Radar has recently become increasingly popular and overcome various challenges presented by vision sensors. To capture tiny differences in radar gait signatures of different people, a dual-branch Swin Transformer is proposed, where one branch captures the time variations of the radar micro-Doppler signature and the other captures the repetitive frequency patterns in the spectrogram. Unlike natural images where objects can be translated, rotated, or scaled, the spatial coordinates of spectrograms and CVDs have unique physical meanings, and there is no affine transformation for radar targets in these synthetic images. The patch splitting mechanism in Vision Transformer makes it ideal to extract discriminant information from patches, and learn the attentive information across patches, as each patch carries some unique physical properties of radar targets. Swin Transformer consists of a set of cascaded Swin blocks to extract semantic features from shallow to deep representations, further improving the classification performance. Lastly, to highlight the branch with larger discriminant power, an Asymmetric Attention Fusion is proposed to optimally fuse the discriminant features from the two branches. To enrich the research on radar gait recognition, a large-scale NTU-RGR dataset is constructed, containing 45,768 radar frames of 98 subjects. The proposed method is evaluated on the NTU-RGR dataset and the MMRGait-1.0 database. It consistently and significantly outperforms all the compared methods on both datasets. The codes are available at: https://github.com/wentaoheunnc/NTU-RGR.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Micro-expression Recognition Based on Dual-Branch Swin Transformer Network
    Xie, Zhihua
    Zhao, Chuwei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 544 - 554
  • [2] A Lightweight Dual-Branch Swin Transformer for Remote Sensing Scene Classification
    Zheng, Fujian
    Lin, Shuai
    Zhou, Wei
    Huang, Hong
    REMOTE SENSING, 2023, 15 (11)
  • [3] Dual-branch network based on transformer for texture recognition
    Liu, Yangqi
    Dong, Hao
    Wang, Guodong
    Chen, Chenglizhao
    DIGITAL SIGNAL PROCESSING, 2024, 153
  • [4] Ship Recognition for Complex SAR Images via Dual-Branch Transformer Fusion Network
    Sun, Zhongzhen
    Leng, Xiangguang
    Zhang, Xianghui
    Xiong, Boli
    Ji, Kefeng
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] DRST-Net: A Dual-Branch Feature Fusion Network Combining ResNet50 and Swin Transformer for Welding Light Strip Recognition
    Lu, Yuan
    Huang, Qingjiu
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [6] DTCA: Dual-Branch Transformer with Cross-Attention for EEG and Eye Movement Data Fusion
    Zhang, Xiaoshan
    Shi, Enze
    Yu, Sigang
    Zhang, Shu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 141 - 151
  • [7] Dual-Branch Cross-Attention Network for Micro-Expression Recognition with Transformer Variants
    Xie, Zhihua
    Zhao, Chuwei
    ELECTRONICS, 2024, 13 (02)
  • [8] DBT: multimodal emotion recognition based on dual-branch transformer
    Yufan Yi
    Yan Tian
    Cong He
    Yajing Fan
    Xinli Hu
    Yiping Xu
    The Journal of Supercomputing, 2023, 79 : 8611 - 8633
  • [9] DBT: multimodal emotion recognition based on dual-branch transformer
    Yi, Yufan
    Tian, Yan
    He, Cong
    Fan, Yajing
    Hu, Xinli
    Xu, Yiping
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (08): : 8611 - 8633
  • [10] DUAL-BRANCH ATTENTION NETWORK AND SWIN SPATIAL PYRAMID POOLING FOR RETINOPATHY OF PREMATURITY CLASSIFICATION
    Zhao, Jia
    Lei, Haijun
    Xie, Hai
    Li, Pingkang
    Liu, Yaling
    Zhang, Guoming
    Lei, Baiying
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,