TransPose Re-ID: transformers for pose invariant person Re-identification

被引:0
|
作者
Perwaiz, Nazia [1 ]
Shahzad, Muhammad [1 ,2 ]
Fraz, Muhammad Moazam [1 ]
机构
[1] Natl Univ Sci & Technol NUST, Islamabad, Pakistan
[2] Tech Univ Munich, Data Sci Earth Observat, Munich, Germany
关键词
Person re-identification; image patches; transformer; Self attention; Self context mapping; NEURAL-NETWORK;
D O I
10.1080/0952813X.2023.2214570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is a computer vision task that involves recognizing and tracking individuals across multiple non-overlapping cameras or over time within the same camera view. It is particularly important in surveillance systems, where it can help in identifying potential threats or tracking suspects. Convolutional neural networks (CNNs) have been used to extract invariant person representation for this challenging task. However, CNNs do not consider global dependencies in their initial layers, causing some vital information to be lost during the convolution process. The development of vision-based transformers has opened up new research avenues for person re-identification. This work proposes a purely transformer-based solution, called TansPose Re-ID, that learns pose-invariant person representations. The proposed system uses a vision transformer baseline and enhances its architecture by introducing multiple streams to learn global and local dependencies as well as pose invariance in person images. The architecture includes a Global Self-Attention Module (GSM) and a Local Self-Attention Module (LSM) that jointly learn global and local patch-based person embeddings. The LSM is further improved by stochastically grouping local patches and aligning them. Additionally, an attention feature learning module (AFLM) is introduced in the LSM to handle pose and viewpoint variations. The proposed method is evaluated on two public Re-ID benchmarks, Market1501 and DukeMTMC-ReID, and demonstrates superior performance compared to existing transformer baselines.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] HPAN: A Hybrid Pose Attention Network for Person Re-Identification
    Huan, Ruohong
    Chen, Tianya
    Zhan, Ziwei
    Chen, Peng
    Liang, Ronghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 198 - 211
  • [32] Person Re-Identification Combined with Style Transfer and Pose Generation
    Hui, Yan
    Liang, Yingyu
    Hu, Xiuhua
    Wu, Xi
    Liu, Huan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (02)
  • [33] Person Re-identification with pose variation aware data augmentation
    Lei Zhang
    Na Jiang
    Qishuai Diao
    Zhong Zhou
    Wei Wu
    Neural Computing and Applications, 2022, 34 : 11817 - 11830
  • [34] Pose-Guided Representation Learning for Person Re-Identification
    Li, Jianing
    Zhang, Shiliang
    Tian, Qi
    Wang, Meng
    Gao, Wen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 622 - 635
  • [35] Person Re-identification with pose variation aware data augmentation
    Zhang, Lei
    Jiang, Na
    Diao, Qishuai
    Zhou, Zhong
    Wu, Wei
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (14): : 11817 - 11830
  • [36] Re-ID-leak: Membership Inference Attacks Against Person Re-identification
    Gao, Junyao
    Jiang, Xinyang
    Dou, Shuguang
    Li, Dongsheng
    Miao, Duoqian
    Zhao, Cairong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4673 - 4687
  • [37] Pose matters: Pose guided graph attention network for person re-identification
    Zhijun HE
    Hongbo ZHAO
    Jianrong WANG
    Wenquan FENG
    Chinese Journal of Aeronautics , 2023, (05) : 447 - 464
  • [38] Pose matters: Pose guided graph attention network for person re-identification
    Zhijun HE
    Hongbo ZHAO
    Jianrong WANG
    Wenquan FENG
    Chinese Journal of Aeronautics, 2023, 36 (05) : 447 - 464
  • [39] Pose matters: Pose guided graph attention network for person re-identification
    He, Zhijun
    Zhao, Hongbo
    Wang, Jianrong
    Feng, Wenquan
    CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (05) : 447 - 464
  • [40] Camera Invariant Feature Learning for Unsupervised Person Re-Identification
    Pang, Zhiqi
    Zhao, Lingling
    Liu, Qiuyang
    Wang, Chunyu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6171 - 6182