TransPose Re-ID: transformers for pose invariant person Re-identification

被引:0
|
作者
Perwaiz, Nazia [1 ]
Shahzad, Muhammad [1 ,2 ]
Fraz, Muhammad Moazam [1 ]
机构
[1] Natl Univ Sci & Technol NUST, Islamabad, Pakistan
[2] Tech Univ Munich, Data Sci Earth Observat, Munich, Germany
关键词
Person re-identification; image patches; transformer; Self attention; Self context mapping; NEURAL-NETWORK;
D O I
10.1080/0952813X.2023.2214570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is a computer vision task that involves recognizing and tracking individuals across multiple non-overlapping cameras or over time within the same camera view. It is particularly important in surveillance systems, where it can help in identifying potential threats or tracking suspects. Convolutional neural networks (CNNs) have been used to extract invariant person representation for this challenging task. However, CNNs do not consider global dependencies in their initial layers, causing some vital information to be lost during the convolution process. The development of vision-based transformers has opened up new research avenues for person re-identification. This work proposes a purely transformer-based solution, called TansPose Re-ID, that learns pose-invariant person representations. The proposed system uses a vision transformer baseline and enhances its architecture by introducing multiple streams to learn global and local dependencies as well as pose invariance in person images. The architecture includes a Global Self-Attention Module (GSM) and a Local Self-Attention Module (LSM) that jointly learn global and local patch-based person embeddings. The LSM is further improved by stochastically grouping local patches and aligning them. Additionally, an attention feature learning module (AFLM) is introduced in the LSM to handle pose and viewpoint variations. The proposed method is evaluated on two public Re-ID benchmarks, Market1501 and DukeMTMC-ReID, and demonstrates superior performance compared to existing transformer baselines.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Person Re-Identification with Discriminatively Trained Viewpoint Invariant Dictionaries
    Karanam, Srikrishna
    Li, Yang
    Radke, Richard J.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4516 - 4524
  • [42] SCALE-INVARIANT SIAMESE NETWORK FOR PERSON RE-IDENTIFICATION
    Zhang, Yunzhou
    Shi, Weidong
    Liu, Shuangwei
    Bao, Jining
    Wei, Ying
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2436 - 2440
  • [43] Apparel-Invariant Feature Learning for Person Re-Identification
    Yu, Zhengxu
    Zhao, Yilun
    Hong, Bin
    Jin, Zhongming
    Huang, Jianqiang
    Cai, Deng
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4482 - 4492
  • [44] Unsupervised learning of visual invariant features for person re-identification
    Xia, Daoxun
    Guo, Fang
    Liu, Haojie
    Yu, Sheng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7495 - 7503
  • [45] Learning Camera-Invariant Representation for Person Re-identification
    Qin, Shizheng
    Gu, Kangzheng
    Wang, Lecheng
    Qi, Lizhe
    Zhang, Wenqiang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 125 - 137
  • [46] Learning Domain Invariant Representations for Generalizable Person Re-Identification
    Zhang, Yi-Fan
    Zhang, Zhang
    Li, Da
    Jia, Zhen
    Wang, Liang
    Tan, Tieniu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 509 - 523
  • [47] Enhanced keypoint information and pose-weighted re-ID features for multi-person pose estimation and tracking
    Wang, Xiangyang
    Pei, Tao
    Wang, Rui
    MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [48] Re-Ranking For Person Re-Identification
    Vu-Hoang Nguyen
    Thanh Duc Ngo
    Nguyen, Khang M. T. T.
    Duc Anh Duong
    Kien Nguyen
    Duy-Dinh Le
    2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 304 - 308
  • [49] ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification
    Yildiz, Serdar
    Kasim, Ahmet Nezih
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [50] Pose-driven Deep Convolutional Model for Person Re-identification
    Su, Chi
    Li, Jianing
    Zhang, Shiliang
    Xing, Junliang
    Gao, Wen
    Tian, Qi
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3980 - 3989