TransPose Re-ID: transformers for pose invariant person Re-identification

被引:0
|
作者
Perwaiz, Nazia [1 ]
Shahzad, Muhammad [1 ,2 ]
Fraz, Muhammad Moazam [1 ]
机构
[1] Natl Univ Sci & Technol NUST, Islamabad, Pakistan
[2] Tech Univ Munich, Data Sci Earth Observat, Munich, Germany
关键词
Person re-identification; image patches; transformer; Self attention; Self context mapping; NEURAL-NETWORK;
D O I
10.1080/0952813X.2023.2214570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is a computer vision task that involves recognizing and tracking individuals across multiple non-overlapping cameras or over time within the same camera view. It is particularly important in surveillance systems, where it can help in identifying potential threats or tracking suspects. Convolutional neural networks (CNNs) have been used to extract invariant person representation for this challenging task. However, CNNs do not consider global dependencies in their initial layers, causing some vital information to be lost during the convolution process. The development of vision-based transformers has opened up new research avenues for person re-identification. This work proposes a purely transformer-based solution, called TansPose Re-ID, that learns pose-invariant person representations. The proposed system uses a vision transformer baseline and enhances its architecture by introducing multiple streams to learn global and local dependencies as well as pose invariance in person images. The architecture includes a Global Self-Attention Module (GSM) and a Local Self-Attention Module (LSM) that jointly learn global and local patch-based person embeddings. The LSM is further improved by stochastically grouping local patches and aligning them. Additionally, an attention feature learning module (AFLM) is introduced in the LSM to handle pose and viewpoint variations. The proposed method is evaluated on two public Re-ID benchmarks, Market1501 and DukeMTMC-ReID, and demonstrates superior performance compared to existing transformer baselines.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Sparse Re-Id: Block Sparsity for Person Re-Identification
    Karanam, Srikrishna
    Li, Yang
    Radke, Richard J.
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [2] Open Set Person Re-identification Framework on Closed Set Re-Id Systems
    Vidanapathirana, Madhawa
    Sudasingha, Imesha
    Kanchana, Pasindu
    Vidanapathirana, Jayan
    Perera, Indika
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 66 - 71
  • [3] TrADe Re-ID - Live Person Re-Identification using Tracking and Anomaly Detection
    Machaca, Luigy
    Oliver Sumari H, F.
    Huaman, Jose
    Clua, Esteban
    Guerin, Joris
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 449 - 454
  • [4] Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification
    Xu, Boqiang
    He, Lingxiao
    Liao, Xingyu
    Liu, Wu
    Sun, Zhenan
    Mei, Tao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 673 - 681
  • [5] Pose-Invariant Embedding for Deep Person Re-Identification
    Zheng, Liang
    Huang, Yujia
    Lu, Huchuan
    Yang, Yi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4500 - 4509
  • [6] Pose Transferrable Person Re-Identification
    Liu, Jinxian
    Ni, Bingbing
    Yan, Yichao
    Zhou, Peng
    Cheng, Shuo
    Hu, Jianguo
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4099 - 4108
  • [7] Person re-identification by pose priors
    Bak, Slawomir
    Martins, Filipe
    Bremond, Francois
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS XIII, 2015, 9399
  • [8] Viewpoint Invariant Person Re-identification with Pose and Weighted Local Features
    Chen, Chun-Huei
    Chen, Ju-Chin
    Lin, Kawuu W.
    MODERN APPROACHES FOR INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2018, 769 : 387 - 396
  • [9] Event-driven Re-Id: A New Benchmark and Method Towards Privacy-Preserving Person Re-Identification
    Ahmad, Shafiq
    Scarpellini, Gianluca
    Morerio, Pietro
    Del Bue, Alessio
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 459 - 468
  • [10] Ubiquitous vision of transformers for person re-identification
    Perwaiz, N.
    Shahzad, M.
    Fraz, M. M.
    MACHINE VISION AND APPLICATIONS, 2023, 34 (02)