TransPose Re-ID: transformers for pose invariant person Re-identification

Cited by: 0

Authors:

Perwaiz, Nazia [1]

Shahzad, Muhammad [1,2]

Fraz, Muhammad Moazam [1]

Affiliations:

[1] Natl Univ Sci & Technol NUST, Islamabad, Pakistan

[2] Tech Univ Munich, Data Sci Earth Observat, Munich, Germany

Source:

JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE | 2023

Keywords:

Person re-identification; image patches; transformer; Self attention; Self context mapping; NEURAL-NETWORK;

DOI:

10.1080/0952813X.2023.2214570

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Person re-identification (Re-ID) is a computer vision task that involves recognizing and tracking individuals across multiple non-overlapping cameras or over time within the same camera view. It is particularly important in surveillance systems, where it can help in identifying potential threats or tracking suspects. Convolutional neural networks (CNNs) have been used to extract invariant person representations for this challenging task. However, CNNs do not consider global dependencies in their initial layers, causing some vital information to be lost during the convolution process. The development of vision-based transformers has opened up new research avenues for person re-identification. This work proposes a purely transformer-based solution, called TransPose Re-ID, that learns pose-invariant person representations. The proposed system uses a vision transformer baseline and enhances its architecture by introducing multiple streams to learn global and local dependencies as well as pose invariance in person images. The architecture includes a Global Self-Attention Module (GSM) and a Local Self-Attention Module (LSM) that jointly learn global and local patch-based person embeddings. The LSM is further improved by stochastically grouping local patches and aligning them. Additionally, an attention feature learning module (AFLM) is introduced in the LSM to handle pose and viewpoint variations. The proposed method is evaluated on two public Re-ID benchmarks, Market1501 and DukeMTMC-ReID, and demonstrates superior performance compared to existing transformer baselines.
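
The abstract does not specify how local patches are grouped, so the following is only a minimal sketch of one possible reading of "stochastically grouping local patches": an image is cut into non-overlapping patches, and the patch indices are shuffled and dealt into equally sized groups. The function names, the 256 x 128 input size, the 16-pixel patch size, and the choice of four groups are all assumptions made for illustration and are not taken from the paper.

```python
import random


def split_into_patches(height, width, patch):
    """Return the (row, col) top-left corner of each non-overlapping patch."""
    return [
        (r, c)
        for r in range(0, height - patch + 1, patch)
        for c in range(0, width - patch + 1, patch)
    ]


def stochastic_groups(num_patches, num_groups, seed=None):
    """Shuffle patch indices and deal them round-robin into num_groups groups.

    Hypothetical helper: the paper's actual grouping rule is not given
    in the abstract.
    """
    rng = random.Random(seed)
    indices = list(range(num_patches))
    rng.shuffle(indices)
    return [sorted(indices[g::num_groups]) for g in range(num_groups)]


# Assumed input size (256 x 128) and patch size (16): 16 x 8 = 128 patches.
patches = split_into_patches(256, 128, 16)
groups = stochastic_groups(len(patches), num_groups=4, seed=0)
```

In a transformer setting, each group of indices would select the patch tokens passed to a local attention stream, while the full token sequence would feed the global stream. That wiring is likewise an assumption here.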

Pages: 14
