Temporal Correlation Vision Transformer for Video Person Re-Identification

被引:0
|
作者
Wu, Pengfei [1 ,2 ]
Wang, Le [1 ,2 ]
Zhou, Sanping [1 ,2 ]
Hua, Gang [4 ]
Sun, Changyin [3 ]
机构
[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intel, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
[3] Anhui Univ, Sch Artificial Intelligence, Hefei, Peoples R China
[4] Wormpex AI Res, Bellevue, WA USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video Person Re-Identification (Re-ID) is a task of retrieving persons from multi-camera surveillance systems. Despite the progress made in leveraging spatio-temporal information in videos, occlusion in dense crowds still hinders further progress. To address this issue, we propose a Temporal Correlation Vision Transformer (TCViT) for video person Re-ID. TCViT consists of a Temporal Correlation Attention (TCA) module and a Learnable Temporal Aggregation (LTA) module. The TCA module is designed to reduce the impact of non-target persons by relative state, while the LTA module is used to aggregate frame-level features based on their completeness. Specifically, TCA is a parameter-free module that first aligns frame-level features to restore semantic coherence in videos and then enhances the features of the target person according to temporal correlation. Additionally, unlike previous methods that treat each frame equally with a pooling layer, LTA introduces a lightweight learnable module to weigh and aggregate frame-level features under the guidance of a classification score. Extensive experiments on four prevalent benchmarks demonstrate that our method achieves state-of-the-art performance in video Re-ID.
引用
收藏
页码:6083 / 6091
页数:9
相关论文
共 50 条
  • [31] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
    Zhang, Wei
    Yu, Xiaodong
    He, Xuanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
  • [32] TEMPORAL REGULARIZED SPATIAL ATTENTION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
    Wang, Xueying
    Zhao, Xu
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2249 - 2253
  • [33] Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification
    Liu, Yiheng
    Yuan, Zhenxun
    Zhou, Wengang
    Li, Houqiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8786 - 8793
  • [34] Temporal Multi-Scale Complementary Feature for Video Person Re-Identification
    Hou R.-B.
    Chang H.
    Ma B.-P.
    Huang R.
    Shan S.-G.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (01): : 31 - 50
  • [35] Video-based Person Re-identification with Spatial and Temporal Memory Networks
    Eom, Chanho
    Lee, Geon
    Lee, Junghyup
    Ham, Bumsub
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12016 - 12025
  • [36] Temporal Model Adaptation for Person Re-identification
    Martinel, Niki
    Das, Abir
    Micheloni, Christian
    Roy-Chowdhury, Amit K.
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 858 - 877
  • [37] Improving Person Re-Identification with Temporal Constraints
    Dietlmeier, Julia
    Hu, Feiyan
    Ryan, Frances
    O'Connor, Noel E.
    McGuinness, Kevin
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 540 - 549
  • [38] Spatial-Temporal Person Re-Identification
    Wang, Guangcong
    Lai, Jianhuang
    Huang, Peigen
    Xie, Xiaohua
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8933 - 8940
  • [39] Feature Completion Transformer for Occluded Person Re-Identification
    Wang, Tao
    Liu, Mengyuan
    Liu, Hong
    Li, Wenhao
    Ban, Miaoju
    Guo, Tianyu
    Li, Yidi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8529 - 8542
  • [40] Person re-identification transformer with patch attention and pruning
    Ndayishimiye, Fabrice
    Yoon, Gang-Joon
    Lee, Joonjae
    Yoon, Sang Min
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 106