Frame-Guided Region-Aligned Representation for Video Person Re-Identification

被引:0
|
作者
Chen, Zengqun [1 ]
Zhou, Zhiheng [1 ]
Huang, Junchu [1 ]
Zhang, Pengyu [1 ]
Li, Bo [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrians in videos are usually in a moving state, resulting in serious spatial misalignment like scale variations and pose changes, which makes the video-based person re-identification problem more challenging. To address the above issue, in this paper, we propose a Frame-Guided Region-Aligned model (FGRA) for discriminative representation learning in two steps in an end-to-end manner. Firstly, based on a frame-guided feature learning strategy and a non-parametric alignment module, a novel alignment mechanism is proposed to extract well-aligned region features. Secondly, in order to form a sequence representation, an effective feature aggregation strategy that utilizes temporal alignment score and spatial attention is adopted to fuse region features in the temporal and spatial dimensions, respectively. Experiments are conducted on benchmark datasets to demonstrate the effectiveness of the proposed method to solve the misalignment problem and the superiority of the proposed method to the existing video-based person re-identification methods.
引用
收藏
页码:10591 / 10598
页数:8
相关论文
共 50 条
  • [31] Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
    Aich, Abhishek
    Zheng, Meng
    Karanam, Srikrishna
    Chen, Terrence
    Roy-Chowdhury, Amit K.
    Wu, Ziyan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 152 - 162
  • [32] Local and global aligned spatiotemporal attention network for video-based person re-identification
    Li Cheng
    Xiao-Yuan Jing
    Xiaoke Zhu
    Chang-Hui Hu
    Guangwei Gao
    Songsong Wu
    Multimedia Tools and Applications, 2020, 79 : 34489 - 34512
  • [33] Local and global aligned spatiotemporal attention network for video-based person re-identification
    Cheng, Li
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Hu, Chang-Hui
    Gao, Guangwei
    Wu, Songsong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34489 - 34512
  • [34] Multitask Multigranularity Aggregation With Global-Guided Attention for Video Person Re-Identification
    Sun, Dengdi
    Huang, Jiale
    Hu, Lei
    Tang, Jin
    Ding, Zhuanlian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7758 - 7771
  • [35] Pose-guided spatiotemporal alignment for video-based person Re-identification
    Gao, Changxin
    Chen, Yang
    Yu, Jin-Gang
    Sang, Nong
    INFORMATION SCIENCES, 2020, 527 : 176 - 190
  • [36] Multilevel deep representation fusion for person re-identification
    Zhao, Yu
    Fu, Keren
    Shu, Qiaoyuan
    Wei, Pengcheng
    Shi, Xi
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (02)
  • [37] Learning hybrid ranking representation for person re-identification
    Wu, Guile
    Zhu, Xiatian
    Gong, Shaogang
    PATTERN RECOGNITION, 2022, 121
  • [38] AAformer: Auto-Aligned Transformer for Person Re-Identification
    Zhu, Kuan
    Guo, Haiyun
    Zhang, Shiliang
    Wang, Yaowei
    Liu, Jing
    Wang, Jinqiao
    Tang, Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 17307 - 17317
  • [39] Improving Person Re-identification with Semantically Aligned Appearance Transformer
    Li, Hui
    Zheng, Yinglin
    Tan, Zhaodong
    Deng, Wenjin
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [40] An Enhanced Deep Feature Representation for Person Re-identification
    Wu, Shangxuan
    Chen, Ying-Cong
    Li, Xiang
    Wu, An-Cong
    You, Jin-Jie
    Zheng, Wei-Shi
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,