Frame-Guided Region-Aligned Representation for Video Person Re-Identification

Authors
Chen, Zengqun [1]
Zhou, Zhiheng [1]
Huang, Junchu [1]
Zhang, Pengyu [1]
Li, Bo [1]
Affiliation
[1] South China Univ Technol, Guangzhou, Peoples R China
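Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract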
Pedestrians in videos are usually moving, which causes severe spatial misalignment, such as scale variations and pose changes, and makes video-based person re-identification more challenging. To address this issue, we propose a Frame-Guided Region-Aligned model (FGRA) that learns discriminative representations in two steps within an end-to-end framework. First, a novel alignment mechanism, built on a frame-guided feature learning strategy and a non-parametric alignment module, extracts well-aligned region features. Second, to form a sequence representation, a feature aggregation strategy fuses the region features along the temporal dimension using temporal alignment scores and along the spatial dimension using spatial attention. Experiments on benchmark datasets demonstrate that the proposed method effectively addresses the misalignment problem and outperforms existing video-based person re-identification methods.
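
The two steps named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract gives no formulas, so everything here is an assumption made for illustration. The function names (`align_regions`, `fuse_over_time`), cosine similarity as the matching measure, softmax pooling as the parameter-free alignment, and the maximum similarity as the "alignment score" are all hypothetical. The spatial-attention step is left out because the abstract gives no detail on it.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors (lists of floats)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def align_regions(guide_regions, frame_locations):
    """Assumed parameter-free alignment: for each region of the guiding
    frame, pool the other frame's local features with softmax weights
    derived from cosine similarity. Returns (aligned_features, scores),
    where each score is the best similarity found for that region."""
    aligned, scores = [], []
    for g in guide_regions:
        sims = [cosine(g, loc) for loc in frame_locations]
        w = softmax(sims)
        feat = [sum(w[i] * frame_locations[i][d] for i in range(len(w)))
                for d in range(len(g))]
        aligned.append(feat)
        scores.append(max(sims))
    return aligned, scores


def fuse_over_time(aligned_per_frame, scores_per_frame):
    """Assumed temporal fusion: per region, average the aligned features
    over frames, weighted by a softmax of the alignment scores."""
    num_frames = len(aligned_per_frame)
    num_regions = len(aligned_per_frame[0])
    dim = len(aligned_per_frame[0][0])
    fused = []
    for r in range(num_regions):
        w = softmax([scores_per_frame[t][r] for t in range(num_frames)])
        fused.append([sum(w[t] * aligned_per_frame[t][r][d]
                          for t in range(num_frames))
                      for d in range(dim)])
    return fused
```

In this sketch, two frames that contain the same local features in a different spatial order produce the same aligned region features, which is the behaviour an alignment step is meant to provide under these assumptions.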
Pages: 10591-10598
Number of pages: 8