A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

被引:35
|
作者
Zhang, Wei [1 ]
He, Xuanyu [1 ]
Yu, Xiaodong [1 ]
Lu, Weizhi [1 ]
Zha, Zhengjun [2 ]
Tian, Qi [3 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
中国国家自然科学基金;
关键词
Video-based person re-id; spatial-temporal attention; multi-scale pooling;
D O I
10.1109/TIP.2019.2959653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel deep neural network based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame in different scales from the perspective of whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.
引用
收藏
页码:3365 / 3373
页数:9
相关论文
共 50 条
  • [31] Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification
    Zhang, Yaobin
    Lv, Jianming
    Liu, Chen
    Cai, Hongmin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3736 - 3744
  • [32] Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification
    Chen, Guangyi
    Lu, Jiwen
    Yang, Ming
    Zhou, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4192 - 4205
  • [33] Person re-identification based on multi-scale feature fusion and multi-attention mechanism
    Jiacheng Pu
    Wei Zou
    Signal, Image and Video Processing, 2024, 18 : 243 - 253
  • [34] Person re-identification based on multi-scale feature fusion and multi-attention mechanism
    Pu, Jiacheng
    Zou, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 243 - 253
  • [35] Multi-Scale Attention Network Based on Multi-Feature Fusion for Person Re-Identification
    Li, Minghao
    Yuan, Liming
    Wen, Xianbin
    Wang, Jianchen
    Xie, Gengsheng
    Jia, Yansong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [36] Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification
    Qian, Xuelin
    Fu, Yanwei
    Xiang, Tao
    Jiang, Yu-Gang
    Xue, Xiangyang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 371 - 385
  • [37] Multi-Scale Body-Part Mask Guided Attention for Person Re-identification
    Cai, Honglong
    Wang, Zhiguan
    Cheng, Jinxing
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1555 - 1564
  • [38] A Multi-Scale Graph Attention-Based Transformer for Occluded Person Re-Identification
    Ma, Ming
    Wang, Jianming
    Zhao, Bohan
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [39] MHDNet: A Multi-Scale Hybrid Deep Learning Model for Person Re-Identification
    Wang, Jinghui
    Wang, Jun
    ELECTRONICS, 2024, 13 (08)
  • [40] Contextual Multi-Scale Feature Learning for Person Re-Identification
    Fan, Baoyu
    Wang, Li
    Zhang, Runze
    Guo, Zhenhua
    Zhao, Yaqian
    Li, Rengang
    Gong, Weifeng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 655 - 663