A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

被引:35
|
作者
Zhang, Wei [1 ]
He, Xuanyu [1 ]
Yu, Xiaodong [1 ]
Lu, Weizhi [1 ]
Zha, Zhengjun [2 ]
Tian, Qi [3 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
中国国家自然科学基金;
关键词
Video-based person re-id; spatial-temporal attention; multi-scale pooling;
D O I
10.1109/TIP.2019.2959653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel deep neural network based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame in different scales from the perspective of whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.
引用
收藏
页码:3365 / 3373
页数:9
相关论文
共 50 条
  • [11] Person Re-identification Based on Multi-scale Network Attention Fusion
    Wang Fenhua
    Zhao Bo
    Huang Chao
    Yan Youqi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (12) : 3045 - 3052
  • [12] Multi-Branch Person Re-Identification Basedon Multi-Scale Attention
    Cong, Li
    Min, Jiang
    Jun, Kong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
  • [13] STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification
    Fu, Yang
    Wang, Xiaoyang
    Wei, Yunchao
    Huang, Thomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8287 - 8294
  • [14] Progressive spatial-temporal transfer model for unsupervised person re-identification
    Zhou, Shuren
    Li, Zhixiong
    Liu, Jie
    Zhou, Jiarui
    Zhang, Jianming
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (02)
  • [15] Person Re-Identification with Weighted Spatial-Temporal Features
    Zhang, Dongyu
    Chen, Rongcong
    Qiu, Zhilin
    Zhang, Wei
    Wang, Qing
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1426 - 1431
  • [16] Multi-Scale Temporal Cues Learning for Video Person Re-Identification
    Li, Jianing
    Zhang, Shiliang
    Huang, Tiejun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4461 - 4473
  • [17] Temporal Multi-Scale Complementary Feature for Video Person Re-Identification
    Hou R.-B.
    Chang H.
    Ma B.-P.
    Huang R.
    Shan S.-G.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (01): : 31 - 50
  • [18] Spatial-Temporal Omni-Scale Feature Learning for Person Re-Identification
    Ploco, Aida
    Rodriguez, Andrea Macarulla
    Geradts, Zeno
    2020 8TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF 2020), 2020,
  • [19] Video-based person re-identification with parallel spatial-temporal attention module
    Kong, Jun
    Teng, Zhende
    Jiang, Min
    Huo, Hongtao
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
  • [20] Multi-scale attention vehicle re-identification
    Zheng, Aihua
    Lin, Xianmin
    Dong, Jiacheng
    Wang, Wenzhong
    Tang, Jin
    Luo, Bin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (23): : 17489 - 17503