A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

被引:35
|
作者
Zhang, Wei [1 ]
He, Xuanyu [1 ]
Yu, Xiaodong [1 ]
Lu, Weizhi [1 ]
Zha, Zhengjun [2 ]
Tian, Qi [3 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
中国国家自然科学基金;
关键词
Video-based person re-id; spatial-temporal attention; multi-scale pooling;
D O I
10.1109/TIP.2019.2959653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel deep neural network based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame in different scales from the perspective of whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.
引用
收藏
页码:3365 / 3373
页数:9
相关论文
共 50 条
  • [41] Discriminative multi-scale adjacent feature for person re-identification
    Mengzan Qi
    Sixian Chan
    Feng Hong
    Yuan Yao
    Xiaolong Zhou
    Complex & Intelligent Systems, 2024, 10 : 4557 - 4569
  • [42] Multi-scale Deep Learning Architectures for Person Re-identification
    Qian, Xuelin
    Fu, Yanwei
    Jiang, Yu-Gang
    Xiang, Tao
    Xue, Xiangyang
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5409 - 5418
  • [43] Spatial and Temporal Dual-Attention for Unsupervised Person Re-Identification
    He, Qiaolin
    Wang, Zihan
    Zheng, Zhijie
    Hu, Haifeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1953 - 1965
  • [44] Person re-identification based on multi-scale convolutional network
    Yang, XiuJie
    Chen, Ping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9299 - 9313
  • [45] Person re-identification based on multi-scale feature learning
    Li, Yueying
    Liu, Li
    Zhu, Lei
    Zhang, Huaxiang
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [46] Discriminative multi-scale adjacent feature for person re-identification
    Qi, Mengzan
    Chan, Sixian
    Hong, Feng
    Yao, Yuan
    Zhou, Xiaolong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4557 - 4569
  • [47] Person re-identification based on multi-scale constraint network
    Li, Sishang
    Liu, Xueliang
    Zhao, Ye
    Wang, Meng
    PATTERN RECOGNITION LETTERS, 2020, 138 : 403 - 409
  • [48] Person Re-Identification by Deep Learning Multi-Scale Representations
    Chen, Yanbei
    Zhu, Xiatian
    Gong, Shaogang
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2590 - 2600
  • [49] Multi-scale feature fusion network for person re-identification
    Wang, Yongjie
    Zhang, Wei
    Liu, Yanyan
    IET IMAGE PROCESSING, 2020, 14 (17) : 4614 - 4620
  • [50] Person re-identification based on multi-scale convolutional network
    XiuJie Yang
    Ping Chen
    Multimedia Tools and Applications, 2020, 79 : 9299 - 9313