A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

被引:35
|
作者
Zhang, Wei [1 ]
He, Xuanyu [1 ]
Yu, Xiaodong [1 ]
Lu, Weizhi [1 ]
Zha, Zhengjun [2 ]
Tian, Qi [3 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
中国国家自然科学基金;
关键词
Video-based person re-id; spatial-temporal attention; multi-scale pooling;
D O I
10.1109/TIP.2019.2959653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel deep neural network based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame in different scales from the perspective of whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.
引用
收藏
页码:3365 / 3373
页数:9
相关论文
共 50 条
  • [21] Multi-scale attention vehicle re-identification
    Aihua Zheng
    Xianmin Lin
    Jiacheng Dong
    Wenzhong Wang
    Jin Tang
    Bin Luo
    Neural Computing and Applications, 2020, 32 : 17489 - 17503
  • [22] Person re-identification with activity prediction based on hierarchical spatial-temporal model
    Li, Minxian
    Shen, Fumin
    Wang, Jingya
    Guan, Chao
    Tang, Jinhui
    NEUROCOMPUTING, 2018, 275 : 1200 - 1207
  • [23] Unsupervised Spatial-Temporal Model Based on Region Alignment for Person Re-identification
    Li, Wei
    Qi, Meibin
    Yang, Ning
    Zhou, Guowu
    Yang, Yubing
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [24] SMSNet: A Novel Multi-scale Siamese Model for Person Re-Identification
    Tagore, Nirbhay Kumar
    Chattopadhyay, Pratik
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS - DCNET, OPTICS, SIGMAP AND WINSYS (ICETE), VOL 2, 2020, : 103 - 112
  • [25] Multi-scale saliency features fusion model for person re-identification
    Liao, Kaiyang
    Wang, Keer
    Zheng, Yuanlin
    Lin, Guangfeng
    Cao, Congjun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (22) : 61605 - 61620
  • [26] Multi-scale feature combination for person re-identification
    Huang, Bailiang
    Piao, Yan
    Zhang, Hao
    Tang, Yanfeng
    IET IMAGE PROCESSING, 2022, 16 (07) : 2001 - 2011
  • [27] Multi-scale feature representation for person re-identification
    Lu J.
    Wang H.-Y.
    Chen X.
    Zhang K.-B.
    Liu W.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (12): : 3015 - 3022
  • [28] Multi-Scale Convolutional Network for Person Re-identification
    Wu, Qiong
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGY (CNCT 2016), 2016, 54 : 826 - 835
  • [29] Multi-scale joint learning for person re-identification
    Xie P.
    Xu X.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 613 - 622
  • [30] Multi-Scale Relation Network for Person Re-identification
    Ma, Yi
    Bai, Tian
    Zhang, Wenyu
    Li, Shuang
    Hu, Jian
    Lu, Mingzhe
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,