Local and global aligned spatiotemporal attention network for video-based person re-identification

被引:0
|
作者
Li Cheng
Xiao-Yuan Jing
Xiaoke Zhu
Chang-Hui Hu
Guangwei Gao
Songsong Wu
机构
[1] Wuhan University,School of Computer Science
[2] Guangdong University of Petrochemical Technology,School of Computer
[3] Nanjing University of Posts and Telecommunications,College of Automation
[4] Henan University,School of Computer and Information Engineering
[5] Nanjing University of Posts and Telecommunications,Institute of Advanced Technology
来源
关键词
Video-based person re-identification; Local and global; Aligned; Spatiotemporal; Attention;
D O I
暂无
中图分类号
学科分类号
摘要
Matching video clips of people across non-overlapping surveillance cameras (video-based person re-identification) is of significant importance in many real-world applications. In this paper, we address the video-based person re-identification by developing a Local and Global Aligned Spatiotemporal Attention (LGASA) network. Our LGASA network consists of five cascaded modules, including 3D convolutional layers, residual block, spatial transformer network (STN), multi-stream recurrent network and multiple-attention module. Specifically, the 3D convolutional layers are used to capture local short-term fast-varying motion information encoded in multiple adjacent original frames. The residual block is used to extract mid-level feature maps. STN is applied to align the mid-level feature maps. The multi-stream recurrent network is designed to exploit the useful local and global long-term temporal dependency from the aligned mid-level feature maps. The multiple-attention module is designed to aggregate feature vectors of the same body part (or global) from different frames within each video into a single vector according to their importance. Experimental results on three video pedestrian datasets verify the effectiveness of the proposed local and global aligned spatiotemporal attention network.
引用
收藏
页码:34489 / 34512
页数:23
相关论文
共 50 条
  • [1] Local and global aligned spatiotemporal attention network for video-based person re-identification
    Cheng, Li
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Hu, Chang-Hui
    Gao, Guangwei
    Wu, Songsong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34489 - 34512
  • [2] Spatiotemporal Attention on Sliced Parts for Video-based Person Re-identification
    Yang, Xu
    Zhang, Bin
    Dong, Yuan
    Xiong, Fengye
    Bai, Hongliang
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [3] Diversity Regularized Spatiotemporal Attention for Video-based Person Re-identification
    Li, Shuang
    Bak, Slawomir
    Carr, Peter
    Wang, Xiaogang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 369 - 378
  • [4] Triplet Attention Network for Video-Based Person Re-Identification
    Sun, Rui
    Liang, Qili
    Yang, Zi
    Zhao, Zhenghui
    Zhang, Xudong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10) : 1775 - 1779
  • [5] A Duplex Spatiotemporal Filtering Network for Video-based Person Re-identification
    Zheng, Chong
    Wei, Ping
    Zheng, Nanning
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7551 - 7557
  • [6] SANet: Statistic Attention Network for Video-Based Person Re-Identification
    Bai, Shutao
    Ma, Bingpeng
    Chang, Hong
    Huang, Rui
    Shan, Shiguang
    Chen, Xilin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3866 - 3879
  • [7] Context Sensing Attention Network for Video-based Person Re-identification
    Wang, Kan
    Ding, Changxing
    Pang, Jianxin
    Xu, Xiangmin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [8] Video-Based Convolutional Attention for Person Re-Identification
    Zamprogno, Marco
    Passon, Marco
    Martinel, Niki
    Serra, Giuseppe
    Lancioni, Giuseppe
    Micheloni, Christian
    Tasso, Carlo
    Foresti, Gian Luca
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 3 - 14
  • [9] Non-Local Spatial and Temporal Attention Network for Video-Based Person Re-Identification
    Liu, Zheng
    Du, Feixiang
    Li, Wang
    Liu, Xu
    Zou, Qiang
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [10] Temporal Attention Quality Aware Network for Video-based Person Re-Identification
    Xu, Boqin
    Liu, Changhong
    Xue, Shengjun
    Jiang, Aiwen
    Wang, Shimin
    Ye, Jihua
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069