Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

被引:230
|
作者
Xu, Shuangjie [1 ]
Cheng, Yu [2 ]
Gu, Kang [1 ]
Yang, Yang [3 ]
Chang, Shiyu [4 ]
Zhou, Pan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] IBM Res, AI Fdn, Armonk, NY USA
[3] Northwestern Univ, Evanston, IL 60208 USA
[4] IBM TJ Watson Res Ctr, Ossining, NY 10562 USA
关键词
D O I
10.1109/ICCV.2017.507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person Re-Identification (person re-id) is a crucial task as its applications in visual surveillance and human-computer interaction. In this work, we present a novel joint Spatial and Temporal Attention Pooling Network (ASTPN) for video-based person re-identification, which enables the feature extractor to be aware of the current input video sequences, in a way that interdependency from the matching items can directly influence the computation of each other's representation. Specifically, the spatial pooling layer is able to select regions from each frame, while the attention temporal pooling performed can select informative frames over the sequence, both pooling guided by the information from distance matching. Experiments are conduced on the iLIDS-VID, PRID-2011 and MARS datasets and the results demonstrate that this approach outperforms existing state-of-art methods. We also analyze how the joint pooling in both dimensions can boost the person re-id performance more effectively than using either of them separately(1).
引用
收藏
页码:4743 / 4752
页数:10
相关论文
共 50 条
  • [41] A spatial and temporal features mixture model with body parts for video-based person re-identification
    Jie Liu
    Cheng Sun
    Xiang Xu
    Baomin Xu
    Shuangyuan Yu
    Applied Intelligence, 2019, 49 : 3436 - 3446
  • [42] Spatial Quality Aware Network for Video-Based Person Re-identification
    Wang, Yujie
    Leng, Biao
    Song, Guanglu
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 34 - 43
  • [43] Video-based Person Re-identification Using Refined Attention Networks
    Rahman, Tanzila
    Rochan, Mrigank
    Wang, Yang
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [44] Person Re-Identification with Weighted Spatial-Temporal Features
    Zhang, Dongyu
    Chen, Rongcong
    Qiu, Zhilin
    Zhang, Wei
    Wang, Qing
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1426 - 1431
  • [45] Video-based person re-identification with scene and person attributes
    Gong, Xun
    Luo, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8117 - 8128
  • [46] VIDEO BASED PERSON RE-IDENTIFICATION BY RE-RANKING ATTENTIVE TEMPORAL INFORMATION IN DEEP RECURRENT CONVOLUTIONAL NETWORKS
    Saha, Bhaswati
    Ram, K. Sai
    Mukhopadhyay, Jayanta
    Roy, Aditi
    Navelkar, Anchit
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1663 - 1667
  • [47] Video-based person re-identification with scene and person attributes
    Xun Gong
    Bin Luo
    Multimedia Tools and Applications, 2024, 83 : 8117 - 8128
  • [48] Temporal Attention Quality Aware Network for Video-based Person Re-Identification
    Xu, Boqin
    Liu, Changhong
    Xue, Shengjun
    Jiang, Aiwen
    Wang, Shimin
    Ye, Jihua
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [49] Saliency and Granularity: Discovering Temporal Coherence for Video-Based Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Liu, Yimin
    Jiang, Jianguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6100 - 6112
  • [50] Temporal-Contextual Attention Network for Video-Based Person Re-identification
    Chen, Di
    Zha, Zheng-Jun
    Liu, Jiawei
    Xie, Hongtao
    Zhang, Yongdong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 146 - 157