Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

被引:230
|
作者
Xu, Shuangjie [1 ]
Cheng, Yu [2 ]
Gu, Kang [1 ]
Yang, Yang [3 ]
Chang, Shiyu [4 ]
Zhou, Pan [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] IBM Res, AI Fdn, Armonk, NY USA
[3] Northwestern Univ, Evanston, IL 60208 USA
[4] IBM TJ Watson Res Ctr, Ossining, NY 10562 USA
关键词
D O I
10.1109/ICCV.2017.507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person Re-Identification (person re-id) is a crucial task as its applications in visual surveillance and human-computer interaction. In this work, we present a novel joint Spatial and Temporal Attention Pooling Network (ASTPN) for video-based person re-identification, which enables the feature extractor to be aware of the current input video sequences, in a way that interdependency from the matching items can directly influence the computation of each other's representation. Specifically, the spatial pooling layer is able to select regions from each frame, while the attention temporal pooling performed can select informative frames over the sequence, both pooling guided by the information from distance matching. Experiments are conduced on the iLIDS-VID, PRID-2011 and MARS datasets and the results demonstrate that this approach outperforms existing state-of-art methods. We also analyze how the joint pooling in both dimensions can boost the person re-id performance more effectively than using either of them separately(1).
引用
收藏
页码:4743 / 4752
页数:10
相关论文
共 50 条
  • [31] COMPLEX SPATIAL-TEMPORAL ATTENTION AGGREGATION FOR VIDEO PERSON RE-IDENTIFICATION
    Ding, Wenjie
    Wei, Xing
    Hong, Xiaopeng
    Gong, Yihong
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2441 - 2445
  • [32] Multi-granularity graph pooling for video-based person re-identification
    Pan, Honghu
    Chen, Yongyong
    He, Zhenyu
    NEURAL NETWORKS, 2023, 160 : 22 - 33
  • [33] Occluded Video-Based Person Re-Identification Based on Spatial- Temporal Trajectory Fusion
    Yun Xiao
    Song Kaili
    Zhang Xiaoguang
    Yuan Xinchao
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (10)
  • [34] See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-based Person Re-identification
    Zhou, Zhen
    Huang, Yan
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6776 - 6785
  • [35] Temporal Extension Topology Learning for Video-Based Person Re-identification
    Ning, Jiaqi
    Li, Fei
    Liu, Rujie
    Takeuchi, Shun
    Suzuki, Genta
    COMPUTER VISION - ACCV 2022 WORKSHOPS, 2023, 13848 : 213 - 225
  • [36] CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION
    Rahman, Tanzila
    Rochan, Mrigank
    Wang, Yang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1102 - 1107
  • [37] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
    Zhang, Wei
    Yu, Xiaodong
    He, Xuanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
  • [38] A spatial and temporal features mixture model with body parts for video-based person re-identification
    Liu, Jie
    Sun, Cheng
    Xu, Xiang
    Xu, Baomin
    Yu, Shuangyuan
    APPLIED INTELLIGENCE, 2019, 49 (09) : 3436 - 3446
  • [39] Non-Local Spatial and Temporal Attention Network for Video-Based Person Re-Identification
    Liu, Zheng
    Du, Feixiang
    Li, Wang
    Liu, Xu
    Zou, Qiang
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [40] Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification
    Li, Xingze
    Zhou, Wengang
    Zhou, Yun
    Li, Houqiang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11434 - 11441