Local and global aligned spatiotemporal attention network for video-based person re-identification

被引：0

作者：

Li Cheng

Xiao-Yuan Jing

Xiaoke Zhu

Chang-Hui Hu

Guangwei Gao

Songsong Wu

机构：

[1] Wuhan University,School of Computer Science

[2] Guangdong University of Petrochemical Technology,School of Computer

[3] Nanjing University of Posts and Telecommunications,College of Automation

[4] Henan University,School of Computer and Information Engineering

[5] Nanjing University of Posts and Telecommunications,Institute of Advanced Technology

来源：

Multimedia Tools and Applications | 2020年 / 79卷

关键词：

Video-based person re-identification; Local and global; Aligned; Spatiotemporal; Attention;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Matching video clips of people across non-overlapping surveillance cameras (video-based person re-identification) is of significant importance in many real-world applications. In this paper, we address the video-based person re-identification by developing a Local and Global Aligned Spatiotemporal Attention (LGASA) network. Our LGASA network consists of five cascaded modules, including 3D convolutional layers, residual block, spatial transformer network (STN), multi-stream recurrent network and multiple-attention module. Specifically, the 3D convolutional layers are used to capture local short-term fast-varying motion information encoded in multiple adjacent original frames. The residual block is used to extract mid-level feature maps. STN is applied to align the mid-level feature maps. The multi-stream recurrent network is designed to exploit the useful local and global long-term temporal dependency from the aligned mid-level feature maps. The multiple-attention module is designed to aggregate feature vectors of the same body part (or global) from different frames within each video into a single vector according to their importance. Experimental results on three video pedestrian datasets verify the effectiveness of the proposed local and global aligned spatiotemporal attention network.

引用

页码：34489 / 34512

页数：23

共 50 条

[1] Local and global aligned spatiotemporal attention network for video-based person re-identification
Cheng, Li
Jing, Xiao-Yuan
Zhu, Xiaoke
Hu, Chang-Hui
Gao, Guangwei
Wu, Songsong
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34489 - 34512
[2] Spatiotemporal Attention on Sliced Parts for Video-based Person Re-identification
Yang, Xu
Zhang, Bin
Dong, Yuan
Xiong, Fengye
Bai, Hongliang
2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
[3] Diversity Regularized Spatiotemporal Attention for Video-based Person Re-identification
Li, Shuang
Bak, Slawomir
Carr, Peter
Wang, Xiaogang
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 369 - 378
[4] Triplet Attention Network for Video-Based Person Re-Identification
Sun, Rui
Liang, Qili
Yang, Zi
Zhao, Zhenghui
Zhang, Xudong
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10) : 1775 - 1779
[5] A Duplex Spatiotemporal Filtering Network for Video-based Person Re-identification
Zheng, Chong
Wei, Ping
Zheng, Nanning
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7551 - 7557
[6] SANet: Statistic Attention Network for Video-Based Person Re-Identification
Bai, Shutao
Ma, Bingpeng
Chang, Hong
Huang, Rui
Shan, Shiguang
Chen, Xilin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3866 - 3879
[7] Context Sensing Attention Network for Video-based Person Re-identification
Wang, Kan
Ding, Changxing
Pang, Jianxin
Xu, Xiangmin
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
[8] Video-Based Convolutional Attention for Person Re-Identification
Zamprogno, Marco
Passon, Marco
Martinel, Niki
Serra, Giuseppe
Lancioni, Giuseppe
Micheloni, Christian
Tasso, Carlo
Foresti, Gian Luca
IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 3 - 14
[9] Non-Local Spatial and Temporal Attention Network for Video-Based Person Re-Identification
Liu, Zheng
Du, Feixiang
Li, Wang
Liu, Xu
Zou, Qiang
APPLIED SCIENCES-BASEL, 2020, 10 (15):
[10] Temporal Attention Quality Aware Network for Video-based Person Re-Identification
Xu, Boqin
Liu, Changhong
Xue, Shengjun
Jiang, Aiwen
Wang, Shimin
Ye, Jihua
TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069

← 1 2 3 4 5 →