A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

Cited by: 35

Authors:

Zhang, Wei [1]

He, Xuanyu [1]

Yu, Xiaodong [1]

Lu, Weizhi [1]

Zha, Zhengjun [2]

Tian, Qi [3]

Affiliations:

[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China

[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020 / Vol. 29

Funding:

National Natural Science Foundation of China;

Keywords:

Video-based person re-id; spatial-temporal attention; multi-scale pooling;

DOI:

10.1109/TIP.2019.2959653

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we propose a novel deep-neural-network-based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame at different scales from the perspective of the whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.
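The abstract gives no implementation details, so the following is only a minimal illustrative sketch of the general idea it describes: scoring local regions of every frame, at several scales, against the whole sequence. Everything specific here is an assumption and not taken from the paper: the horizontal-stripe pooling, the scales (1, 2, 4), the fixed `query` scoring vector (which would be learned in a real model), and the function names `pool_regions` and `attend_video`.

```python
import math
import random


def pool_regions(frame, scale):
    """Average-pool an H x W x C feature map into `scale` horizontal stripes.

    Assumes H >= scale so that every stripe contains at least one row.
    """
    height = len(frame)
    regions = []
    for s in range(scale):
        top = s * height // scale
        bottom = (s + 1) * height // scale
        cells = [cell for row in frame[top:bottom] for cell in row]
        channels = len(cells[0])
        regions.append(
            [sum(c[k] for c in cells) / len(cells) for k in range(channels)]
        )
    return regions


def softmax(scores):
    """Numerically stable softmax over a flat list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_video(video, scales=(1, 2, 4), query=None):
    """Return one attended descriptor per scale, plus the attention weights.

    `video` is a nested list shaped T x H x W x C (frames of feature maps).
    """
    channels = len(video[0][0][0])
    if query is None:
        # Hypothetical scoring vector; a trained model would learn this.
        query = [1.0] * channels
    descriptors, weights_per_scale = [], []
    for scale in scales:
        # Gather the regions of every frame into one pool for this scale.
        regions = [r for frame in video for r in pool_regions(frame, scale)]
        scores = [sum(q * x for q, x in zip(query, r)) for r in regions]
        # A single softmax over all frames and regions, so each region is
        # weighted relative to the whole sequence rather than its own frame.
        weights = softmax(scores)
        descriptors.append(
            [sum(w * r[k] for w, r in zip(weights, regions)) for k in range(channels)]
        )
        weights_per_scale.append(weights)
    return descriptors, weights_per_scale


# Toy input: 5 frames, each a 4 x 2 feature map with 3 channels.
random.seed(0)
video = [
    [[[random.random() for _ in range(3)] for _ in range(2)] for _ in range(4)]
    for _ in range(5)
]
descriptors, weights = attend_video(video)
```

In this sketch the softmax runs jointly over frames and regions, which is one simple way to combine the spatial and temporal domains in a single weighting; the per-scale descriptors could then be concatenated into one video-level representation.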

Pages: 3365 - 3373

Page count: 9
