A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos

被引：35

作者：

Zhang, Wei ^{[1
]}

He, Xuanyu ^{[1
]}

Yu, Xiaodong ^{[1
]}

Lu, Weizhi ^{[1
]}

Zha, Zhengjun ^{[2
]}

Tian, Qi ^{[3
]}

机构：

[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China

[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020年 / 29卷

基金：

中国国家自然科学基金;

关键词：

Video-based person re-id; spatial-temporal attention; multi-scale pooling;

D O I：

10.1109/TIP.2019.2959653

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a novel deep neural network based attention model to learn the representative local regions from a video sequence for person re-identification. Specifically, we propose a multi-scale spatial-temporal attention (MSTA) model to measure the regions of each frame in different scales from the perspective of whole video sequence. Compared to traditional temporal attention models, MSTA focuses on exploiting the importance of local regions of each frame to the whole video representation in both spatial and temporal domains. A new training strategy is designed for the proposed model by incorporating the image-to-image mode with the video-to-video mode. Extensive experiments on benchmark datasets demonstrate the superiority of the proposed model over state-of-the-art methods.

引用

页码：3365 / 3373

页数：9

共 50 条

[21] Multi-scale attention vehicle re-identification
Aihua Zheng
Xianmin Lin
Jiacheng Dong
Wenzhong Wang
Jin Tang
Bin Luo
Neural Computing and Applications, 2020, 32 : 17489 - 17503
[22] Person re-identification with activity prediction based on hierarchical spatial-temporal model
Li, Minxian
Shen, Fumin
Wang, Jingya
Guan, Chao
Tang, Jinhui
NEUROCOMPUTING, 2018, 275 : 1200 - 1207
[23] Unsupervised Spatial-Temporal Model Based on Region Alignment for Person Re-identification
Li, Wei
Qi, Meibin
Yang, Ning
Zhou, Guowu
Yang, Yubing
2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
[24] SMSNet: A Novel Multi-scale Siamese Model for Person Re-Identification
Tagore, Nirbhay Kumar
Chattopadhyay, Pratik
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS - DCNET, OPTICS, SIGMAP AND WINSYS (ICETE), VOL 2, 2020, : 103 - 112
[25] Multi-scale saliency features fusion model for person re-identification
Liao, Kaiyang
Wang, Keer
Zheng, Yuanlin
Lin, Guangfeng
Cao, Congjun
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (22) : 61605 - 61620
[26] Multi-scale feature combination for person re-identification
Huang, Bailiang
Piao, Yan
Zhang, Hao
Tang, Yanfeng
IET IMAGE PROCESSING, 2022, 16 (07) : 2001 - 2011
[27] Multi-scale feature representation for person re-identification
Lu J.
Wang H.-Y.
Chen X.
Zhang K.-B.
Liu W.
Kongzhi yu Juece/Control and Decision, 2021, 36 (12): : 3015 - 3022
[28] Multi-Scale Convolutional Network for Person Re-identification
Wu, Qiong
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGY (CNCT 2016), 2016, 54 : 826 - 835
[29] Multi-scale joint learning for person re-identification
Xie P.
Xu X.
Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 613 - 622
[30] Multi-Scale Relation Network for Person Re-identification
Ma, Yi
Bai, Tian
Zhang, Wenyu
Li, Shuang
Hu, Jian
Lu, Mingzhe
26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,

← 1 2 3 4 5 →