Frame-Guided Region-Aligned Representation for Video Person Re-Identification

被引：0

作者：

Chen, Zengqun ^{[1
]}

Zhou, Zhiheng ^{[1
]}

Huang, Junchu ^{[1
]}

Zhang, Pengyu ^{[1
]}

Li, Bo ^{[1
]}

机构：

[1] South China Univ Technol, Guangzhou, Peoples R China

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Pedestrians in videos are usually in a moving state, resulting in serious spatial misalignment like scale variations and pose changes, which makes the video-based person re-identification problem more challenging. To address the above issue, in this paper, we propose a Frame-Guided Region-Aligned model (FGRA) for discriminative representation learning in two steps in an end-to-end manner. Firstly, based on a frame-guided feature learning strategy and a non-parametric alignment module, a novel alignment mechanism is proposed to extract well-aligned region features. Secondly, in order to form a sequence representation, an effective feature aggregation strategy that utilizes temporal alignment score and spatial attention is adopted to fuse region features in the temporal and spatial dimensions, respectively. Experiments are conducted on benchmark datasets to demonstrate the effectiveness of the proposed method to solve the misalignment problem and the superiority of the proposed method to the existing video-based person re-identification methods.

引用

页码：10591 / 10598

页数：8

共 50 条

[1] TEMPORALLY ALIGNED POOLING REPRESENTATION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
Gao, Changxin
Wang, Jin
Liu, Leyuan
Yu, Jin-Gang
Sang, Nong
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4284 - 4288
[2] MRRNet: Learning multiple region representation for video person re-identification
Fu, Hui
Zhang, Ke
Li, Haoyu
Wang, Jingyu
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
[3] An Aligned Bidirectional Feature Representation for Person Re-identification
Wang, Daiyin
Hao, Lei
Zhu, Yuesheng
2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
[4] Semantics-Aligned Representation Learning for Person Re-Identification
Jin, Xin
Lan, Cuiling
Zeng, Wenjun
Wei, Guoqiang
Chen, Zhibo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11173 - 11180
[5] Superpixel-Based Temporally Aligned Representation for Video-Based Person Re-Identification
Gao, Changxin
Wang, Jin
Liu, Leyuan
Yu, Jin-Gang
Sang, Nong
SENSORS, 2019, 19 (18)
[6] Pose-Guided Representation Learning for Person Re-Identification
Li, Jianing
Zhang, Shiliang
Tian, Qi
Wang, Meng
Gao, Wen
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 622 - 635
[7] Adaptive Graph Representation Learning for Video Person Re-Identification
Wu, Yiming
Bourahla, Omar El Farouk
Li, Xi
Wu, Fei
Tian, Qi
Zhou, Xue
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 8821 - 8830
[8] Person Re-Identification by Semantic Region Representation and Topology Constraint
Lei, Jianjun
Niu, Lijie
Fu, Huazhu
Peng, Bo
Huang, Qingming
Hou, Chunping
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2453 - 2466
[9] AN UNBIASED TEMPORAL REPRESENTATION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
Zhang, Xiu
Bhanu, Bir
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 838 - 842
[10] Multiscale Aligned SpatialTemporal Interaction for Video-Based Person Re-Identification
Ran, Zhidan
Wei, Xuan
Liu, Wei
Lu, Xiaobo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8536 - 8546

← 1 2 3 4 5 →