Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning

被引：0

作者：

Wu, Xinyi ^{[1
]}

Ma, Wentao ^{[2
]}

Guo, Dan ^{[3
]}

Zhou, Tongqing ^{[1
]}

Zhao, Shan ^{[3
]}

Cai, Zhiping ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China

[2] Anhui Agr Univ, Sch Informat & Artificial Intelligence, Hefei, Peoples R China

[3] HeFei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6 | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-based Person Re-identification (T-ReID), which aims at retrieving a specific pedestrian image from a collection of images via text-based information, has received significant attention. However, previous research has overlooked a challenging yet practical form of T-ReID: dealing with image galleries mixed with occluded and inconsistent personal visuals, instead of ideal visuals with a full-body and clear view. Its major challenges lay in the insufficiency of benchmark datasets and the enlarged semantic gap incurred by arbitrary occlusions and modality gap between text description and visual representation of the target person. To alleviate these issues, we first design an Occlusion Generator (OGor) for the automatic generation of artificial occluded images from generic surveillance images. Then, a fine-granularity token selection mechanism is proposed to minimize the negative impact of occlusion for robust feature learning, and a novel multi-granularity contrastive consistency alignment framework is designed to leverage intra/inter-granularity of visual-text representations for semantic alignment of occluded visuals and query texts. Experimental results demonstrate that our method exhibits superior performance. We believe this work could inspire the community to investigate more dedicated designs for implementing TReID in real-world scenarios. The source code is available at https://github.com/littlexinyi/MGCC.

引用

页码：6162 / 6170

页数：9

共 50 条

[41] Multi-level cross-modality learning framework for text-based person re-identification
Wu, Tinghui
Zhang, Shuhe
Chen, Dihu
Hu, Haifeng
ELECTRONICS LETTERS, 2023, 59 (20)
[42] LAG-Net: Multi-Granularity Network for Person Re-Identification via Local Attention System
Gong, Xun
Yao, Zu
Li, Xin
Fan, Yueqiao
Luo, Bin
Fan, Jianfeng
Lao, Boji
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 217 - 229
[43] Vehicle Re-Identification Based on Global Relational Attention and Multi-Granularity Feature Learning
Tian, Xin
Pang, Xiyu
Jiang, Gangwu
Meng, Qinglan
Zheng, Yanli
IEEE ACCESS, 2022, 10 : 17674 - 17682
[44] Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search
Jing, Ya
Si, Chenyang
Wang, Junbo
Wang, Wei
Wang, Liang
Tan, Tieniu
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11189 - 11196
[45] RMHNet: A Relation-Aware Multi-granularity Hierarchical Network for Person Re-identification
Gengsheng Xie
Xianbin Wen
Neural Processing Letters, 2023, 55 : 1433 - 1454
[46] Multi-granularity Separation Network for Text-Based Person Retrieval with Bidirectional Refinement Regularization
Li, Shenshen
Xu, Xing
Shen, Fumin
Yang, Yang
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 307 - 315
[47] RMHNet: A Relation-Aware Multi-granularity Hierarchical Network for Person Re-identification
Xie, Gengsheng
Wen, Xianbin
NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1433 - 1454
[48] IGMG: Instance-guided multi-granularity for domain generalizable person re-identification
Bhuiyan, Amran
Huang, Jimmy Xiangji
An, Aijun
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
[49] Deep Learning Based Occluded Person Re-Identification: A Survey
Peng, Yunjie
Wu, Jinlin
Xu, Boqiang
Cao, Chunshui
Liu, Xu
Sun, Zhenan
He, Zhiqiang
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
[50] Semantic consistent feature construction and multi-granularity feature learning for visible-infrared person re-identification
Yiming Wang
Kaixiong Xu
Yi Chai
Yutao Jiang
Guanqiu Qi
The Visual Computer, 2024, 40 : 2363 - 2379

← 1 2 3 4 5 →