Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning

Authors
Wu, Xinyi [1 ]
Ma, Wentao [2 ]
Guo, Dan [3 ]
Zhou, Tongqing [1 ]
Zhao, Shan [3 ]
Cai, Zhiping [1 ]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
[2] Anhui Agr Univ, Sch Informat & Artificial Intelligence, Hefei, Peoples R China
[3] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Peoples R China
Source
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6 | 2024
Funding
National Natural Science Foundation of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text-based Person Re-identification (T-ReID), which aims at retrieving a specific pedestrian image from a collection of images via text-based information, has received significant attention. However, previous research has overlooked a challenging yet practical form of T-ReID: dealing with image galleries that mix occluded and inconsistent personal visuals, instead of ideal visuals with a full-body and clear view. Its major challenges lie in the insufficiency of benchmark datasets and the enlarged semantic gap incurred by arbitrary occlusions and the modality gap between the text description and the visual representation of the target person. To alleviate these issues, we first design an Occlusion Generator (OGor) for the automatic generation of artificial occluded images from generic surveillance images. Then, a fine-granularity token selection mechanism is proposed to minimize the negative impact of occlusion for robust feature learning, and a novel multi-granularity contrastive consistency alignment framework is designed to leverage intra/inter-granularity of visual-text representations for semantic alignment of occluded visuals and query texts. Experimental results demonstrate that our method exhibits superior performance. We believe this work could inspire the community to investigate more dedicated designs for implementing T-ReID in real-world scenarios. The source code is available at https://github.com/littlexinyi/MGCC.
Pages: 6162-6170
Number of pages: 9