Improving Text-Based Person Retrieval by Excavating All-Round Information Beyond Color

Cited by: 2
Authors
Zhu, Aichun [1]
Wang, Zijie [1]
Xue, Jingyi [1]
Wan, Xili [1]
Jin, Jing [1]
Wang, Tian [2]
Snoussi, Hichem [3]
Affiliations
[1] Nanjing Tech Univ, Coll Comp & Informat Engn, Nanjing 211816, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Zhongguancun Lab, SKLCCSE, Beijing 100191, Peoples R China
[3] Univ Technol Troyes, Inst Charles Delaunay, LM2S FRE CNRS 2019, F-10004 Troyes, France
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Image color analysis; Visualization; Semantics; Data models; Pedestrians; Learning systems; Color (CLR) information; cross-modal retrieval; frequency; person reidentification (ReID); text-based person retrieval; NETWORK;
DOI
10.1109/TNNLS.2024.3368217
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text-based person retrieval is the process of searching a massive visual resource library for images of a particular pedestrian, based on a textual query. Existing approaches often suffer from a problem of color (CLR) over-reliance, which can result in a suboptimal person retrieval performance by distracting the model from other important visual cues such as texture and structure information. To handle this problem, we propose a novel framework to Excavate All-round Information Beyond Color for the task of text-based person retrieval, which is therefore termed EAIBC. The EAIBC architecture includes four branches, namely an RGB branch, a grayscale (GRS) branch, a high-frequency (HFQ) branch, and a CLR branch. Furthermore, we introduce a mutual learning (ML) mechanism to facilitate communication and learning among the branches, enabling them to take full advantage of all-round information in an effective and balanced manner. We evaluate the proposed method on three benchmark datasets, including CUHK-PEDES, ICFG-PEDES, and RSTPReid. The experimental results demonstrate that EAIBC significantly outperforms existing methods and achieves state-of-the-art (SOTA) performance in supervised, weakly supervised, and cross-domain settings.
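As a rough illustration of what "information beyond color" can mean in practice, the sketch below derives a grayscale view and a high-frequency view from an RGB image, loosely mirroring the inputs of the GRS and HFQ branches named in the abstract. It is not the authors' implementation: the BT.601 luma weights, the 3x3 box-blur high-pass filter, and the names `to_grayscale`, `high_pass`, and `make_views` are all assumptions chosen for illustration. The CLR branch is left out because the abstract does not say how its input is built.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[float, float, float]


def to_grayscale(img: List[List[Pixel]]) -> List[List[float]]:
    """Collapse RGB to one channel using BT.601 luma weights (an assumed choice)."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row] for row in img]


def high_pass(gray: List[List[float]]) -> List[List[float]]:
    """Subtract a 3x3 box blur from each pixel, keeping only the fine detail.

    Borders are handled by clamping coordinates to the image edge.
    This is a stand-in filter, not the one used in the paper.
    """
    h, w = len(gray), len(gray[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            total = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    total += gray[yy][xx]
            row.append(gray[y][x] - total / 9.0)
        out.append(row)
    return out


def make_views(img: List[List[Pixel]]) -> Dict[str, list]:
    """Return the three views a multi-branch model of this kind could consume."""
    gray = to_grayscale(img)
    return {"rgb": img, "grs": gray, "hfq": high_pass(gray)}
```

On a uniformly colored image the high-frequency view is zero everywhere, while a sharp vertical edge produces a negative response on its dark side and a positive one on its bright side, which is the kind of texture and structure cue that survives once color is removed.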
Pages: 1 - 15
Page count: 15
Related papers
49 in total
  • [21] Improving Query and Assessment Quality in Text-Based Interactive Video Retrieval Evaluation
    Bailer, Werner
    Arnold, Rahel
    Benz, Vera
    Coccomini, Davide Alessandro
    Gkagkas, Anastasios
    Gudmundsson, Gylfi Thor
    Heller, Silvan
    Jonsson, Bjorn Thor
    Lokoc, Jakub
    Messina, Nicola
    Pantelidis, Nick
    Wu, Jiaxin
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023 : 597 - 601
  • [22] Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold
    Wang, Zijie
    Zhu, Aichun
    Xue, Jingyi
    Wan, Xili
    Liu, Chao
    Wang, Tian
    Li, Yifeng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022 : 1984 - 1992
  • [23] Evaluation of N-grams conflation approach in text-based information retrieval
    Kosinov, S
    EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2001 : 136 - 142
  • [24] Hash Join Algorithms Used in Text-Based Information Retrieval: Guidelines for Users
    Rahman, Nurazzah Abd
    Saad, Tareq Salahi
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008 : 1386 - 1392
  • [25] Cross-Modal Uncertainty Modeling With Diffusion-Based Refinement for Text-Based Person Retrieval
    Li, Shenshen
    Xu, Xing
    He, Chen
    Shen, Fumin
    Yang, Yang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2881 - 2893
  • [26] Fine-grained Semantics-aware Representation Learning for Text-based Person Retrieval
    Wang, Di
    Yan, Feng
    Wang, Yifeng
    Zhao, Lin
    Liang, Xiao
    Zhong, Haodi
    Zhang, Ronghua
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024 : 92 - 100
  • [27] Multi-granularity Separation Network for Text-Based Person Retrieval with Bidirectional Refinement Regularization
    Li, Shenshen
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023 : 307 - 315
  • [28] Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval
    Shen, Fei
    Shu, Xiangbo
    Du, Xiaoyu
    Tang, Jinhui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023 : 8922 - 8931
  • [30] Hybrid optimization and ontology-based semantic model for efficient text-based information retrieval
    Kumar, Ram
    Sharma, S. C.
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (02) : 2251 - 2280