Improving Text-Based Person Retrieval by Excavating All-Round Information Beyond Color

被引:2
|
作者
Zhu, Aichun [1 ]
Wang, Zijie [1 ]
Xue, Jingyi [1 ]
Wan, Xili [1 ]
Jin, Jing [1 ]
Wang, Tian [2 ]
Snoussi, Hichem [3 ]
机构
[1] Nanjing Tech Univ, Coll Comp & Informat Engn, Nanjing 211816, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Zhongguancun Lab, SKLCCSE, Beijing 100191, Peoples R China
[3] Univ Technol Troyes, Inst Charles Delaunay, LM2S FRE CNRS 2019, F-10004 Troyes, France
基金
中国国家自然科学基金;
关键词
Task analysis; Image color analysis; Visualization; Semantics; Data models; Pedestrians; Learning systems; Color (CLR) information; cross-modal retrieval; frequency; person reidentification (ReID); text-based person retrieval; NETWORK;
D O I
10.1109/TNNLS.2024.3368217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person retrieval is the process of searching a massive visual resource library for images of a particular pedestrian, based on a textual query. Existing approaches often suffer from a problem of color (CLR) over-reliance, which can result in a suboptimal person retrieval performance by distracting the model from other important visual cues such as texture and structure information. To handle this problem, we propose a novel framework to Excavate All-round Information Beyond Color for the task of text-based person retrieval, which is therefore termed EAIBC. The EAIBC architecture includes four branches, namely an RGB branch, a grayscale (GRS) branch, a high-frequency (HFQ) branch, and a CLR branch. Furthermore, we introduce a mutual learning (ML) mechanism to facilitate communication and learning among the branches, enabling them to take full advantage of all-round information in an effective and balanced manner. We evaluate the proposed method on three benchmark datasets, including CUHK-PEDES, ICFG-PEDES, and RSTPReid. The experimental results demonstrate that EAIBC significantly outperforms existing methods and achieves state-of-the-art (SOTA) performance in supervised, weakly supervised, and cross-domain settings.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 49 条
  • [1] Improving Text-Based Person Retrieval by Excavating All-Round Information Beyond Color
    Zhu, Aichun
    Wang, Zijie
    Xue, Jingyi
    Wan, Xili
    Jin, Jing
    Wang, Tian
    Snoussi, Hichem
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (03) : 5097 - 5111
  • [2] CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval
    Wang, Zijie
    Zhu, Aichun
    Xue, Jingyi
    Wan, Xili
    Liu, Chao
    Wang, Tian
    Li, Yifeng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5314 - 5322
  • [3] Chatting with interactive memory for text-based person retrieval
    He, Chen
    Li, Shenshen
    Wang, Zheng
    Chen, Hua
    Shen, Fumin
    Xu, Xing
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [4] Learning Semantic Polymorphic Mapping for Text-Based Person Retrieval
    Li, Jiayi
    Jiang, Min
    Kong, Jun
    Tao, Xuefeng
    Luo, Xi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10678 - 10691
  • [5] SUM: Serialized Updating and Matching for text-based person retrieval
    Wang, Zijie
    Zhu, Aichun
    Xue, Jingyi
    Jiang, Daihong
    Liu, Chao
    Li, Yifeng
    Hu, Fangqiang
    KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [6] Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval
    Li, Shenshen
    He, Chen
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shen, Heng Tao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3172 - 3180
  • [7] Text-based interfaces and text-based bibliographic enhancements: Thinking beyond standard bibliographic information (and text)
    Wall, TB
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1996, 33 : 278 - 278
  • [8] DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval
    Zhu, Aichun
    Wang, Zijie
    Li, Yifeng
    Wan, Xili
    Jin, Jing
    Wang, Tian
    Hu, Fangqiang
    Hua, Gang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 209 - 217
  • [9] Text-based information retrieval using exponentiated gradient descent
    Papka, R
    Callan, JP
    Barto, AG
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 3 - 9
  • [10] Voice-based Information Retrieval - how far are we from the text-based information retrieval ?
    Lee, Lin-shan
    Pan, Yi-cheng
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 26 - 43