TIPCB: A simple but effective part-based convolutional baseline for text-based person search

Cited by: 81
Authors
Chen, Yuhao [1]
Zhang, Guoqing [1]
Lu, Yujiang [1]
Wang, Zhenxing [2]
Zheng, Yuhui [1]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Math & Stat, Nanjing 210044, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cross-modality; Person search; Local representation; NETWORK; REIDENTIFICATION;
DOI
10.1016/j.neucom.2022.04.081
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text-based person search is a sub-task of image retrieval that aims to retrieve images of a target person from a given textual description. The significant feature gap between the two modalities makes this task very challenging. Many existing methods use local alignment to address this problem at the fine-grained level. However, most of them introduce additional models or complicated training and evaluation strategies, which are hard to use in realistic scenarios. To facilitate practical application, we propose a simple but effective baseline for text-based person search named TIPCB (i.e., Text-Image Part-based Convolutional Baseline). First, a novel dual-path local alignment network structure is proposed to extract visual and textual local representations, in which images are segmented horizontally and texts are aligned adaptively. Then, we propose a multi-stage cross-modal matching strategy, which eliminates the modality gap at three feature levels: low level, local level and global level. Extensive experiments on the widely used benchmark datasets (CUHK-PEDES and ICFG-PEDES) verify that our method outperforms all the existing methods. Our code has been released at https://github.com/OrangeYHChen/TIPCB. (C) 2022 Elsevier B.V. All rights reserved.
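The abstract states that images are segmented horizontally to obtain local representations. The sketch below illustrates that general idea only and is not taken from the TIPCB code: it assumes a feature map already reduced to one vector per row (H rows of C channels), splits the rows into equal horizontal stripes, and max-pools each stripe per channel. The function name horizontal_parts, the row-wise layout, the choice of max pooling, and the number of parts are all hypothetical choices made for illustration.

```python
from typing import List, Sequence


def horizontal_parts(feature_rows: Sequence[Sequence[float]],
                     num_parts: int) -> List[List[float]]:
    """Split H feature rows into num_parts horizontal stripes.

    Each stripe is reduced to one vector by taking the per-channel maximum.
    This is an illustrative assumption, not the authors' implementation.
    """
    height = len(feature_rows)
    if num_parts <= 0 or height == 0 or height % num_parts != 0:
        raise ValueError("height must be a positive multiple of num_parts")

    rows_per_part = height // num_parts
    parts = []
    for k in range(num_parts):
        stripe = feature_rows[k * rows_per_part:(k + 1) * rows_per_part]
        # zip(*stripe) groups values channel by channel across the stripe rows.
        parts.append([max(channel) for channel in zip(*stripe)])
    return parts


# Toy input: 4 rows, 2 channels, split into 2 stripes of 2 rows each.
toy_rows = [[1, 0], [2, 5], [3, 1], [0, 4]]
toy_parts = horizontal_parts(toy_rows, num_parts=2)
print(toy_parts)
```

In a real network the stripes would be cut from a convolutional feature map and each part vector would be matched against a textual counterpart; the released repository linked above is the authoritative reference for how TIPCB does this.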
Pages: 171-181
Page count: 11
Related papers
50 in total
  • [41] PH-GCN: Person Retrieval With Part-Based Hierarchical Graph Convolutional Network
    Jiang, Bo
    Wang, Xixi
    Zheng, Aihua
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3218 - 3228
  • [42] Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search
    You, Kaiyang
    Chen, Wenjing
    Wang, Chengji
    Sun, Hao
    Xie, Wei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2230 - 2234
  • [43] Text-based person search by non-saliency enhancing and dynamic label smoothing
    Pang Y.
    Zhang C.
    Li Z.
    Wei C.
    Wang Z.
    Neural Computing and Applications, 2024, 36 (21) : 13327 - 13339
  • [44] Relation-aware aggregation network with auxiliary guidance for text-based person search
    Zeng, Pengpeng
    Jing, Shuaiqi
    Song, Jingkuan
    Fan, Kaixuan
    Li, Xiangpeng
    We, Liansuo
    Guo, Yuan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04) : 1565 - 1582
  • [47] Full-view salient feature mining and alignment for text-based person search
    Xie, Sheng
    Zhang, Canlong
    Ning, Enhao
    Li, Zhixin
    Wang, Zhiwen
    Wei, Chunrong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [49] Learning shared features from specific and ambiguous descriptions for text-based person search
    Cheng, Ke
    Geng, Qikai
    Huang, Shucheng
    Tu, Juanjuan
    Lu, Hu
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [50] A Multi-configuration Part-based Person Detector
    Garcia-Martin, Alvaro
    Evangelio, Ruben Heras
    Sikora, Thomas
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP), 2014 : 321 - 328