Local-enhanced representation for text-based person search

被引:1
|
作者
Zhang, Guoqing [1 ,2 ]
Chen, Yuhao [1 ]
Zheng, Yuhui [1 ]
Martin, Gaven [3 ]
Wang, Ruili [2 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Cross-modal retrieval; Local representation;
D O I
10.1016/j.patcog.2024.111247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person search is a critical task in intelligent security, designed to locate a person of interest by text descriptions. The primary challenge in this task is to effectively bridge the significant gap between the text and image domains while simultaneously extracting the discriminative features that are crucial for the accurate identification of individuals. Existing methods have made some effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address the above issues, we propose a novel local- enhanced representation framework in this paper. Specifically, to restrain noises in local features, we design a Relation-based cross-modal local-enhanced fusion module, which can filter out weak related information by relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies the intra-modal and cross- modal matching constrains in feature distribution. Finally, experiments on three mainstream datasets verify the performance superiority of our proposed method compared to existing state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Noise correspondence with evidence learning for text-based person search
    Xie, Yihan
    Zhang, Baohua
    Li, Yang
    Shan, Chongrui
    Wang, Shun
    Zhang, Jiale
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05):
  • [12] Conditional Feature Learning Based Transformer for Text-Based Person Search
    Gao, Chenyang
    Cai, Guanyu
    Jiang, Xinyang
    Zheng, Feng
    Zhang, Jun
    Gong, Yifei
    Lin, Fangzhou
    Sun, Xing
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6097 - 6108
  • [13] Text-based Person Search without Parallel Image-Text Data
    Bai, Yang
    Wang, Jingyao
    Cao, Min
    Chen, Chen
    Cao, Ziqiang
    Nie, Liqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 757 - 767
  • [14] Text-Guided Visual Feature Refinement for Text-Based Person Search
    Gao, Liying
    Niu, Kai
    Ma, Zehong
    Jiao, Bingliang
    Tan, Tonghao
    Wang, Peng
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 118 - 126
  • [15] Feature semantic alignment and information supplement for Text-based person search
    Zhou, Hang
    Li, Fan
    Tian, Xuening
    Huang, Yuling
    FRONTIERS IN PHYSICS, 2023, 11
  • [16] Asymmetric Cross-Scale Alignment for Text-Based Person Search
    Ji, Zhong
    Hu, Junhua
    Liu, Deyin
    Wu, Lin Yuanbo
    Zhao, Ye
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7699 - 7709
  • [17] A Self-Supervised Learning Approach for Text-Based Person Search
    Ji Z.
    Hu J.
    Ding X.
    Li S.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (02): : 169 - 176
  • [18] A Simple and Robust Correlation Filtering Method for Text-Based Person Search
    Suo, Wei
    Sun, Mengyang
    Niu, Kai
    Gao, Yiqi
    Wang, Peng
    Zhang, Yanning
    Wu, Qi
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 726 - 742
  • [19] Improving Text-based Person Search by Spatial Matching and Adaptive Threshold
    Chen, Tianlang
    Xu, Chenliang
    Luo, Jiebo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1879 - 1887
  • [20] An Overview of Text-based Person Search: Recent Advances and Future Directions
    Niu K.
    Liu Y.
    Long Y.
    Huang Y.
    Wang L.
    Zhang Y.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (09) : 1 - 1