Local-enhanced representation for text-based person search

被引:1
|
作者
Zhang, Guoqing [1 ,2 ]
Chen, Yuhao [1 ]
Zheng, Yuhui [1 ]
Martin, Gaven [3 ]
Wang, Ruili [2 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Cross-modal retrieval; Local representation;
D O I
10.1016/j.patcog.2024.111247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person search is a critical task in intelligent security, designed to locate a person of interest by text descriptions. The primary challenge in this task is to effectively bridge the significant gap between the text and image domains while simultaneously extracting the discriminative features that are crucial for the accurate identification of individuals. Existing methods have made some effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address the above issues, we propose a novel local- enhanced representation framework in this paper. Specifically, to restrain noises in local features, we design a Relation-based cross-modal local-enhanced fusion module, which can filter out weak related information by relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies the intra-modal and cross- modal matching constrains in feature distribution. Finally, experiments on three mainstream datasets verify the performance superiority of our proposed method compared to existing state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Relation-aware aggregation network with auxiliary guidance for text-based person search
    Zeng, Pengpeng
    Jing, Shuaiqi
    Song, Jingkuan
    Fan, Kaixuan
    Li, Xiangpeng
    We, Liansuo
    Guo, Yuan
    World Wide Web, 2022, 25 (04) : 1565 - 1582
  • [42] Relation-aware aggregation network with auxiliary guidance for text-based person search
    Pengpeng Zeng
    Shuaiqi Jing
    Jingkuan Song
    Kaixuan Fan
    Xiangpeng Li
    Liansuo We
    Yuan Guo
    World Wide Web, 2022, 25 : 1565 - 1582
  • [43] Full-view salient feature mining and alignment for text-based person search
    Xie, Sheng
    Zhang, Canlong
    Ning, Enhao
    Li, Zhixin
    Wang, Zhiwen
    Wei, Chunrong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [44] Learning shared features from specific and ambiguous descriptions for text-based person search
    Ke Cheng
    Qikai Geng
    Shucheng Huang
    Juanjuan Tu
    Hu Lu
    Multimedia Systems, 2024, 30
  • [45] Learning shared features from specific and ambiguous descriptions for text-based person search
    Cheng, Ke
    Geng, Qikai
    Huang, Shucheng
    Tu, Juanjuan
    Lu, Hu
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [46] BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
    Fujii, Takuro
    Tarashima, Shuhei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2778 - 2782
  • [47] Fine-grained Semantics-aware Representation Learning for Text-based Person Retrieval
    Wang, Di
    Yan, Feng
    Wang, Yifeng
    Zhao, Lin
    Liang, Xiao
    Zhong, Haodi
    Zhang, Ronghua
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 92 - 100
  • [48] Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search
    You, Kaiyang
    Chen, Wenjing
    Wang, Chengji
    Sun, Hao
    Xie, Wei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2230 - 2234
  • [49] A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search
    Zang, Xianghao
    Gao, Wei
    Li, Ge
    Fang, Han
    Ban, Chao
    He, Zhongjiang
    Sun, Hao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7737 - 7746
  • [50] Multi-level Part-aware Feature Disentangling for Text-based Person Search
    Chen, Yuhao
    Zhang, Guoqing
    Zhang, Hongwei
    Zheng, Yuhui
    Lin, Weisi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2801 - 2806