Local-enhanced representation for text-based person search

Cited by: 1
Authors
Zhang, Guoqing [1, 2]
Chen, Yuhao [1]
Zheng, Yuhui [1]
Martin, Gaven [3]
Wang, Ruili [2, 4]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person re-identification; Cross-modal retrieval; Local representation
DOI
10.1016/j.patcog.2024.111247
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text-based person search is a critical task in intelligent security: locating a person of interest from a text description. The primary challenge is to bridge the significant gap between the text and image domains while also extracting the discriminative features needed to identify individuals accurately. Existing methods have made effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address these issues, we propose a novel local-enhanced representation framework. Specifically, to suppress noise in local features, we design a relation-based cross-modal local-enhanced fusion module, which filters out weakly related information by relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies intra-modal and cross-modal matching constraints to the feature distribution. Finally, experiments on three mainstream datasets verify that the proposed method outperforms existing state-of-the-art methods.
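The abstract does not give the form of the intra-cross modal projection strategy. As a rough illustration only, the sketch below assumes a projection-matching loss of the familiar cross-modal kind (a KL divergence between a softmax over scalar projections and a label-derived match distribution) and adds the same term within each modality. The function names, the `intra_weight` parameter, and the unweighted sum of terms are hypothetical and are not taken from the paper.

```python
import numpy as np


def projection_matching_loss(a, b, labels, eps=1e-8):
    """KL divergence between projection-based and label-based match distributions.

    a, b:   (batch, dim) feature matrices; row i of each belongs to identity labels[i].
    This is an assumed, generic projection-matching loss, not the paper's exact one.
    """
    # Normalise b so that a @ b_norm.T holds scalar projections of a onto b.
    b_norm = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    logits = a @ b_norm.T
    # Row-wise softmax, shifted for numerical stability.
    logits = logits - logits.max(axis=1, keepdims=True)
    p = np.exp(logits)
    p /= p.sum(axis=1, keepdims=True)
    # Target distribution: uniform over batch entries sharing the same identity.
    match = (labels[:, None] == labels[None, :]).astype(float)
    q = match / match.sum(axis=1, keepdims=True)
    # KL(p || q) per row, averaged over the batch.
    return float(np.mean(np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=1)))


def intra_cross_projection_loss(img, txt, labels, intra_weight=1.0):
    """Sum of cross-modal and intra-modal matching terms (illustrative combination)."""
    cross = (projection_matching_loss(img, txt, labels)
             + projection_matching_loss(txt, img, labels))
    intra = (projection_matching_loss(img, img, labels)
             + projection_matching_loss(txt, txt, labels))
    return cross + intra_weight * intra
```

Under these assumptions, a batch whose text features line up with the image features of the same identity should receive a lower loss than one where the pairing is swapped; the intra-modal terms additionally pull same-identity features together within each modality.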
Pages: 12