Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search

被引:1
|
作者
You, Kaiyang [1 ,2 ]
Chen, Wenjing [3 ]
Wang, Chengji [1 ,2 ]
Sun, Hao [1 ,2 ]
Xie, Wei [1 ,2 ]
机构
[1] Cent China Normal Univ, Sch Comp Sci, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China
[3] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Knowledge transfer; Visualization; Transformers; Data mining; Task analysis; Sun; Text-based person search; knowledge imbalance; knowledge transfer; cross-modal fusion; TRANSFORMER;
D O I
10.1109/LSP.2024.3449222
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text-based person search aims to retrieve corresponding images of person from a large gallery based on text descriptions. Existing methods strive to bridge the modality gap between images and texts and have made promising progress. However, these approaches disregard the knowledge imbalance between images and texts caused by the reporting bias. To resolve this issue, we present a cross-modal feature fusion-based knowledge transfer network to balance identity information between images and texts. First, we design an identity information emphasis module to enhance person-relevant information and suppress person-irrelevant information. Second, we design an intermediate modal-guided knowledge transfer module to balance the knowledge between images and texts. Experimental results on CUHK-PEDES, ICFG-PEDE, and RSTPReid datasets demonstrate that our method achieves state-of-the-art performance.
引用
收藏
页码:2230 / 2234
页数:5
相关论文
共 50 条
  • [1] Cross-modal alignment with synthetic caption for text-based person search
    Weichen Zhao
    Yuxing Lu
    Zhiyuan Liu
    Yuan Yang
    Ge Jiao
    International Journal of Multimedia Information Retrieval, 2025, 14 (2)
  • [2] Text-based person search via cross-modal alignment learning
    Ke, Xiao
    Liu, Hao
    Xu, Peirong
    Lin, Xinru
    Guo, Wenzhong
    PATTERN RECOGNITION, 2024, 152
  • [3] DCEL: Deep Cross-modal Evidential Learning for Text-Based Person Retrieval
    Li, Shenshen
    Xu, Xing
    Yang, Yang
    Shen, Fumin
    Mo, Yijun
    Li, Yujie
    Shen, Heng Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6292 - 6300
  • [4] Cross-Modal Uncertainty Modeling With Diffusion-Based Refinement for Text-Based Person Retrieval
    Li, Shenshen
    Xu, Xing
    He, Chen
    Shen, Fumin
    Yang, Yang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2881 - 2893
  • [5] Cross-Modal Knowledge Adaptation for Language-Based Person Search
    Chen, Yucheng
    Huang, Rui
    Chang, Hong
    Tan, Chuanqi
    Xue, Tao
    Ma, Bingpeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4057 - 4069
  • [6] Conditional Feature Learning Based Transformer for Text-Based Person Search
    Gao, Chenyang
    Cai, Guanyu
    Jiang, Xinyang
    Zheng, Feng
    Zhang, Jun
    Gong, Yifei
    Lin, Fangzhou
    Sun, Xing
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6097 - 6108
  • [7] Text-Guided Visual Feature Refinement for Text-Based Person Search
    Gao, Liying
    Niu, Kai
    Ma, Zehong
    Jiao, Bingliang
    Tan, Tonghao
    Wang, Peng
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 118 - 126
  • [8] Feature semantic alignment and information supplement for Text-based person search
    Zhou, Hang
    Li, Fan
    Tian, Xuening
    Huang, Yuling
    FRONTIERS IN PHYSICS, 2023, 11
  • [9] Deep Multi-Semantic Fusion-Based Cross-Modal Hashing
    Zhu, Xinghui
    Cai, Liewu
    Zou, Zhuoyang
    Zhu, Lei
    MATHEMATICS, 2022, 10 (03)
  • [10] Joint Token and Feature Alignment Framework for Text-Based Person Search
    Li, Shangze
    Lu, Andong
    Huang, Yan
    Li, Chenglong
    Wang, Liang
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2238 - 2242