Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search

Cited by: 1
Authors
You, Kaiyang [1, 2]
Chen, Wenjing [3]
Wang, Chengji [1, 2]
Sun, Hao [1, 2]
Xie, Wei [1, 2]
Affiliations
[1] Cent China Normal Univ, Sch Comp Sci, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China
[3] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Knowledge transfer; Visualization; Transformers; Data mining; Task analysis; Sun; Text-based person search; knowledge imbalance; knowledge transfer; cross-modal fusion; TRANSFORMER;
DOI
10.1109/LSP.2024.3449222
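Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract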
Text-based person search aims to retrieve images of a target person from a large gallery based on text descriptions. Existing methods strive to bridge the modality gap between images and texts and have made promising progress. However, these approaches disregard the knowledge imbalance between images and texts caused by reporting bias. To resolve this issue, we present a cross-modal feature fusion-based knowledge transfer network to balance identity information between images and texts. First, we design an identity information emphasis module to enhance person-relevant information and suppress person-irrelevant information. Second, we design an intermediate modal-guided knowledge transfer module to balance the knowledge between images and texts. Experimental results on the CUHK-PEDES, ICFG-PEDES, and RSTPReid datasets demonstrate that our method achieves state-of-the-art performance.
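The retrieval setting described in the abstract can be illustrated with a minimal sketch: a text query and each gallery image are assumed to be already encoded as vectors in a shared space, and the gallery is ranked by cosine similarity to the query. This is only a generic illustration of text-to-image ranking, not the paper's network; the function names `cosine` and `rank_gallery` and all embedding values are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank_gallery(text_emb, gallery_embs):
    """Return gallery indices ordered from most to least similar to the query."""
    scores = [cosine(text_emb, g) for g in gallery_embs]
    return sorted(range(len(gallery_embs)), key=lambda i: scores[i], reverse=True)

# Toy embeddings (made-up values, assumed to come from some text/image encoders).
query = [1.0, 0.0, 1.0]
gallery = [
    [0.0, 1.0, 0.0],  # unrelated image
    [1.0, 0.1, 0.9],  # close match
    [0.5, 0.5, 0.5],  # partial match
]
ranking = rank_gallery(query, gallery)
print(ranking)  # gallery indices, best match first
```

In this setting, a query counts as a hit at Rank-1 when the top-ranked gallery image shows the described person.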
Pages: 2230-2234
Number of pages: 5