Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search

被引：1

作者：

You, Kaiyang ^{[1
,2
]}

Chen, Wenjing ^{[3
]}

Wang, Chengji ^{[1
,2
]}

Sun, Hao ^{[1
,2
]}

Xie, Wei ^{[1
,2
]}

机构：

[1] Cent China Normal Univ, Sch Comp Sci, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China

[2] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China

[3] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Knowledge transfer; Visualization; Transformers; Data mining; Task analysis; Sun; Text-based person search; knowledge imbalance; knowledge transfer; cross-modal fusion; TRANSFORMER;

D O I：

10.1109/LSP.2024.3449222

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Text-based person search aims to retrieve corresponding images of person from a large gallery based on text descriptions. Existing methods strive to bridge the modality gap between images and texts and have made promising progress. However, these approaches disregard the knowledge imbalance between images and texts caused by the reporting bias. To resolve this issue, we present a cross-modal feature fusion-based knowledge transfer network to balance identity information between images and texts. First, we design an identity information emphasis module to enhance person-relevant information and suppress person-irrelevant information. Second, we design an intermediate modal-guided knowledge transfer module to balance the knowledge between images and texts. Experimental results on CUHK-PEDES, ICFG-PEDE, and RSTPReid datasets demonstrate that our method achieves state-of-the-art performance.

引用

页码：2230 / 2234

页数：5

共 50 条

[1] Cross-modal alignment with synthetic caption for text-based person search
Weichen Zhao
Yuxing Lu
Zhiyuan Liu
Yuan Yang
Ge Jiao
International Journal of Multimedia Information Retrieval, 2025, 14 (2)
[2] Text-based person search via cross-modal alignment learning
Ke, Xiao
Liu, Hao
Xu, Peirong
Lin, Xinru
Guo, Wenzhong
PATTERN RECOGNITION, 2024, 152
[3] DCEL: Deep Cross-modal Evidential Learning for Text-Based Person Retrieval
Li, Shenshen
Xu, Xing
Yang, Yang
Shen, Fumin
Mo, Yijun
Li, Yujie
Shen, Heng Tao
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6292 - 6300
[4] Cross-Modal Uncertainty Modeling With Diffusion-Based Refinement for Text-Based Person Retrieval
Li, Shenshen
Xu, Xing
He, Chen
Shen, Fumin
Yang, Yang
Shen, Heng Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2881 - 2893
[5] Cross-Modal Knowledge Adaptation for Language-Based Person Search
Chen, Yucheng
Huang, Rui
Chang, Hong
Tan, Chuanqi
Xue, Tao
Ma, Bingpeng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4057 - 4069
[6] Conditional Feature Learning Based Transformer for Text-Based Person Search
Gao, Chenyang
Cai, Guanyu
Jiang, Xinyang
Zheng, Feng
Zhang, Jun
Gong, Yifei
Lin, Fangzhou
Sun, Xing
Bai, Xiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6097 - 6108
[7] Text-Guided Visual Feature Refinement for Text-Based Person Search
Gao, Liying
Niu, Kai
Ma, Zehong
Jiao, Bingliang
Tan, Tonghao
Wang, Peng
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 118 - 126
[8] Feature semantic alignment and information supplement for Text-based person search
Zhou, Hang
Li, Fan
Tian, Xuening
Huang, Yuling
FRONTIERS IN PHYSICS, 2023, 11
[9] Deep Multi-Semantic Fusion-Based Cross-Modal Hashing
Zhu, Xinghui
Cai, Liewu
Zou, Zhuoyang
Zhu, Lei
MATHEMATICS, 2022, 10 (03)
[10] Joint Token and Feature Alignment Framework for Text-Based Person Search
Li, Shangze
Lu, Andong
Huang, Yan
Li, Chenglong
Wang, Liang
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2238 - 2242

← 1 2 3 4 5 →