Cross-Modal Feature Fusion-Based Knowledge Transfer for Text-Based Person Search

Cited by: 1

Authors:

You, Kaiyang [1,2]

Chen, Wenjing [3]

Wang, Chengji [1,2]

Sun, Hao [1,2]

Xie, Wei [1,2]

Affiliations:

[1] Cent China Normal Univ, Sch Comp Sci, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China

[2] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China

[3] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China

Source:

IEEE SIGNAL PROCESSING LETTERS | 2024 / Vol. 31

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Knowledge transfer; Visualization; Transformers; Data mining; Task analysis; Sun; Text-based person search; knowledge imbalance; knowledge transfer; cross-modal fusion; TRANSFORMER;

DOI:

10.1109/LSP.2024.3449222

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Text-based person search aims to retrieve the corresponding images of a person from a large gallery based on text descriptions. Existing methods strive to bridge the modality gap between images and texts and have made promising progress. However, these approaches disregard the knowledge imbalance between images and texts caused by reporting bias. To resolve this issue, we present a cross-modal feature fusion-based knowledge transfer network to balance identity information between images and texts. First, we design an identity information emphasis module to enhance person-relevant information and suppress person-irrelevant information. Second, we design an intermediate modal-guided knowledge transfer module to balance the knowledge between images and texts. Experimental results on the CUHK-PEDES, ICFG-PEDES, and RSTPReid datasets demonstrate that our method achieves state-of-the-art performance.


Pages: 2230-2234

Page count: 5

50 items in total

[41] 2D-Convolution Based Feature Fusion for Cross-Modal Correlation Learning
Guo, Jingjing
Yu, Jing
Lu, Yuhang
Hu, Yue
Liu, Yanbing
COMPUTATIONAL SCIENCE - ICCS 2019, PT II, 2019, 11537 : 131 - 144
[42] Hybrid Network Based on Cross-Modal Feature Fusion for Diagnosis of Alzheimer's Disease
Qiu, Zifeng
Yang, Peng
Wang, Tianfu
Lei, Baiying
ETHICAL AND PHILOSOPHICAL ISSUES IN MEDICAL IMAGING, MULTIMODAL LEARNING AND FUSION ACROSS SCALES FOR CLINICAL DECISION SUPPORT, AND TOPOLOGICAL DATA ANALYSIS FOR BIOMEDICAL IMAGING, EPIMI 2022, ML-CDS 2022, TDA4BIOMEDICALIMAGING, 2022, 13755 : 87 - 99
[43] Vision-and-Language Navigation Based on Cross-Modal Feature Fusion in Indoor Environment
Wen, Shuhuan
Lv, Xiaohan
Yu, F. Richard
Gong, Simeng
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 3 - 15
[44] Modal Complementarity Based on Multimodal Large Language Model for Text-Based Person Retrieval
Bao, Tong
Xu, Tong
Xu, Derong
Zheng, Zhi
WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 264 - 279
[45] Feature Compensation Network for Prototype-Based Cross-Modal Person Re-Identification
Murali, Nirmala
Mishra, Deepak
IEEE ACCESS, 2024, 12 : 117994 - 118006
[46] A Self-Supervised Learning Approach for Text-Based Person Search
Ji Z.
Hu J.
Ding X.
Li S.
Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (02) : 169 - 176
[47] A Simple and Robust Correlation Filtering Method for Text-Based Person Search
Suo, Wei
Sun, Mengyang
Niu, Kai
Gao, Yiqi
Wang, Peng
Zhang, Yanning
Wu, Qi
COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 726 - 742
[48] Improving Text-based Person Search by Spatial Matching and Adaptive Threshold
Chen, Tianlang
Xu, Chenliang
Luo, Jiebo
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018 : 1879 - 1887
[49] An Overview of Text-based Person Search: Recent Advances and Future Directions
Niu K.
Liu Y.
Long Y.
Huang Y.
Wang L.
Zhang Y.
IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (09) : 1 - 1
[50] Joint feature fusion hashing for cross-modal retrieval
Cao, Yuxia
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 6149 - 6162
