Deep Consistency Preserving Network for Unsupervised Cross-Modal Hashing

被引：0

作者：

Li, Mengluan ^{[1
]}

Guo, Yanqing ^{[1
,2
]}

Fu, Haiyan ^{[1
]}

Li, Yi ^{[2
]}

Su, Hong ^{[3
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian, Peoples R China

[2] Dalian Univ Technol, Sch Artificial Intelligence, Sch Future Technol, Dalian, Peoples R China

[3] Sci & Technol Commun Secur Lab, Chengdu, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I | 2024年 / 14425卷

基金：

中国国家自然科学基金;

关键词：

Multimodal; Unsupervised deep hashing; Cross-modal retrieval;

D O I：

10.1007/978-981-99-8429-9_19

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given the proliferation of multimodal data in search engines and social networks, unsupervised cross-modal hashing has gained traction for its low storage consumption and fast retrieval speed. Despite the great success achieved, unsupervised cross-modal hashing still suffers from lacking reliable similarity supervision and struggles with reducing information loss caused by quantization. In this paper, we propose a novel deep consistency preserving network (DCPN) for unsupervised cross-modal hashing, which sufficiently utilizes the semantic information in different modalities. Specifically, we gain consistent features to fully exploit the co-occurrence information and alleviate the heterogeneity between different modalities. Then, a fusion similarity matrix construction method is proposed to capture the semantic relationship between instances. Finally, a fusion hash code reconstruction strategy is designed to fit the gap between different modalities and reduce the quantization error. Experimental results demonstrate the effectiveness of the proposed DCPN on unsupervised cross-modal retrieval tasks.

引用

页码：235 / 246

页数：12

共 50 条

[41] Fast discrete cross-modal hashing with semantic consistency
Yao, Tao
Yan, Lianshan
Ma, Yilan
Yu, Hong
Su, Qingtang
Wang, Gang
Tian, Qi
NEURAL NETWORKS, 2020, 125 (125) : 142 - 152
[42] Generalized Semantic Preserving Hashing for Cross-Modal Retrieval
Mandal, Devraj
Chaudhury, Kunal N.
Biswas, Soma
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 102 - 112
[43] CLIP4Hashing: Unsupervised Deep Hashing for Cross-Modal Video-Text Retrieval
Zhuo, Yaoxin
Li, Yikang
Hsiao, Jenhao
Ho, Chiuman
Li, Baoxin
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 158 - 166
[44] Cross-modal hashing based on category structure preserving
Dong, Fei
Nie, Xiushan
Liu, Xingbo
Geng, Leilei
Wang, Qian
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 57 : 28 - 33
[45] Triplet-Based Deep Hashing Network for Cross-Modal Retrieval
Deng, Cheng
Chen, Zhaojia
Liu, Xianglong
Gao, Xinbo
Tao, Dacheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) : 3893 - 3903
[46] Deep medical cross-modal attention hashing
Zhang, Yong
Ou, Weihua
Shi, Yufeng
Deng, Jiaxin
You, Xinge
Wang, Anzhi
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1519 - 1536
[47] Dual-supervised attention network for deep cross-modal hashing
Peng, Hanyu
He, Junjun
Chen, Shifeng
Wang, Yali
Qiao, Yu
PATTERN RECOGNITION LETTERS, 2019, 128 : 333 - 339
[48] Deep Binary Reconstruction for Cross-Modal Hashing
Hu, Di
Nie, Feiping
Li, Xuelong
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) : 973 - 985
[49] Deep medical cross-modal attention hashing
Yong Zhang
Weihua Ou
Yufeng Shi
Jiaxin Deng
Xinge You
Anzhi Wang
World Wide Web, 2022, 25 : 1519 - 1536
[50] Deep Binary Reconstruction for Cross-modal Hashing
Li, Xuelong
Hu, Di
Nie, Feiping
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1398 - 1406

← 1 2 3 4 5 →