Unsupervised Cross-Modal Hashing With Modality-Interaction

Cited by: 22

Authors:

Tu, Rong-Cheng [1, 2]

Jiang, Jie [3]

Lin, Qinghong [4]

Cai, Chengfei [3]

Tian, Shangxuan [3]

Wang, Hongfa [3]

Liu, Wei [3]

Affiliations:

[1] Tencent, Shenzhen 518100, Peoples R China

[2] Beijing Inst Technol, Dept Comp Sci & Technol, Beijing 100081, Peoples R China

[3] Tencent Data Platform, Shenzhen 518051, Guangdong, Peoples R China

[4] Natl Univ Singapore, Elect & Comp Engn, Singapore 138600, Singapore

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023 / Vol. 33 / No. 09

Keywords:

Cross-modal Retrieval; Hashing; Modality-interaction; Bit-selection; ATTENTION; NETWORK;

DOI:

10.1109/TCSVT.2023.3251395

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Recently, numerous unsupervised cross-modal hashing methods have been proposed to deal with image-text retrieval tasks on unlabeled cross-modal data. However, when these methods learn to generate hash codes, almost all of them lack modality-interaction in the following two aspects: 1) The instance similarity matrix used to guide the training of the hashing networks is constructed without image-text interaction, and thus fails to capture the fine-grained cross-modal cues needed to characterize the intrinsic semantic similarity among the datapoints. 2) The binary codes used for the quantization loss are inferior because they are generated by directly quantizing a simple combination of continuous hash codes from different modalities, without any interaction among these continuous hash codes. These problems cause the generated hash codes to be of poor quality and degrade the retrieval performance. Hence, in this paper, we propose a novel Unsupervised Cross-modal Hashing with Modality-interaction, termed UCHM. Specifically, by optimizing a novel hash-similarity-friendly loss, a modality-interaction-enabled (MIE) similarity generator is first trained to generate a superior MIE similarity matrix for the training set. Then, the generated MIE similarity matrix is used as guiding information to train the deep hashing networks. Furthermore, while the hashing networks are being trained, a novel bit-selection module uses the interaction among continuous codes from different modalities to generate high-quality unified binary codes for the quantization loss, thereby further enhancing the retrieval performance. Extensive experiments on two widely used datasets show that the proposed UCHM outperforms state-of-the-art techniques on cross-modal retrieval tasks.


Pages: 5296-5308

Page count: 13

50 entries in total

[31] Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing
Yang, Xiaohan
Wang, Zhen
Wu, Nannan
Li, Guokun
Feng, Chuang
Liu, Pingping
MATHEMATICS, 2022, 10 (15)
[32] High-order nonlocal Hashing for unsupervised cross-modal retrieval
Zhang, Peng-Fei
Luo, Yadan
Huang, Zi
Xu, Xin-Shun
Song, Jingkuan
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (02) : 563 - 583
[33] Structure-aware contrastive hashing for unsupervised cross-modal retrieval
Cui, Jinrong
He, Zhipeng
Huang, Qiong
Fu, Yulu
Li, Yuting
Wen, Jie
NEURAL NETWORKS, 2024, 174
[34] Regularised Cross-Modal Hashing
Moran, Sean
Lavrenko, Victor
SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015 : 907 - 910
[35] Semi-supervised cross-modal hashing via modality-specific and cross-modal graph convolutional networks
Wu, Fei
Li, Shuaishuai
Gao, Guangwei
Ji, Yimu
Jing, Xiao-Yuan
Wan, Zhiguo
PATTERN RECOGNITION, 2023, 136
[36] Flexible Cross-Modal Hashing
Yu, Guoxian
Liu, Xuanwu
Wang, Jun
Domeniconi, Carlotta
Zhang, Xiangliang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 304 - 314
[37] Discriminant Cross-modal Hashing
Xu, Xing
Shen, Fumin
Yang, Yang
Shen, Heng Tao
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016 : 305 - 308
[38] Extensible Cross-Modal Hashing
Chen, Tian-yi
Zhang, Lan
Zhang, Shi-cong
Li, Zi-long
Huang, Bai-chuan
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019 : 2109 - 2115
[39] Cross-Modal Discrete Hashing
Liong, Venice Erin
Lu, Jiwen
Tan, Yap-Peng
PATTERN RECOGNITION, 2018, 79 : 114 - 129
[40] Deep Cross-Modal Hashing
Jiang, Qing-Yuan
Li, Wu-Jun
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017 : 3270 - 3278
