Unsupervised Cross-Modal Hashing With Modality-Interaction

Cited by: 22
Authors
Tu, Rong-Cheng [1 ,2 ]
Jiang, Jie [3 ]
Lin, Qinghong [4 ]
Cai, Chengfei [3 ]
Tian, Shangxuan [3 ]
Wang, Hongfa [3 ]
Liu, Wei [3 ]
Affiliations
[1] Tencent, Shenzhen 518100, Peoples R China
[2] Beijing Inst Technol, Dept Comp Sci & Technol, Beijing 100081, Peoples R China
[3] Tencent Data Platform, Shenzhen 518051, Guangdong, Peoples R China
[4] Natl Univ Singapore, Elect & Comp Engn, Singapore 138600, Singapore
Keywords
Cross-modal Retrieval; Hashing; Modality-interaction; Bit-selection; ATTENTION; NETWORK;
DOI
10.1109/TCSVT.2023.3251395
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Subject classification code
0808 ; 0809 ;
Abstract
Recently, numerous unsupervised cross-modal hashing methods have been proposed to handle image-text retrieval tasks on unlabeled cross-modal data. However, when these methods learn to generate hash codes, almost all of them lack modality interaction in two respects: 1) The instance similarity matrix used to guide the training of the hashing networks is constructed without image-text interaction, so it fails to capture the fine-grained cross-modal cues needed to characterize the intrinsic semantic similarity among the data points. 2) The binary codes used for the quantization loss are inferior because they are generated by directly quantizing a simple combination of continuous hash codes from different modalities, without any interaction among these continuous hash codes. These problems lower the quality of the generated hash codes and degrade retrieval performance. Hence, in this paper, we propose a novel Unsupervised Cross-modal Hashing with Modality-interaction, termed UCHM. Specifically, by optimizing a novel hash-similarity-friendly loss, a modality-interaction-enabled (MIE) similarity generator is first trained to generate a superior MIE similarity matrix for the training set. The generated MIE similarity matrix is then used as guiding information to train the deep hashing networks. Furthermore, while the hashing networks are being trained, a novel bit-selection module generates high-quality unified binary codes for the quantization loss through interaction among the continuous codes from different modalities, thereby further enhancing retrieval performance. Extensive experiments on two widely used datasets show that the proposed UCHM outperforms state-of-the-art techniques on cross-modal retrieval tasks.
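To make the second limitation concrete, the following is a minimal Python sketch of the baseline the abstract criticizes: unified binary codes obtained by directly quantizing a simple combination of the image and text continuous codes, together with a quantization loss and Hamming-distance ranking. It is not UCHM's bit-selection module, whose details are not given in this record. The element-wise sum as the "simple combination", the squared-error form of the loss, and all function names are assumptions made for illustration.

```python
from typing import List

Code = List[float]
Bits = List[int]


def sign(x: float) -> int:
    """Map a real value to a binary code bit in {-1, +1} (0 maps to +1)."""
    return 1 if x >= 0 else -1


def unified_codes(img_codes: List[Code], txt_codes: List[Code]) -> List[Bits]:
    """Baseline (assumed form): binarize the element-wise sum of the two
    modalities' continuous codes, with no interaction between them."""
    return [
        [sign(a + b) for a, b in zip(h_img, h_txt)]
        for h_img, h_txt in zip(img_codes, txt_codes)
    ]


def quantization_loss(continuous: List[Code], binary: List[Bits]) -> float:
    """Mean (per instance) squared gap between continuous and binary codes.
    The squared-error form is an assumption, not taken from the paper."""
    total = sum(
        (c - b) ** 2
        for h_c, h_b in zip(continuous, binary)
        for c, b in zip(h_c, h_b)
    )
    return total / len(continuous)


def hamming(a: Bits, b: Bits) -> int:
    """Number of bit positions where two binary codes differ."""
    return sum(x != y for x, y in zip(a, b))


def rank_by_hamming(query: Bits, database: List[Bits]) -> List[int]:
    """Indices of database codes ordered by increasing Hamming distance."""
    return sorted(range(len(database)), key=lambda i: hamming(query, database[i]))
```

In this sketch each bit of the unified code depends only on the sign of a sum, so a confident value in one modality can silently override the other. That is the kind of interaction-free quantization the abstract argues against.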
Pages: 5296-5308
Page count: 13