Deep cross-modal hashing with fine-grained similarity

被引:0
|
作者
Yangdong Chen
Jiaqi Quan
Yuejie Zhang
Rui Feng
Tao Zhang
机构
[1] Fudan University,School of Computer Science, Shanghai Key Lab of Intelligent Information Processing
[2] Shanghai University of Finance and Economics,School of Information Management and Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Cross-modal retrieval; Deep hashing; Fine-grained similarity; Focal loss;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-modal hashing has attracted noticeable attention in the multimedia community. Two-stage methods often show impressive performance by first learning hash codes for data instances from different modalities, then learning hash functions that map the original multimodal data to the low dimensional hash codes. However, most existing two-stage methods can hardly obtain satisfactory hash codes at the first stage, as the commonly used coarse-grained similarity matrix fails to capture the differentiated similarity relationships between the original data instances. Besides, such methods cannot obtain satisfactory hash functions at the second stage, where the learning of hash functions is treated as a multi-binary classification problem. In this paper, we propose a novel two-stage hashing method for cross-modal retrieval. At the first stage, we capture the differentiated similarity relationships between data instances by designing a fine-grained similarity matrix and add an Autoencoder to mine the semantic information. At the second stage, we introduce a similarity sensitivity learning strategy under the guidance of the similarity matrix to train the hash functions. This strategy makes the training process sensitive to the similar and hard pairs, boosting the retrieval performance. Comprehensive experiments on three benchmark datasets validate the effectiveness of our method.
引用
收藏
页码:28954 / 28973
页数:19
相关论文
共 50 条
  • [21] Deep Cross-modal Hashing Based on Intra-modal Similarity and Semantic Preservation
    Li T.
    Liu L.
    Data Analysis and Knowledge Discovery, 2023, 7 (05) : 105 - 115
  • [22] Asymmetric Deep Cross-modal Hashing
    Gu, Jingzi
    Zhang, JinChao
    Lin, Zheng
    Li, Bo
    Wang, Weiping
    Meng, Dan
    COMPUTATIONAL SCIENCE - ICCS 2019, PT V, 2019, 11540 : 41 - 54
  • [23] Cross-Modal Deep Variational Hashing
    Liong, Venice Erin
    Lu, Jiwen
    Tan, Yap-Peng
    Zhou, Jie
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4097 - 4105
  • [24] Multi-label adversarial fine-grained cross-modal retrieval
    Sun, Chunpu
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Wang, Lin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117
  • [25] Integration of Global and Local Representations for Fine-Grained Cross-Modal Alignment
    Jin, Seungwan
    Choi, Hoyoung
    Noh, Taehyung
    Han, Kyungsik
    COMPUTER VISION - ECCV 2024, PT LXXXIII, 2025, 15141 : 53 - 70
  • [26] Cross-Modal Fine-Grained Interaction Fusion in Fake News Detection
    Che, Zhanbin
    Cui, GuangBo
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 945 - 956
  • [27] VIDEO-MUSIC RETRIEVAL WITH FINE-GRAINED CROSS-MODAL ALIGNMENT
    Era, Yuki
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2005 - 2009
  • [28] ViT2CMH: Vision Transformer Cross-Modal Hashing for Fine-Grained Vision-Text Retrieval
    Li M.
    Li Q.
    Jiang Z.
    Ma Y.
    Computer Systems Science and Engineering, 2023, 46 (02): : 1401 - 1414
  • [29] Deep Saliency Hashing for Fine-Grained Retrieval
    Jin, Sheng
    Yao, Hongxun
    Sun, Xiaoshuai
    Zhou, Shangchen
    Zhang, Lei
    Hua, Xiansheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5336 - 5351
  • [30] Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search
    Jin, Lu
    Li, Kai
    Li, Zechao
    Xiao, Fu
    Qi, Guo-Jun
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1429 - 1440