Deep Cross-Modal Hashing Based on Semantic Consistent Ranking

被引:21
|
作者
Liu, Xiaoqing [1 ]
Zeng, Huanqiang [1 ,2 ]
Shi, Yifan [2 ]
Zhu, Jianqing [2 ]
Hsia, Chih-Hsien [3 ]
Ma, Kai-Kuang [4 ]
机构
[1] Huaqiao Univ, Sch Informat Sci & Engn, Quanzhou 362021, Peoples R China
[2] Huaqiao Univ, Sch Engn, Quanzhou 362021, Peoples R China
[3] Ilan Univ, Dept Comp Sci & Informat Engn, Yilan City 260, Taiwan
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang 639798, Singapore
关键词
Cross-modal hashing; rank learning; heterogeneous gap; intra-modal similarity; IMAGE RETRIEVAL; REPRESENTATIONS; NETWORK; CODES;
D O I
10.1109/TMM.2023.3254199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of multi-modal data available on the Internet is enormous. Cross-modal hash retrieval maps heterogeneous cross-modal data into a single Hamming space to offer fast and flexible retrieval services. However, existing cross-modal methods mainly rely on the feature-level similarity between multi-modal data and ignore the relationship between relative rankings and label-level fine-grained similarity of neighboring instances. To overcome these issues, we propose a novel Deep Cross-modal Hashing based on Semantic Consistent Ranking (DCH-SCR) that comprehensively investigates the intra-modal semantic similarity relationship. Firstly, to the best of our knowledge, it is an early attempt to preserve semantic similarity for cross-modal hashing retrieval by combining label-level and feature-level information. Secondly, the inherent gap between modalities is narrowed by developing a ranking alignment loss function. Thirdly, the compact and efficient hash codes are optimized based on the common semantic space. Finally, we use the gradient to specify the optimization direction and introduce the Normalized Discounted Cumulative Gain (NDCG) to achieve varying optimization strengths for data pairs with different similarities. Extensive experiments on three real-world image-text retrieval datasets demonstrate the superiority of DCH-SCR over several state-of-the-art cross-modal retrieval methods.
引用
收藏
页码:9530 / 9542
页数:13
相关论文
共 50 条
  • [41] SCH: Symmetric Consistent Hashing for cross-modal retrieval
    Ni, Haomin
    Fang, Xiaozhao
    Kang, Peipei
    Gao, Hongbo
    Zhou, Guoxu
    Xie, Shengli
    SIGNAL PROCESSING, 2024, 215
  • [42] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Zhang, Cheng
    Wan, Yuan
    Qiang, Haopeng
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5383 - 5397
  • [43] Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search
    Jin, Lu
    Li, Kai
    Li, Zechao
    Xiao, Fu
    Qi, Guo-Jun
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1429 - 1440
  • [44] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Cheng Zhang
    Yuan Wan
    Haopeng Qiang
    Neural Computing and Applications, 2024, 36 : 5383 - 5397
  • [45] Deep Semantic-Preserving Reconstruction Hashing for Unsupervised Cross-Modal Retrieval
    Cheng, Shuli
    Wang, Liejun
    Du, Anyu
    ENTROPY, 2020, 22 (11) : 1 - 22
  • [46] Cross-Modal Hashing Retrieval Based on Deep Residual Network
    Li, Zhiyi
    Xu, Xiaomian
    Zhang, Du
    Zhang, Peng
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (02): : 383 - 405
  • [47] Cross-modal retrieval based on deep regularized hashing constraints
    Khan, Asad
    Hayat, Sakander
    Ahmad, Muhammad
    Wen, Jinyu
    Farooq, Muhammad Umar
    Fang, Meie
    Jiang, Wenchao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (09) : 6508 - 6530
  • [48] Deep medical cross-modal attention hashing
    Zhang, Yong
    Ou, Weihua
    Shi, Yufeng
    Deng, Jiaxin
    You, Xinge
    Wang, Anzhi
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1519 - 1536
  • [49] Unsupervised Deep Fusion Cross-modal Hashing
    Huang, Jiaming
    Min, Chen
    Jing, Liping
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 358 - 366
  • [50] Deep Binary Reconstruction for Cross-Modal Hashing
    Hu, Di
    Nie, Feiping
    Li, Xuelong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) : 973 - 985