Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval

被引:9
|
作者
Meng, Min [1 ]
Sun, Jiaxuan [1 ]
Liu, Jigang [2 ]
Yu, Jun [1 ]
Wu, Jigang [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510006, Peoples R China
[2] Ping An Life Insurance China, Shenzhen 518046, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; hashing; adversarial learning; disentangled representation; REPRESENTATION; NETWORK;
D O I
10.1109/TCSVT.2023.3293104
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cross-modal hashing has gained considerable attention in cross-modal retrieval due to its low storage cost and prominent computational efficiency. However, preserving more semantic information in the compact hash codes to bridge the modality gap still remains challenging. Most existing methods unconsciously neglect the influence of modality-private information on semantic embedding discrimination, leading to unsatisfactory retrieval performance. In this paper, we propose a novel deep cross-modal hashing method, called Semantic Disentanglement Adversarial Hashing (SDAH), to tackle these challenges for cross-modal retrieval. Specifically, SDAH is designed to decouple the original features of each modality into modality-common features with semantic information and modality-private features with disturbing information. After the preliminary decoupling, the modality-private features are shuffled and treated as positive interactions to enhance the learning of modality-common features, which can significantly boost the discriminative and robustness of semantic embeddings. Moreover, the variational information bottleneck is introduced in the hash feature learning process, which can avoid the loss of a large amount of semantic information caused by the high-dimensional feature compression. Finally, the discriminative and compact hash codes can be computed directly from the hash features. A large number of comparative and ablation experiments show that SDAH achieves superior performance than other state-of-the-art methods.
引用
收藏
页码:1914 / 1926
页数:13
相关论文
共 50 条
  • [21] Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval
    Wang, Di
    Zhang, Caiping
    Wang, Quan
    Tian, Yumin
    He, Lihuo
    Zhao, Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1217 - 1229
  • [22] Deep semantic hashing with dual attention for cross-modal retrieval
    Wu, Jiagao
    Weng, Weiwei
    Fu, Junxia
    Liu, Linfeng
    Hu, Bin
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (07): : 5397 - 5416
  • [23] Discrete Semantic Matrix Factorization Hashing for Cross-Modal Retrieval
    Qin, Jianyang
    Fei, Lunke
    Teng, Shaohua
    Zhang, Wei
    Liu, Dongning
    Zhao, Genping
    Yuan, Haoliang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1550 - 1557
  • [24] Semantic Constraints Matrix Factorization Hashing for cross-modal retrieval
    Li, Weian
    Xiong, Haixia
    Ou, Weihua
    Gou, Jianping
    Deng, Jiaxing
    Liang, Linqing
    Zhou, Quan
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [25] Deep Visual-Semantic Hashing for Cross-Modal Retrieval
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Yang, Qiang
    Yu, Philip S.
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1445 - 1454
  • [26] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
    Weihua Ou
    Ruisheng Xuan
    Jianping Gou
    Quan Zhou
    Yongfeng Cao
    Multimedia Tools and Applications, 2020, 79 : 14733 - 14750
  • [27] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
    Ou, Weihua
    Xuan, Ruisheng
    Gou, Jianping
    Zhou, Quan
    Cao, Yongfeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14733 - 14750
  • [28] Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval
    Ma, Xinhong
    Zhang, Tianzhu
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (12) : 3101 - 3114
  • [29] Discriminant Adversarial Hashing Transformer for Cross-modal Vessel Image Retrieval
    Guan X.
    Guo J.
    Lu Y.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (12): : 4411 - 4420
  • [30] Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Li, Ning
    Liu, Wei
    Gao, Xinbo
    Tao, Dacheng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4242 - 4251