SCH-GAN: Semi-Supervised Cross-Modal Hashing by Generative Adversarial Network

被引:92
|
作者
Zhang, Jian [1 ]
Peng, Yuxin [1 ]
Yuan, Mingkuan [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Data models; Correlation; Generative adversarial networks; Training data; Predictive models; Gallium nitride; Cross-modal hashing; generative adversarial network (GAN); semi-supervised; CODES;
D O I
10.1109/TCYB.2018.2868826
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing maps heterogeneous multimedia data into a common Hamming space to realize fast and flexible cross-modal retrieval. Supervised cross-modal hashing methods have achieved considerable progress by incorporating semantic side information. However, they heavily rely on large-scale labeled cross-modal training data which are hard to obtain, since multiple modalities are involved. They also ignore the rich information contained in the large amount of unlabeled data across different modalities, which can help to model the correlations between different modalities. To address these problems, in this paper, we propose a novel semi-supervised cross-modal hashing approach by generative adversarial network (SCH-GAN). The main contributions can be summarized as follows: 1) we propose a novel generative adversarial network for cross-modal hashing, in which the generative model tries to select margin examples of one modality from unlabeled data when given a query of another modality (e.g., giving a text query to retrieve images and vice versa). The discriminative model tries to distinguish the selected examples and true positive examples of the query. These two models play a minimax game so that the generative model can promote the hashing performance of the discriminative model and 2) we propose a reinforcement learning-based algorithm to drive the training of proposed SCH-GAN. The generative model takes the correlation score predicted by discriminative model as a reward, and tries to select the examples close to the margin to promote a discriminative model. Extensive experiments verify the effectiveness of our proposed approach, compared with nine state-of-the-art methods on three widely used datasets.
引用
收藏
页码:489 / 502
页数:14
相关论文
共 50 条
  • [21] A semi-supervised cross-modal memory bank for cross-modal retrieval
    Huang, Yingying
    Hu, Bingliang
    Zhang, Yipeng
    Gao, Chi
    Wang, Quan
    NEUROCOMPUTING, 2024, 579
  • [22] Semi-supervised cross-modal representation learning with GAN-based Asymmetric Transfer Network
    Zhang, Lei
    Chen, Leiting
    Ou, Weihua
    Zhou, Chuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
  • [23] CCS-GAN: a semi-supervised generative adversarial network for image classification
    Lei Wang
    Yu Sun
    Zheng Wang
    The Visual Computer, 2022, 38 : 2009 - 2021
  • [24] CCS-GAN: a semi-supervised generative adversarial network for image classification
    Wang, Lei
    Sun, Yu
    Wang, Zheng
    VISUAL COMPUTER, 2022, 38 (06): : 2009 - 2021
  • [25] S3ACH: Semi-Supervised Semantic Adaptive Cross-Modal Hashing
    Yang, Liu
    Zhang, Kaiting
    Li, Yinan
    Chen, Yunfei
    Long, Jun
    Yang, Zhan
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 252 - 269
  • [26] Semi-supervised constrained graph convolutional network for cross-modal retrieval
    Zhang, Lei
    Chen, Leiting
    Ou, Weihua
    Zhou, Chuan
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [27] Self-Training Based Semi-Supervised and Semi-Paired Hashing Cross-Modal Retrieval
    Jing, Rongrong
    Tian, Hu
    Zhang, Xingwei
    Zhou, Gang
    Zheng, Xiaolong
    Zeng, Dajun
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [28] Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval
    Zhang, Peng-Fei
    Li, Chuan-Xiang
    Liu, Meng-Yuan
    Nie, Liqiang
    Xu, Xin-Shun
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1762 - 1770
  • [29] Semi-supervised classification-aware cross-modal deep adversarial data augmentation
    Wang, Shaoqiang
    Wu, Zhenzhen
    He, Gewen
    Wang, Shudong
    Sun, Hongwei
    Fan, Fangfang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 194 - 205
  • [30] Generative adversarial network for semi-supervised image captioning
    Liang, Xu
    Li, Chen
    Tian, Lihua
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249