Semi-supervised cross-modal representation learning with GAN-based Asymmetric Transfer Network

Cited by: 1
Authors
Zhang, Lei [1, 2]
Chen, Leiting [1, 2, 3]
Ou, Weihua [4]
Zhou, Chuan [1, 2]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Digital Media Technol Key Lab Sichuan Prov, Chengdu, Peoples R China
[3] Inst Elect & Informat Engn UESTC Guangdong, Dongguan, Peoples R China
[4] Guizhou Normal Univ, Sch Big Data & Comp Sci, Guiyang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cross-modal retrieval; Modality gap; Generative adversarial network;
DOI
10.1016/j.jvcir.2020.102899
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In this paper, we propose a semi-supervised common representation learning method with a GAN-based Asymmetric Transfer Network (GATN) for cross-modal retrieval. GATN uses an asymmetric pipeline to guarantee semantic consistency and adopts a Generative Adversarial Network (GAN) to fit the distributions of the different modalities. Specifically, common representation learning across modalities proceeds in two stages: (1) in the first stage, GATN trains a source mapping network to learn the semantic representation of the text modality in a supervised manner; and (2) in the second stage, a GAN-based unsupervised modality transfer method guides the training of the target mapping network, using a generative network (the target mapping network itself) and a discriminative network. Experimental results on three widely used benchmarks show that GATN achieves better performance than several existing state-of-the-art methods.
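The two-stage procedure in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the linear mappings, the logistic-regression discriminator, the use of pre-softmax class scores as the common representation, the synthetic features, and every function name and hyperparameter below are assumptions made only for illustration.

```python
import numpy as np


def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilize the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))


def train_source_mapping(text_x, labels, n_classes, lr=0.5, epochs=200):
    """Stage 1 (supervised): fit a linear text -> common-space map by softmax regression."""
    w = np.zeros((text_x.shape[1], n_classes))
    onehot = np.eye(n_classes)[labels]
    for _ in range(epochs):
        probs = softmax(text_x @ w)
        w -= lr * text_x.T @ (probs - onehot) / len(text_x)  # cross-entropy gradient
    return w


def train_target_mapping(image_x, source_repr, rng, lr=0.05, steps=200):
    """Stage 2 (unsupervised): adversarially fit a linear image -> common-space map."""
    dim = source_repr.shape[1]
    g_w = rng.normal(scale=0.1, size=(image_x.shape[1], dim))  # generator = target mapping
    d_w, d_b = np.zeros(dim), 0.0  # logistic-regression discriminator
    for _ in range(steps):
        fake = image_x @ g_w
        # Discriminator step: source representations count as real (1), target ones as fake (0).
        p_real = sigmoid(source_repr @ d_w + d_b)
        p_fake = sigmoid(fake @ d_w + d_b)
        d_w -= lr * (source_repr.T @ (p_real - 1.0) / len(source_repr)
                     + fake.T @ p_fake / len(fake))
        d_b -= lr * ((p_real - 1.0).mean() + p_fake.mean())
        # Generator step: non-saturating loss -log D(G(x)), discriminator held fixed.
        p_fake = sigmoid(fake @ d_w + d_b)
        g_w -= lr * image_x.T @ ((p_fake - 1.0)[:, None] * d_w[None, :]) / len(image_x)
    return g_w


# Toy data (hypothetical): labeled text features and unlabeled image features.
rng = np.random.default_rng(0)
labels = np.arange(60) % 3
text_x = np.eye(3)[labels] + 0.05 * rng.normal(size=(60, 3))
image_x = rng.normal(size=(80, 5))

source_w = train_source_mapping(text_x, labels, n_classes=3)
source_repr = text_x @ source_w  # text embedded in the common space
train_acc = float((source_repr.argmax(axis=1) == labels).mean())
target_w = train_target_mapping(image_x, source_repr, rng)
image_repr = image_x @ target_w  # images embedded in the same space
```

The asymmetry is that only the text side sees labels: the source mapping is trained first and then held fixed, while the image side is pulled toward the distribution of text representations purely through the discriminator's feedback.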
Pages: 9