Generating visual representations for zero-shot learning via adversarial learning and variational autoencoders

Authors
Gull, Muqaddas [1]
Arif, Omar [1]
Affiliations
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
Keywords
Zero-shot learning; generalized zero-shot learning; variational autoencoders; visual representations; DATABASE;
DOI
10.1080/03081079.2023.2199991
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Computer vision tasks rely heavily on large amounts of training data for classification, but in everyday situations it is often impractical to assemble such data. Zero-shot learning (ZSL) is a promising approach for applications in which no labeled data are available for novel classes. It aims to recognize those unseen classes by transferring semantic information from seen to unseen classes. In this paper, we propose a generative approach for generalized ZSL that combines the strengths of a Conditional Variational Autoencoder (CVAE) and a Conditional Generative Adversarial Network (CGAN). The key to our approach is synthesizing visual features with the help of a Regressor trained with a cycle-consistency loss, which constrains the whole generative process. For the experiments, four challenging datasets, i.e. CUB, AWA1, AWA2 and SUN, are used in both conventional and generalized settings. Our proposed approach achieves significantly better results on these standard datasets in both settings.
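The abstract names three ingredients: a conditional VAE, a conditional adversarial term, and a Regressor with a cycle-consistency loss. The following is a minimal NumPy sketch of how such loss terms could be computed for one batch. It is not the authors' implementation: the single linear layers, the dimensions, the squared-error form of the reconstruction and cycle terms, and all names (`gzsl_loss_terms`, `W_enc_mu`, `W_reg`, and so on) are assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def gzsl_loss_terms(x, a, W_enc_mu, W_enc_logvar, W_dec, W_reg, w_disc):
    """Illustrative loss terms for a CVAE + CGAN + Regressor setup.

    x: visual features, shape (batch, feat_dim)
    a: class attribute vectors, shape (batch, attr_dim)
    All weight matrices are hypothetical single linear layers.
    """
    # Encoder, conditioned on attributes by concatenation.
    xa = np.concatenate([x, a], axis=1)
    mu = xa @ W_enc_mu
    logvar = xa @ W_enc_logvar

    # Reparameterization trick: z = mu + sigma * eps.
    eps = rng.standard_normal(mu.shape)
    z = mu + np.exp(0.5 * logvar) * eps

    # Decoder / generator: synthesize features from (z, a).
    x_hat = np.concatenate([z, a], axis=1) @ W_dec

    # Regressor: map synthesized features back to attribute space.
    a_hat = x_hat @ W_reg

    # Discriminator logit for the synthesized (feature, attribute) pair.
    d_logit = np.concatenate([x_hat, a], axis=1) @ w_disc

    recon = np.mean(np.sum((x - x_hat) ** 2, axis=1))
    # KL divergence between N(mu, sigma^2) and N(0, I).
    kl = np.mean(-0.5 * np.sum(1.0 + logvar - mu ** 2 - np.exp(logvar), axis=1))
    # Cycle-consistency: regressed attributes should match the conditioning ones.
    cycle = np.mean(np.sum((a - a_hat) ** 2, axis=1))
    # Non-saturating generator loss: -log(sigmoid(d_logit)) = softplus(-d_logit).
    adv = np.mean(np.logaddexp(0.0, -d_logit))
    return {"recon": recon, "kl": kl, "cycle": cycle, "adv": adv}

# Toy usage with made-up dimensions.
batch, feat_dim, attr_dim, latent_dim = 4, 8, 5, 3
x = rng.standard_normal((batch, feat_dim))
a = rng.standard_normal((batch, attr_dim))
losses = gzsl_loss_terms(
    x, a,
    W_enc_mu=rng.normal(0.0, 0.1, (feat_dim + attr_dim, latent_dim)),
    W_enc_logvar=rng.normal(0.0, 0.1, (feat_dim + attr_dim, latent_dim)),
    W_dec=rng.normal(0.0, 0.1, (latent_dim + attr_dim, feat_dim)),
    W_reg=rng.normal(0.0, 0.1, (feat_dim, attr_dim)),
    w_disc=rng.normal(0.0, 0.1, (feat_dim + attr_dim, 1)),
)
total = sum(losses.values())  # equal weighting is an arbitrary choice here
```

In a sketch like this, the cycle term is what ties the synthesized features back to the class semantics. How the terms are actually weighted and optimized in the paper is not stated in the abstract.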
Pages: 636-651
Page count: 16