Generating visual representations for zero-shot learning via adversarial learning and variational autoencoders

被引:2
|
作者
Gull, Muqaddas [1 ]
Arif, Omar [1 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
关键词
Zero-shot learning; generalized zero-shot learning; variational autoencoders; visual representations; DATABASE;
D O I
10.1080/03081079.2023.2199991
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Computer vision tasks rely heavily on a huge amount of training data for classification, but in everyday situations, it is impossible to assemble a large amount of training data. Zero-shot learning (ZSL) is a promising domain for the applications in which we have no labeled data available for novel classes. It aims to recognize those unseen classes, by transferring semantic information from seen to unseen classes. In this paper, we propose a generative approach for generalized ZSL that combines the strength of Conditional Variational Autoencoder (CVAE) and Conditional Generative Adversarial Network (CGAN). The key to our approach is synthesizing visual features by including a Regressor that works on cycle-consistency loss, which will constrain the whole generative process. For experimental purposes, four challenging data sets, i.e. CUB, AWA1, AWA2 and SUN, are used in both conventional and generalized settings. Our proposed approach achieves significantly better results on these standard datasets in both settings.
引用
收藏
页码:636 / 651
页数:16
相关论文
共 50 条
  • [41] Zero-shot recognition with latent visual attributes learning
    Xie, Yurui
    He, Xiaohai
    Zhang, Jing
    Luo, Xiaodong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27321 - 27335
  • [42] Joint Visual and Semantic Optimization for zero-shot learning
    Wu, Hanrui
    Yan, Yuguang
    Chen, Sentao
    Huang, Xiangkang
    Wu, Qingyao
    Ng, Michael K.
    KNOWLEDGE-BASED SYSTEMS, 2021, 215 (215)
  • [43] Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
    Liu, Shaoteng
    Chen, Jingjing
    Pan, Liangming
    Ngo, Chong-Wah
    Chua, Tat-Seng
    Jiang, Yu-Gang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9270 - 9278
  • [44] Learning semantic consistency for audio-visual zero-shot learning
    Xiaoyong Li
    Jing Yang
    Yuling Chen
    Wei Zhang
    Xiaoli Ruan
    Chengjiang Li
    Zhidong Su
    Artificial Intelligence Review, 58 (7)
  • [45] Semantically Grounded Visual Embeddings for Zero-Shot Learning
    Nawaz, Shah
    Cavazza, Jacopo
    Del Bue, Alessio
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4588 - 4598
  • [46] A Generative Model For Zero Shot Learning Using Conditional Variational Autoencoders
    Mishra, Ashish
    Reddy, Shiva Krishna
    Mittal, Anurag
    Murthy, Hema A.
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2269 - 2277
  • [47] Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings
    Shen, Fumin
    Zhou, Xiang
    Yu, Jun
    Yang, Yang
    Liu, Li
    Shen, Heng Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3662 - 3674
  • [48] Learning unseen visual prototypes for zero-shot classification
    Li, Xiao
    Fang, Min
    Feng, Dazheng
    Li, Haikun
    Wu, Jinqiao
    KNOWLEDGE-BASED SYSTEMS, 2018, 160 : 176 - 187
  • [49] Zero-shot recognition with latent visual attributes learning
    Yurui Xie
    Xiaohai He
    Jing Zhang
    Xiaodong Luo
    Multimedia Tools and Applications, 2020, 79 : 27321 - 27335
  • [50] Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning
    Li, Jingjing
    Jing, Mengmeng
    Zhu, Lei
    Ding, Zhengming
    Lu, Ke
    Yang, Yang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1348 - 1356