Dual Projective Zero-Shot Learning Using Text Descriptions

Cited by: 7
Authors
Rao, Yunbo [1]
Yang, Ziqiang [1]
Zeng, Shaoning [2]
Wang, Qifeng [3]
Pu, Jiansu [4]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Chengdu 313000, Sichuan, Peoples R China
[3] Google Berkeley, Berkeley, CA 94720 USA
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; generalized zero-shot learning; autoencoder; inductive zero-shot learning;
DOI
10.1145/3514247
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Zero-shot learning (ZSL) aims to recognize image instances of unseen classes based solely on semantic descriptions of those classes. Within this field, Generalized Zero-Shot Learning (GZSL) is a challenging problem in which images of both seen and unseen classes are mixed in the testing phase. Existing methods formulate GZSL as a semantic-visual correspondence problem and apply generative models such as Generative Adversarial Networks and Variational Autoencoders to solve it. However, these methods suffer from a bias problem: images of unseen classes are often misclassified into seen classes. In this work, a novel model named the Dual Projective model for Zero-Shot Learning (DPZSL) is proposed using text descriptions. To alleviate the bias problem, we leverage two autoencoders to project the visual and semantic features into a latent space and evaluate the embeddings with a visual-semantic correspondence loss function. An additional novel classifier is also introduced to ensure the discriminability of the embedded features. Our method focuses on the more challenging inductive ZSL setting, in which only the labeled data from seen classes are used in the training phase. The experimental results, obtained on two popular datasets, Caltech-UCSD Birds-200-2011 (CUB) and North America Birds (NAB), show that the proposed DPZSL model significantly outperforms existing methods in both the inductive ZSL and GZSL settings. In the GZSL setting in particular, our model yields an improvement of up to 15.2% over the state-of-the-art CANZSL on the CUB and NAB datasets under two splits.
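The abstract's description of two autoencoders, a correspondence loss, and a classifier on the latent features can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the one-layer linear encoders and decoders, the mean-squared-error form of the correspondence term, the shared softmax classifier, the equal weighting of the three terms, and all names and dimensions below are assumptions chosen only to make the idea concrete. It computes the losses for one batch and does no training.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_linear(n_in, n_out):
    """Small random weights and a zero bias for one linear layer (illustrative)."""
    return rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out)

def make_autoencoder(n_in, n_latent):
    """One-layer encoder and decoder; a real model would likely be deeper."""
    return {"enc": init_linear(n_in, n_latent), "dec": init_linear(n_latent, n_in)}

def encode(ae, x):
    w, b = ae["enc"]
    return np.tanh(x @ w + b)

def decode(ae, z):
    w, b = ae["dec"]
    return z @ w + b

def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy over the batch, computed with a stable log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def dual_projection_loss(visual_ae, semantic_ae, clf, x_vis, x_sem, labels):
    """Hypothetical combined objective; the equal weighting is an assumption."""
    z_vis = encode(visual_ae, x_vis)    # visual features -> latent space
    z_sem = encode(semantic_ae, x_sem)  # text features   -> latent space
    # Each autoencoder should reconstruct its own input.
    recon = (np.mean((decode(visual_ae, z_vis) - x_vis) ** 2)
             + np.mean((decode(semantic_ae, z_sem) - x_sem) ** 2))
    # Assumed correspondence term: paired embeddings should coincide.
    corr = np.mean((z_vis - z_sem) ** 2)
    # Assumed shared classifier over seen classes, applied to both embeddings.
    w, b = clf
    cls = (softmax_cross_entropy(z_vis @ w + b, labels)
           + softmax_cross_entropy(z_sem @ w + b, labels))
    return {"reconstruction": recon, "correspondence": corr,
            "classification": cls, "total": recon + corr + cls}

# Toy batch: 8 samples, 16-d visual features, 12-d text features, 3 seen classes.
x_vis = rng.normal(size=(8, 16))
x_sem = rng.normal(size=(8, 12))
labels = rng.integers(0, 3, size=8)

visual_ae = make_autoencoder(16, 4)
semantic_ae = make_autoencoder(12, 4)
clf = init_linear(4, 3)

losses = dual_projection_loss(visual_ae, semantic_ae, clf, x_vis, x_sem, labels)
print(losses)
```

In this sketch only seen-class labels enter the loss, which matches the inductive setting described in the abstract. How the unseen-class text descriptions are used at test time is not specified there and is left out.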
Pages: 17