Dual Projective Zero-Shot Learning Using Text Descriptions

Cited by: 7
Authors
Rao, Yunbo [1]
Yang, Ziqiang [1]
Zeng, Shaoning [2]
Wang, Qifeng [3]
Pu, Jiansu [4]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, 4, Sect 2, North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Chengdu 313000, Sichuan, Peoples R China
[3] Google Berkeley, Berkeley, CA 94720 USA
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 4, Sect 2, North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; generalized zero-shot learning; autoencoder; inductive zero-shot learning;
DOI
10.1145/3514247
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Zero-shot learning (ZSL) aims to recognize image instances of unseen classes based solely on semantic descriptions of those classes. Within this field, Generalized Zero-Shot Learning (GZSL) is a challenging problem in which images of both seen and unseen classes are mixed in the testing phase. Existing methods formulate GZSL as a semantic-visual correspondence problem and apply generative models such as Generative Adversarial Networks and Variational Autoencoders to solve it. However, these methods suffer from the bias problem, since images of unseen classes are often misclassified into seen classes. In this work, a novel model named the Dual Projective model for Zero-Shot Learning (DPZSL) is proposed using text descriptions. To alleviate the bias problem, we leverage two autoencoders to project the visual and semantic features into a latent space and evaluate the embeddings with a visual-semantic correspondence loss function. An additional novel classifier is also introduced to ensure the discriminability of the embedded features. Our method focuses on the more challenging inductive ZSL setting, in which only the labeled data from seen classes are used in the training phase. The experimental results, obtained on two popular datasets, Caltech-UCSD Birds-200-2011 (CUB) and North America Birds (NAB), show that the proposed DPZSL model significantly outperforms existing methods in both the inductive ZSL and GZSL settings. In particular, in the GZSL setting, our model yields an improvement of up to 15.2% over the state-of-the-art CANZSL on the CUB and NAB datasets with two splits.
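The dual-projection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify layer types, loss forms, or weightings, so the linear encoders, the squared-error alignment term, the shared softmax classifier, the equal loss weights, and every name below (`init_autoencoder`, `dual_projection_loss`, the toy dimensions) are assumptions chosen only to show how two autoencoders, a visual-semantic correspondence term, and a classifier on the latent space might fit together.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_autoencoder(in_dim, latent_dim):
    """Weights for a one-layer linear encoder/decoder (an assumed architecture)."""
    return {
        "enc": rng.normal(scale=0.1, size=(in_dim, latent_dim)),
        "dec": rng.normal(scale=0.1, size=(latent_dim, in_dim)),
    }

def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy of integer labels under softmax(logits)."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def dual_projection_loss(x_vis, x_sem, labels, vis_ae, sem_ae, clf_w):
    """Combine reconstruction, alignment, and classification terms (equal weights assumed)."""
    # Project both modalities into the shared latent space.
    z_vis = x_vis @ vis_ae["enc"]
    z_sem = x_sem @ sem_ae["enc"]
    # Each autoencoder should reconstruct its own input.
    recon = (np.mean((z_vis @ vis_ae["dec"] - x_vis) ** 2)
             + np.mean((z_sem @ sem_ae["dec"] - x_sem) ** 2))
    # Correspondence term: paired visual and semantic embeddings should coincide.
    align = np.mean((z_vis - z_sem) ** 2)
    # A shared linear classifier keeps the latent features discriminative.
    cls = (softmax_cross_entropy(z_vis @ clf_w, labels)
           + softmax_cross_entropy(z_sem @ clf_w, labels))
    return {"recon": recon, "align": align, "cls": cls,
            "total": recon + align + cls}

# Toy data: 3 seen classes, 8 samples, random stand-ins for image and text features.
n_classes, vis_dim, sem_dim, latent_dim = 3, 16, 12, 4
labels = np.array([0, 1, 2, 0, 1, 2, 0, 1])
class_text = rng.normal(size=(n_classes, sem_dim))   # one text embedding per class
x_sem = class_text[labels]                           # pair each image with its class text
x_vis = rng.normal(size=(len(labels), vis_dim))

vis_ae = init_autoencoder(vis_dim, latent_dim)
sem_ae = init_autoencoder(sem_dim, latent_dim)
clf_w = rng.normal(scale=0.1, size=(latent_dim, n_classes))

losses = dual_projection_loss(x_vis, x_sem, labels, vis_ae, sem_ae, clf_w)
print({name: round(float(value), 4) for name, value in losses.items()})
```

Only seen-class samples enter the loss here, matching the inductive setting the abstract describes; at test time, one plausible use of such a model would be to compare an image's latent embedding with the embedded text of every candidate class.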
Pages: 17
Related papers
50 in total
  • [1] Zero-shot Learning Using Multimodal Descriptions
    Mall, Utkarsh
    Hariharan, Bharath
    Bala, Kavita
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022: 3930-3938
  • [2] ZEST: Zero-shot Learning from Text Descriptions using Textual Similarity and Visual Summarization
    Paz-Argaman, Tzuf
    Atzmon, Yuval
    Chechik, Gal
    Tsarfaty, Reut
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020: 569-579
  • [3] Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions
    Saha, Oindrila
    Van Horn, Grant
    Maji, Subhransu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 17542-17552
  • [4] Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions
    Elhoseiny, Mohamed
    Saleh, Babak
    Elgammal, Ahmed
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013: 2584-2591
  • [5] Integrating topology beyond descriptions for zero-shot learning
    Chen, Ziyi
    Gao, Yutong
    Lang, Congyan
    Wei, Lili
    Li, Yidong
    Liu, Hongzhe
    Liu, Fayao
    PATTERN RECOGNITION, 2023, 143
  • [6] CORRELATED DUAL AUTOENCODER FOR ZERO-SHOT LEARNING
    Jiang, Ming
    Liu, Zhiyong
    Li, Pengfei
    Zhang, Min
    Tang, Jingfan
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2020, 82 (01): 65-76
  • [7] Dual insurance for generalized zero-shot learning
    Liang, Jiahao
    Fang, Xiaozhao
    Kang, Peipei
    Han, Na
    Li, Chuang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03): 2111-2125
  • [8] Correlated dual autoencoder for zero-shot learning
    Jiang, Ming
    Liu, Zhiyong
    Li, Pengfei
    Zhang, Min
    Tang, Jingfan
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2020, 82 (01): 65-76
  • [9] Extreme Zero-Shot Learning for Extreme Text Classification
    Xiong, Yuanhao
    Chang, Wei-Cheng
    Hsieh, Cho-Jui
    Yu, Hsiang-Fu
    Dhillon, Inderjit
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022: 5455-5468
  • [10] Using Task Descriptions in Lifelong Machine Learning for Improved Performance and Zero-Shot Transfer
    Rostami, Mohammad
    Isele, David
    Eaton, Eric
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 67: 673-703