Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

被引:25
|
作者
Li, Jingjing [1 ]
Jing, Mengmeng [1 ]
Zhu, Lei [2 ]
Ding, Zhengming [3 ]
Lu, Ke [1 ]
Yang, Yang [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Shandong Normal Univ, Jinan, Shandong, Peoples R China
[3] Indiana Univ Purdue Univ, Indianapolis, IN 46202 USA
基金
中国国家自然科学基金;
关键词
Zero-shot learning; mutual information estimation; generalized ZSL; variational autoencoders;
D O I
10.1145/3394171.3413503
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, feature generating methods have been successfully applied to zero-shot learning (ZSL). However, most previous approaches only generate visual representations for zero-shot recognition. In fact, typical ZSL is a classic multi-modal learning protocol which consists of a visual space and a semantic space. In this paper, therefore, we present a new method which can simultaneously generate both visual representations and semantic representations so that the essential multi-modal information associated with unseen classes can be captured. Specifically, we address the most challenging issue in such a paradigm, i.e., how to handle the domain shift and thus guarantee that the learned representations are modality-invariant. To this end, we propose two strategies: 1) leveraging the mutual information between the latent visual representations and the semantic representations; 2) maximizing the entropy of the joint distribution of the two latent representations. By leveraging the two strategies, we argue that the two modalities can be well aligned. At last, extensive experiments on five widely used datasets verify that the proposed method is able to significantly outperform previous the state-of-the-arts.
引用
收藏
页码:1348 / 1356
页数:9
相关论文
共 50 条
  • [21] Leveraging the Invariant Side of Generative Zero-Shot Learning
    Li, Jingjing
    Jing, Mengmeng
    Lu, Ke
    Ding, Zhengming
    Zhu, Lei
    Huang, Zi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7394 - 7403
  • [22] Discriminative Learning of Latent Features for Zero-Shot Recognition
    Li, Yan
    Zhang, Junge
    Zhang, Jianguo
    Huang, Kaiqi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7463 - 7471
  • [23] Zero-shot recognition with latent visual attributes learning
    Xie, Yurui
    He, Xiaohai
    Zhang, Jing
    Luo, Xiaodong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27321 - 27335
  • [24] Marginalized Latent Semantic Encoder for Zero-Shot Learning
    Ding, Zhengming
    Liu, Hongfu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6184 - 6192
  • [25] Discriminative Latent Attribute Autoencoder for Zero-Shot Learning
    Chen, Runqing
    Wu, Songsong
    Sun, Guangcheng
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 873 - 877
  • [26] Learning Discriminative Latent Attributes for Zero-Shot Classification
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Yang, Yi
    Chen, Xilin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4233 - 4242
  • [27] Zero-shot recognition with latent visual attributes learning
    Yurui Xie
    Xiaohai He
    Jing Zhang
    Xiaodong Luo
    Multimedia Tools and Applications, 2020, 79 : 27321 - 27335
  • [28] Zero-Shot Learning via Latent Space Encoding
    Yu, Yunlong
    Ji, Zhong
    Guo, Jichang
    Zhang, Zhongfei
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (10) : 3755 - 3766
  • [29] Generative-based hybrid model with semantic representations for generalized zero-shot learning
    Akdemir, Emre
    Barisci, Necaattin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [30] Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning
    Wang, Chaoqun
    Chen, Xuejin
    Min, Shaobo
    Sun, Xiaoyan
    Li, Houqiang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2710 - 2718