Zero-Shot Learning via Robust Latent Representation and Manifold Regularization

被引:39
|
作者
Meng, Min [1 ]
Yu, Jun [2 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China
[2] Hangzhou Dianzi Univ, Dept Comp Sci, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero shot learning; image classification; latent subspace; manifold regularization;
D O I
10.1109/TIP.2018.2881926
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) for visual recognition aims to accurately recognize the objects of unseen classes through mapping the visual feature to an embedding space spanned by class semantic information. However, the semantic gap across visual features and their underlying semantics is still a big obstacle in ZSL. Conventional ZSL methods construct that the mapping typically focus on the original visual features that are independent of the ZSL tasks, thus degrading the prediction performance. In this paper, we propose an effective method to uncover an appropriate latent representation of data for the purpose of zero-shot classification. Specifically, we formulate a novel framework to jointly learn the latent subspace and cross-modal embedding to link visual features with their semantic representations. The proposed framework combines feature learning and semantics prediction, such that the learned data representation is more discriminative to predict the semantic vectors, hence improving the overall classification performance. To learn a robust latent subspace, we explicitly avoid the information loss by ensuring the reconstruction ability of the obtained data representation. An efficient algorithm is designed to solve the proposed optimization problem. To fully exploit the intrinsic geometric structure of data, we develop a manifold regularization strategy to refine the learned semantic representations, leading to further improvements of the classification performance. To validate the effectiveness of the proposed approach, extensive experiments are conducted on three ZSL benchmarks and encouraging results are achieved compared with the state-of-the-art ZSL methods.
引用
收藏
页码:1824 / 1836
页数:13
相关论文
共 50 条
  • [31] Zero-shot Learning via Simultaneous Generating and Learning
    Yu, Hyeonwoo
    Lee, Beomhee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [32] Dissimilarity Representation Learning for Generalized Zero-Shot Recognition
    Yang, Gang
    Liu, Jinlu
    Xu, Jieping
    Li, Xirong
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2032 - 2039
  • [33] Zero-Shot Learning via Visual Abstraction
    Antol, Stanislaw
    Zitnick, C. Lawrence
    Parikh, Devi
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 401 - 416
  • [34] Discriminative and Robust Attribute Alignment for Zero-Shot Learning
    Cheng, De
    Wang, Gerong
    Wang, Nannan
    Zhang, Dingwen
    Zhang, Qiang
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4244 - 4256
  • [35] Robust Zero-Shot Learning with Source Attributes Noise
    Yu, Jun
    Wu, Songsong
    Wang, Lu
    Jing, Xiao-Yuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 205 - 209
  • [36] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
    Qian Wang
    Ke Chen
    International Journal of Computer Vision, 2017, 124 : 356 - 383
  • [37] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
    Wang, Qian
    Chen, Ke
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (03) : 356 - 383
  • [38] Hierarchical Disentanglement of Discriminative Latent Features for Zero-shot Learning
    Tong, Bin
    Wang, Chao
    Klinkigt, Martin
    Kobayashi, Yoshiyuki
    Nonaka, Yuuichi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11459 - 11468
  • [39] Learning Latent Semantic Attributes for Zero-Shot Object Detection
    Wang, Kang
    Zhang, Lu
    Tan, Yifan
    Zhao, Jiajia
    Zhou, Shuigeng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
  • [40] Latent Embeddings for Zero-shot Classification
    Xian, Yongqin
    Akata, Zeynep
    Sharma, Gaurav
    Nguyen, Quynh
    Hein, Matthias
    Schiele, Bernt
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 69 - 77