Zero-Shot Learning via Robust Latent Representation and Manifold Regularization

Cited by: 39

Authors:

Meng, Min ^{[1]}

Yu, Jun ^{[2]}

Affiliations:

[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China

[2] Hangzhou Dianzi Univ, Dept Comp Sci, Hangzhou 310018, Zhejiang, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2019 / Vol. 28 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Zero-shot learning; image classification; latent subspace; manifold regularization;

DOI:

10.1109/TIP.2018.2881926

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Zero-shot learning (ZSL) for visual recognition aims to accurately recognize objects of unseen classes by mapping visual features to an embedding space spanned by class semantic information. However, the semantic gap between visual features and their underlying semantics remains a major obstacle in ZSL. Conventional ZSL methods that construct this mapping typically rely on the original visual features, which are independent of the ZSL task, thus degrading prediction performance. In this paper, we propose an effective method to uncover an appropriate latent representation of the data for zero-shot classification. Specifically, we formulate a novel framework that jointly learns the latent subspace and a cross-modal embedding to link visual features with their semantic representations. The proposed framework combines feature learning and semantics prediction, so that the learned data representation is more discriminative for predicting the semantic vectors, which improves the overall classification performance. To learn a robust latent subspace, we explicitly avoid information loss by ensuring that the obtained data representation can reconstruct the original data. An efficient algorithm is designed to solve the proposed optimization problem. To fully exploit the intrinsic geometric structure of the data, we develop a manifold regularization strategy to refine the learned semantic representations, leading to further improvements in classification performance. To validate the effectiveness of the proposed approach, extensive experiments are conducted on three ZSL benchmarks, and encouraging results are achieved compared with state-of-the-art ZSL methods.
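
As a minimal illustration of the general ZSL inference step the abstract refers to (mapping a visual feature into a semantic space and matching it against class semantic vectors), the Python sketch below may help. It is not the authors' method: the linear map `W`, the toy class prototypes, and the function names `project`, `cosine`, and `predict_unseen` are all hypothetical, and the latent subspace learning and manifold regularization described in the abstract are not modeled here.

```python
import math

def project(W, x):
    """Map a visual feature x (length d) into the semantic space
    using a linear map W given as k rows of length d (assumed, not learned here)."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def cosine(u, v):
    """Cosine similarity between two vectors; returns 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def predict_unseen(W, x, prototypes):
    """Return the unseen class whose semantic prototype is most similar
    to the projected visual feature."""
    s = project(W, x)
    return max(prototypes, key=lambda name: cosine(s, prototypes[name]))

# Toy data (hypothetical): 3-dimensional visual features, 2-dimensional semantic space.
W = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 1.0]]
prototypes = {"zebra": [1.0, 0.0], "whale": [0.0, 1.0]}

label = predict_unseen(W, [0.9, 0.1, 0.0], prototypes)
print(label)
```

In a real system `W` would be learned from seen-class data, and the prototypes would come from attribute annotations or word embeddings of the unseen classes.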

Citation

Pages: 1824-1836

Number of pages: 13

50 entries in total

[31] Zero-shot Learning via Simultaneous Generating and Learning
Yu, Hyeonwoo
Lee, Beomhee
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[32] Dissimilarity Representation Learning for Generalized Zero-Shot Recognition
Yang, Gang
Liu, Jinlu
Xu, Jieping
Li, Xirong
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2032 - 2039
[33] Zero-Shot Learning via Visual Abstraction
Antol, Stanislaw
Zitnick, C. Lawrence
Parikh, Devi
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 401 - 416
[34] Discriminative and Robust Attribute Alignment for Zero-Shot Learning
Cheng, De
Wang, Gerong
Wang, Nannan
Zhang, Dingwen
Zhang, Qiang
Gao, Xinbo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4244 - 4256
[35] Robust Zero-Shot Learning with Source Attributes Noise
Yu, Jun
Wu, Songsong
Wang, Lu
Jing, Xiao-Yuan
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 205 - 209
[36] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
Qian Wang
Ke Chen
International Journal of Computer Vision, 2017, 124 : 356 - 383
[37] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
Wang, Qian
Chen, Ke
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (03) : 356 - 383
[38] Hierarchical Disentanglement of Discriminative Latent Features for Zero-shot Learning
Tong, Bin
Wang, Chao
Klinkigt, Martin
Kobayashi, Yoshiyuki
Nonaka, Yuuichi
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11459 - 11468
[39] Learning Latent Semantic Attributes for Zero-Shot Object Detection
Wang, Kang
Zhang, Lu
Tan, Yifan
Zhao, Jiajia
Zhou, Shuigeng
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
[40] Latent Embeddings for Zero-shot Classification
Xian, Yongqin
Akata, Zeynep
Sharma, Gaurav
Nguyen, Quynh
Hein, Matthias
Schiele, Bernt
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 69 - 77
