Zero-shot recognition with latent visual attributes learning

被引：2

作者：

Xie, Yurui ^{[1
,2
]}

He, Xiaohai ^{[1
]}

Zhang, Jing ^{[1
]}

Luo, Xiaodong ^{[1
]}

机构：

[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu, Peoples R China

[2] Chengdu Univ Informat Technol, Chengdu, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2020年 / 79卷 / 37-38期

基金：

中国国家自然科学基金;

关键词：

Zero-shot learning; Human-designed attributes; Dictionary learning; Visual attributes; Semantic representation; CONVOLUTIONAL NEURAL-NETWORKS;

D O I：

10.1007/s11042-020-09316-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Zero-shot learning (ZSL) aims to recognize novel object categories by means of transferring knowledge extracted from the seen categories (source domain) to the unseen categories (target domain). Recently, most ZSL methods concentrate on learning a visual-semantic alignment to bridge image features and their semantic representations by relying solely on the human-designed attributes. However, few works study whether the human-designed attributes are discriminative enough for recognition task. To address this problem, we propose a couple semantic dictionaries (CSD) learning approach to exploit the latent visual attributes and align the visual-semantic spaces at the same time. Specifically, the learned visual attributes are elegantly incorporated into the semantic representation of image feature and then consolidate the discriminative visual cues for object recognition. In addition, existing ZSL methods suffer from the domain shift issue due to the source domain and target domain have completely separated label spaces. We further employ the visual-semantic alignment and latent visual attributes jointly from source domain to regularise the learning of target domain, which ensures the expansibility of information transfer across domains. We formulate this as an optimization problem on a unified objective and propose an iterative solver. Extensive experiments on two challenging benchmark datasets demonstrate that our proposed approach outperforms several state-of-the-art ZSL methods.

引用

页码：27321 / 27335

页数：15

共 50 条

[1] Zero-shot recognition with latent visual attributes learning
Yurui Xie
Xiaohai He
Jing Zhang
Xiaodong Luo
Multimedia Tools and Applications, 2020, 79 : 27321 - 27335
[2] Beyond Semantic Attributes: Discrete Latent Attributes Learning for Zero-Shot Recognition
Qin, Jie
Wang, Yunhong
Liu, Li
Chen, Jiaxin
Shao, Ling
IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (11) : 1667 - 1671
[3] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74 (74)
[4] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[5] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[6] Semantic-aware visual attributes learning for zero-shot recognition
Xie, Yurui
Song, Tiecheng
Li, Wei
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[7] Learning Discriminative Latent Attributes for Zero-Shot Classification
Jiang, Huajie
Wang, Ruiping
Shan, Shiguang
Yang, Yi
Chen, Xilin
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4233 - 4242
[8] Learning Latent Semantic Attributes for Zero-Shot Object Detection
Wang, Kang
Zhang, Lu
Tan, Yifan
Zhao, Jiajia
Zhou, Shuigeng
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
[9] Discriminative Learning of Latent Features for Zero-Shot Recognition
Li, Yan
Zhang, Junge
Zhang, Jianguo
Huang, Kaiqi
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7463 - 7471
[10] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
Qian Wang
Ke Chen
International Journal of Computer Vision, 2017, 124 : 356 - 383

← 1 2 3 4 5 →