Zero-shot recognition with latent visual attributes learning

被引:2
|
作者
Xie, Yurui [1 ,2 ]
He, Xiaohai [1 ]
Zhang, Jing [1 ]
Luo, Xiaodong [1 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Human-designed attributes; Dictionary learning; Visual attributes; Semantic representation; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1007/s11042-020-09316-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning (ZSL) aims to recognize novel object categories by means of transferring knowledge extracted from the seen categories (source domain) to the unseen categories (target domain). Recently, most ZSL methods concentrate on learning a visual-semantic alignment to bridge image features and their semantic representations by relying solely on the human-designed attributes. However, few works study whether the human-designed attributes are discriminative enough for recognition task. To address this problem, we propose a couple semantic dictionaries (CSD) learning approach to exploit the latent visual attributes and align the visual-semantic spaces at the same time. Specifically, the learned visual attributes are elegantly incorporated into the semantic representation of image feature and then consolidate the discriminative visual cues for object recognition. In addition, existing ZSL methods suffer from the domain shift issue due to the source domain and target domain have completely separated label spaces. We further employ the visual-semantic alignment and latent visual attributes jointly from source domain to regularise the learning of target domain, which ensures the expansibility of information transfer across domains. We formulate this as an optimization problem on a unified objective and propose an iterative solver. Extensive experiments on two challenging benchmark datasets demonstrate that our proposed approach outperforms several state-of-the-art ZSL methods.
引用
收藏
页码:27321 / 27335
页数:15
相关论文
共 50 条
  • [1] Zero-shot recognition with latent visual attributes learning
    Yurui Xie
    Xiaohai He
    Jing Zhang
    Xiaodong Luo
    Multimedia Tools and Applications, 2020, 79 : 27321 - 27335
  • [2] Beyond Semantic Attributes: Discrete Latent Attributes Learning for Zero-Shot Recognition
    Qin, Jie
    Wang, Yunhong
    Liu, Li
    Chen, Jiaxin
    Shao, Ling
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (11) : 1667 - 1671
  • [3] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74 (74)
  • [4] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [5] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [6] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [7] Learning Discriminative Latent Attributes for Zero-Shot Classification
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Yang, Yi
    Chen, Xilin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4233 - 4242
  • [8] Learning Latent Semantic Attributes for Zero-Shot Object Detection
    Wang, Kang
    Zhang, Lu
    Tan, Yifan
    Zhao, Jiajia
    Zhou, Shuigeng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
  • [9] Discriminative Learning of Latent Features for Zero-Shot Recognition
    Li, Yan
    Zhang, Junge
    Zhang, Jianguo
    Huang, Kaiqi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7463 - 7471
  • [10] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
    Qian Wang
    Ke Chen
    International Journal of Computer Vision, 2017, 124 : 356 - 383