Zero-shot recognition with latent visual attributes learning

被引:2
|
作者
Xie, Yurui [1 ,2 ]
He, Xiaohai [1 ]
Zhang, Jing [1 ]
Luo, Xiaodong [1 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Human-designed attributes; Dictionary learning; Visual attributes; Semantic representation; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1007/s11042-020-09316-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning (ZSL) aims to recognize novel object categories by means of transferring knowledge extracted from the seen categories (source domain) to the unseen categories (target domain). Recently, most ZSL methods concentrate on learning a visual-semantic alignment to bridge image features and their semantic representations by relying solely on the human-designed attributes. However, few works study whether the human-designed attributes are discriminative enough for recognition task. To address this problem, we propose a couple semantic dictionaries (CSD) learning approach to exploit the latent visual attributes and align the visual-semantic spaces at the same time. Specifically, the learned visual attributes are elegantly incorporated into the semantic representation of image feature and then consolidate the discriminative visual cues for object recognition. In addition, existing ZSL methods suffer from the domain shift issue due to the source domain and target domain have completely separated label spaces. We further employ the visual-semantic alignment and latent visual attributes jointly from source domain to regularise the learning of target domain, which ensures the expansibility of information transfer across domains. We formulate this as an optimization problem on a unified objective and propose an iterative solver. Extensive experiments on two challenging benchmark datasets demonstrate that our proposed approach outperforms several state-of-the-art ZSL methods.
引用
收藏
页码:27321 / 27335
页数:15
相关论文
共 50 条
  • [21] Visual Context Embeddings for Zero-Shot Recognition
    Cho, Gunhee
    Choi, Yong Suk
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1039 - 1047
  • [22] Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions
    Qian, Yijun
    Yu, Lijun
    Liu, Wenhe
    Hauptmann, Alexander G.
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 104 - 120
  • [23] Robust Zero-Shot Learning with Source Attributes Noise
    Yu, Jun
    Wu, Songsong
    Wang, Lu
    Jing, Xiao-Yuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 205 - 209
  • [24] Discriminative deep attributes for generalized zero-shot learning
    Kim, Hoseong
    Lee, Jewook
    Byun, Hyeran
    PATTERN RECOGNITION, 2022, 124
  • [25] Zero-Shot Learning with a Partial Set of Observed Attributes
    Wang, Yaqing
    Kwok, James T.
    Yao, Quanming
    Ni, Lionel M.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3777 - 3784
  • [26] Complementary Attributes: A New Clue to Zero-Shot Learning
    Xu, Xiaofeng
    Tsang, Ivor W.
    Liu, Chuancai
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (03) : 1519 - 1530
  • [27] Integrative zero-shot learning for fruit recognition
    Tran-Anh, Dat
    Huu, Quynh Nguyen
    Bui-Quoc, Bao
    Hoang, Ngan Dao
    Quoc, Tao Ngo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73191 - 73213
  • [28] Kernelized distance learning for zero-shot recognition
    Zarei, Mohammad Reza
    Taheri, Mohammad
    Long, Yang
    INFORMATION SCIENCES, 2021, 580 : 801 - 818
  • [29] An Attribute Learning Method for Zero-Shot Recognition
    Yazdanian, Ramtin
    Shojaee, Seyed Mohsen
    Baghshah, Mahdieh Soleymani
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 2235 - 2240
  • [30] Fabric Recognition Using Zero-Shot Learning
    Wang, Feng
    Liu, Huaping
    Sun, Fuchun
    Pan, Haihong
    TSINGHUA SCIENCE AND TECHNOLOGY, 2019, 24 (06) : 645 - 653