Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning

Cited by: 11
Authors
Wang, Ziyang [1, 2]
Gou, Yunhao [1, 2]
Li, Jingjing [2]
Zhu, Lei [3]
Shen, Heng Tao [3]
Affiliations
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313002, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semantics; Visualization; Task analysis; Feature extraction; Image recognition; Annotations; Knowledge transfer; Zero-shot learning; transfer learning; attention mechanism;
DOI
10.1109/TCSVT.2022.3208256
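Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract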
Zero-shot Learning (ZSL) aims to recognize novel classes using knowledge learned from seen classes. The canonical approach to ZSL uses a visual-to-semantic embedding to map the global features of an image to its semantic representation. These global features usually overlook the fine-grained information that is vital for knowledge transfer between seen and unseen classes, which makes them sub-optimal for the ZSL task, especially for the more realistic Generalized Zero-shot Learning (GZSL) task, where the global features of similar classes can hardly be separated. To remedy this problem, we propose Language-Augmented Pixel Embedding (LAPE), which directly bridges the visual and semantic spaces in a pixel-based manner. To this end, we map the local features of each pixel to different attributes and then extract each semantic attribute from the corresponding pixel. However, the lack of pixel-level annotation makes pixel-based knowledge transfer inefficient. To mitigate this, we use the text information of each attribute to augment the local features of the image pixels that are related to the semantic attributes. Experiments on four ZSL benchmarks demonstrate that LAPE outperforms current state-of-the-art methods. Comprehensive ablation studies and analyses are provided to dissect the factors behind this success.
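The idea of extracting each attribute from the pixels related to it can be illustrated with a small sketch. This is not the LAPE architecture, which the abstract does not specify; it is a generic attribute-guided attention toy in plain Python, and every name in it (`attribute_scores`, `predict`, the two-dimensional features, the class names) is a hypothetical choice made for illustration. It assumes each attribute's text embedding acts as a query over per-pixel local features, and that classes are described by attribute vectors.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attribute_scores(pixel_feats, attr_text_embs):
    """Score each attribute from the pixels its text embedding attends to.

    pixel_feats:    list of per-pixel local feature vectors (assumed given).
    attr_text_embs: one text embedding per attribute (assumed given).
    """
    scores = []
    for query in attr_text_embs:
        # Attention weights over pixels, using the attribute text as the query.
        weights = softmax([dot(query, p) for p in pixel_feats])
        # Attention-weighted pooling of the local features.
        pooled = [
            sum(w * p[d] for w, p in zip(weights, pixel_feats))
            for d in range(len(query))
        ]
        scores.append(dot(query, pooled))
    return scores


def predict(pixel_feats, attr_text_embs, class_attrs):
    """Pick the class whose attribute vector best matches the attribute scores."""
    scores = attribute_scores(pixel_feats, attr_text_embs)
    compat = {name: dot(scores, vec) for name, vec in class_attrs.items()}
    return max(compat, key=compat.get)


# Toy data: one "striped" pixel and two background pixels.
pixels = [[3.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
attr_embs = [[2.0, 0.0], [0.0, 2.0]]  # hypothetical "striped", "spotted"
classes = {"zebra": [1.0, 0.0], "leopard": [0.0, 1.0]}
print(predict(pixels, attr_embs, classes))
```

Because classes are compared only through their attribute vectors, a class with no training images can still be scored, which is the setting zero-shot learning targets.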
Pages: 1019-1030
Page count: 12