Domain-Aware Prototype Network for Generalized Zero-Shot Learning

被引:0
|
作者
Hu, Yongli [1 ]
Feng, Lincong [1 ]
Jiang, Huajie [1 ]
Liu, Mengting [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Fac Informat Technol, Beijing 100124, Peoples R China
关键词
Visualization; Prototypes; Semantics; Transformers; Image recognition; Feature extraction; Task analysis; Generalized zero-shot learning; transformer-based dual attention; domain detection;
D O I
10.1109/TCSVT.2023.3313727
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Generalized zero-shot learning(GZSL) aims to recognize images from seen and unseen classes with side information, such as manually annotated attribute vectors. Traditional methods focus on mapping images and semantics into a common latent space, thus achieving the visual-semantics alignment. Since the unseen classes are unavailable during training, there is a serious problem of recognition bias, which will tend to recognize unseen classes as seen classes. To solve this problem, we propose a Domain-aware Prototype Network(DPN), which splits the GZSL problem into the seen class recognition and unseen class recognition problem. For the seen classes, we design a domain-aware prototype learning branch with a dual attention feature encoder to capture the essential visual information, which aims to recognize the seen classes and discriminate the novel categories. To further recognize the fine-grained unseen classes, a visual-semantic embedding branch is designed, which aims to align the visual and semantic information for unseen-class recognition. Through the multi-task learning of the prototype learning branch and visual-semantic embedding branch, our model can achieve excellent performance on three popular GZSL datasets.
引用
收藏
页码:3180 / 3191
页数:12
相关论文
共 50 条
  • [31] Generative Dual Adversarial Network for Generalized Zero-shot Learning
    Huang, He
    Wang, Changhu
    Yu, Philip S.
    Wang, Chang-Dong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 801 - 810
  • [32] Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
    Ye, Yalan
    He, Yukun
    Pan, Tongjie
    Li, Jingjing
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1325 - 1337
  • [33] Learning domain invariant unseen features for generalized zero-shot classification
    Li, Xiao
    Fang, Min
    Li, Haikun
    Wu, Jinqiao
    KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [34] Enhancing Domain-Invariant Parts for Generalized Zero-Shot Learning
    Zhang, Yang
    Feng, Songhe
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6283 - 6291
  • [35] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
    Rahman, Shafin
    Khan, Salman
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
  • [36] Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning
    Yang, Yanhua
    Zhang, Xiaozhe
    Yang, Muli
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 280 - 290
  • [37] Superclass-aware visual feature disentangling for generalized zero-shot learning
    Niu, Chang
    Shang, Junyuan
    Zhou, Zhiheng
    Yang, Junmei
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [38] Dual-level contrastive learning network for generalized zero-shot learning
    Jiaqi Guan
    Min Meng
    Tianyou Liang
    Jigang Liu
    Jigang Wu
    The Visual Computer, 2022, 38 : 3087 - 3095
  • [39] Dual-level contrastive learning network for generalized zero-shot learning
    Guan, Jiaqi
    Meng, Min
    Liang, Tianyou
    Liu, Jigang
    Wu, Jigang
    VISUAL COMPUTER, 2022, 38 (9-10): : 3087 - 3095
  • [40] Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
    Xu, Tingting
    Zhao, Ye
    Liu, Xueliang
    COMPLEXITY, 2021, 2021