Domain-Aware Prototype Network for Generalized Zero-Shot Learning

被引:0
|
作者
Hu, Yongli [1 ]
Feng, Lincong [1 ]
Jiang, Huajie [1 ]
Liu, Mengting [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Fac Informat Technol, Beijing 100124, Peoples R China
关键词
Visualization; Prototypes; Semantics; Transformers; Image recognition; Feature extraction; Task analysis; Generalized zero-shot learning; transformer-based dual attention; domain detection;
D O I
10.1109/TCSVT.2023.3313727
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Generalized zero-shot learning(GZSL) aims to recognize images from seen and unseen classes with side information, such as manually annotated attribute vectors. Traditional methods focus on mapping images and semantics into a common latent space, thus achieving the visual-semantics alignment. Since the unseen classes are unavailable during training, there is a serious problem of recognition bias, which will tend to recognize unseen classes as seen classes. To solve this problem, we propose a Domain-aware Prototype Network(DPN), which splits the GZSL problem into the seen class recognition and unseen class recognition problem. For the seen classes, we design a domain-aware prototype learning branch with a dual attention feature encoder to capture the essential visual information, which aims to recognize the seen classes and discriminate the novel categories. To further recognize the fine-grained unseen classes, a visual-semantic embedding branch is designed, which aims to align the visual and semantic information for unseen-class recognition. Through the multi-task learning of the prototype learning branch and visual-semantic embedding branch, our model can achieve excellent performance on three popular GZSL datasets.
引用
收藏
页码:3180 / 3191
页数:12
相关论文
共 50 条
  • [1] Domain-aware multi-modality fusion network for generalized zero-shot learning
    Wang, Jia
    Wang, Xiao
    Zhang, Han
    NEUROCOMPUTING, 2022, 488 : 23 - 35
  • [2] Domain-aware Stacked AutoEncoders for zero-shot learning
    Song, Jianqiang
    Shi, Guangming
    Xie, Xuemei
    Wu, Qingtao
    Zhang, Mingchuan
    NEUROCOMPUTING, 2021, 429 : 118 - 131
  • [3] Dual Prototype Contrastive Network for Generalized Zero-Shot Learning
    Jiang, Huajie
    Li, Zhengxian
    Hu, Yongli
    Yin, Baocai
    Yang, Jian
    van den Hengel, Anton
    Yang, Ming-Hsuan
    Qi, Yuankai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1111 - 1122
  • [4] Dual Progressive Prototype Network for Generalized Zero-Shot Learning
    Wang, Chaoqun
    Mina, Shaobo
    Chenl, Xuejin
    Sun, Xiaoyan
    Li, Houqiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Residual-Prototype Generating Network for Generalized Zero-Shot Learning
    Zhang, Zeqing
    Li, Xiaofan
    Ma, Tai
    Gao, Zuodong
    Li, Cuihua
    Lin, Weiwei
    MATHEMATICS, 2022, 10 (19)
  • [6] Class-Prototype Discriminative Network for Generalized Zero-Shot Learning
    Huang, Sheng
    Lin, Jingkai
    Huangfu, Luwen
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 301 - 305
  • [7] Generalized zero-shot domain adaptation with target unseen class prototype learning
    Li, Xiao
    Fang, Min
    Chen, Bo
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (20): : 17793 - 17807
  • [8] Generalized zero-shot domain adaptation with target unseen class prototype learning
    Xiao Li
    Min Fang
    Bo Chen
    Neural Computing and Applications, 2022, 34 : 17793 - 17807
  • [9] Attributes learning network for generalized zero-shot learning
    Yun, Yu
    Wang, Sen
    Hou, Mingzhen
    Gao, Quanxue
    NEURAL NETWORKS, 2022, 150 : 112 - 118
  • [10] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Ming Zhu
    Chen Zhao
    Nian Wang
    Feiyang Gu
    Yu Liu
    Xin Li
    The Visual Computer, 2024, 40 : 3091 - 3101