Domain-Aware Prototype Network for Generalized Zero-Shot Learning

Cited by: 0

Authors:

Hu, Yongli [1]

Feng, Lincong [1]

Jiang, Huajie [1]

Liu, Mengting [1]

Yin, Baocai [1]

Affiliations:

[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Fac Informat Technol, Beijing 100124, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024 / Vol. 34 / No. 5

Keywords:

Visualization; Prototypes; Semantics; Transformers; Image recognition; Feature extraction; Task analysis; Generalized zero-shot learning; transformer-based dual attention; domain detection

DOI:

10.1109/TCSVT.2023.3313727

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Subject classification codes:

0808; 0809

Abstract:

Generalized zero-shot learning (GZSL) aims to recognize images from both seen and unseen classes using side information, such as manually annotated attribute vectors. Traditional methods focus on mapping images and semantics into a common latent space to achieve visual-semantic alignment. Since the unseen classes are unavailable during training, these methods suffer from a serious recognition bias: they tend to recognize unseen classes as seen classes. To solve this problem, we propose a Domain-aware Prototype Network (DPN), which splits the GZSL problem into a seen-class recognition problem and an unseen-class recognition problem. For the seen classes, we design a domain-aware prototype learning branch with a dual attention feature encoder to capture the essential visual information; this branch aims to recognize the seen classes and to discriminate the novel categories. To further recognize the fine-grained unseen classes, we design a visual-semantic embedding branch, which aligns the visual and semantic information for unseen-class recognition. Through multi-task learning of the prototype learning branch and the visual-semantic embedding branch, our model achieves excellent performance on three popular GZSL datasets.
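
The split into seen-class recognition and unseen-class recognition described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function `predict`, the distance threshold, the identity projection, and the toy prototypes and attribute vectors are all hypothetical, and the abstract does not say how DPN detects the domain or scores compatibility. The sketch only assumes that a sample far from every seen-class prototype is treated as novel and is then matched against unseen-class attribute vectors.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dot(a, b):
    """Dot product, used here as an assumed compatibility score."""
    return sum(x * y for x, y in zip(a, b))


def predict(feature, seen_prototypes, unseen_attributes, project, threshold):
    """Route a visual feature to a seen class or an unseen class.

    Assumption: the domain is detected by thresholding the distance to the
    nearest seen-class prototype. The abstract does not specify this rule.
    """
    # Stage 1: find the nearest seen-class prototype in visual feature space.
    seen_label, distance = min(
        ((label, euclidean(feature, proto)) for label, proto in seen_prototypes.items()),
        key=lambda pair: pair[1],
    )
    if distance <= threshold:
        return seen_label

    # Stage 2: the sample is judged novel, so embed it into the semantic
    # space and pick the most compatible unseen-class attribute vector.
    embedded = project(feature)
    return max(unseen_attributes, key=lambda label: dot(embedded, unseen_attributes[label]))


# Toy data (hypothetical): 2-D features, identity projection.
seen_prototypes = {"horse": [1.0, 0.0], "tiger": [0.0, 1.0]}
unseen_attributes = {"zebra": [1.0, 1.0], "whale": [-1.0, -1.0]}


def identity(vector):
    return vector


near_seen = predict([0.9, 0.1], seen_prototypes, unseen_attributes, identity, 0.5)
far_from_seen = predict([3.0, 3.0], seen_prototypes, unseen_attributes, identity, 0.5)
```

With these toy values, `near_seen` resolves to a seen class because the feature lies within the threshold of the `horse` prototype, while `far_from_seen` is routed to the unseen-class branch. In a trained model the prototypes, the projection, and the threshold would be learned or tuned rather than fixed by hand.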

Pages: 3180-3191

Page count: 12

50 records in total

[31] Generative Dual Adversarial Network for Generalized Zero-shot Learning
Huang, He
Wang, Changhu
Yu, Philip S.
Wang, Chang-Dong
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 801 - 810
[32] Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
Ye, Yalan
He, Yukun
Pan, Tongjie
Li, Jingjing
Shen, Heng Tao
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1325 - 1337
[33] Learning domain invariant unseen features for generalized zero-shot classification
Li, Xiao
Fang, Min
Li, Haikun
Wu, Jinqiao
KNOWLEDGE-BASED SYSTEMS, 2020, 206
[34] Enhancing Domain-Invariant Parts for Generalized Zero-Shot Learning
Zhang, Yang
Feng, Songhe
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023 : 6283 - 6291
[35] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
Rahman, Shafin
Khan, Salman
Porikli, Fatih
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
[36] Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning
Yang, Yanhua
Zhang, Xiaozhe
Yang, Muli
Deng, Cheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 280 - 290
[37] Superclass-aware visual feature disentangling for generalized zero-shot learning
Niu, Chang
Shang, Junyuan
Zhou, Zhiheng
Yang, Junmei
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[39] Dual-level contrastive learning network for generalized zero-shot learning
Guan, Jiaqi
Meng, Min
Liang, Tianyou
Liu, Jigang
Wu, Jigang
VISUAL COMPUTER, 2022, 38 (9-10) : 3087 - 3095
[40] Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
Xu, Tingting
Zhao, Ye
Liu, Xueliang
COMPLEXITY, 2021, 2021