Guided CNN for generalized zero-shot and open-set recognition using visual and semantic prototypes

被引：32

作者：

Geng, Chuanxing ^{[1
]}

Tao, Lue ^{[1
]}

Chen, Songcan ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, MIIT Key Lab Pattern Anal & Machine Intelligence, Nanjing 211106, Peoples R China

来源：

PATTERN RECOGNITION | 2020年 / 102卷

关键词：

Convolutional prototype learning; Generalized zero-shot Learning; Open set recognition;

D O I：

10.1016/j.patcog.2020.107263

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the process of exploring the world, the curiosity constantly drives humans to cognize new things. Supposing you are a zoologist, for a presented animal image, you can recognize it immediately if you know its class. Otherwise, you would more likely attempt to cognize it by exploiting the side-information (e.g., semantic information, etc.) you have accumulated. Inspired by this, this paper decomposes the generalized zero-shot learning (G-ZSL) task into an open set recognition (OSR) task and a zero-shot learning (ZSL) task, where OSR recognizes seen classes (if we have seen (or known) them) and rejects unseen classes (if we have never seen (or known) them before), while ZSL identifies the unseen classes rejected by the former. Simultaneously, without violating OSR's assumptions (only known class knowledge is available in training), we also first attempt to explore a new generalized open set recognition (G-OSR) by introducing the accumulated side-information from known classes to OSR. For G-ZSL, such a decomposition effectively solves the class overfitting problem with easily misclassifying unseen classes as seen classes. The problem is ubiquitous in most existing G-ZSL methods. On the other hand, for G-OSR, introducing such semantic information of known classes not only improves the recognition performance but also endows OSR with the cognitive ability of unknown classes. Specifically, a visual and semantic prototypes-jointly guided convolutional neural network (VSG-CNN) is proposed to fulfill these two tasks (G-ZSL and G-OSR) in a unified end-to-end learning framework. Extensive experiments on benchmark datasets demonstrate the advantages of our learning framework. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：10

共 50 条

[41] Learning complementary semantic information for zero-shot recognition
Hu, Xiaoming
Wang, Zilei
Li, Junjie
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
[42] Visual feature synthesis with semantic reconstructor for traditional and generalized zero-shot object classification
Zhao, Ye
Xu, Tingting
Liu, Xueliang
Guo, Dan
Hu, Zhenzhen
Liu, Hengchang
Li, Yicong
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (05) : 2934 - 2951
[43] Zero-Shot Visual Recognition via Semantic Attention-Based Compare Network
Nian, Fudong
Sheng, Yikun
Wang, Junfeng
Li, Teng
IEEE ACCESS, 2020, 8 : 26002 - 26011
[44] Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition
Jiang, Huajie
Wang, Ruiping
Shan, Shiguang
Chen, Xilin
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 121 - 138
[45] GOSS: towards generalized open-set semantic segmentation
Jie Hong
Weihao Li
Junlin Han
Jiyang Zheng
Pengfei Fang
Mehrtash Harandi
Lars Petersson
The Visual Computer, 2024, 40 : 2391 - 2404
[46] GOSS: towards generalized open-set semantic segmentation
Hong, Jie
Li, Weihao
Han, Junlin
Zheng, Jiyang
Fang, Pengfei
Harandi, Mehrtash
Petersson, Lars
VISUAL COMPUTER, 2024, 40 (04): : 2391 - 2404
[47] Dissimilarity Representation Learning for Generalized Zero-Shot Recognition
Yang, Gang
Liu, Jinlu
Xu, Jieping
Li, Xirong
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 2032 - 2039
[48] Ontology-guided Semantic Composition for Zero-Shot Learning
Chen, Jiaoyan
Lecue, Freddy
Geng, Yuxia
Pan, Jeff Z.
Chen, Huajun
KR2020: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PRINCIPLES OF KNOWLEDGE REPRESENTATION AND REASONING, 2020, : 850 - 854
[49] Zero-Shot Visual Emotion Recognition by Exploiting BERT
Kang, Hyunwook
Hazarika, Devamanyu
Kim, Dongho
Kim, Jihie
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 485 - 494
[50] Zero-shot recognition with latent visual attributes learning
Xie, Yurui
He, Xiaohai
Zhang, Jing
Luo, Xiaodong
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27321 - 27335

← 1 2 3 4 5 →