Knowledge Graph Enhancement for Fine-Grained Zero-Shot Learning on ImageNet21K

被引：1

作者：

Chen, Xingyu ^{[1
]}

Liu, Jiaxu ^{[1
]}

Liu, Zeyang ^{[1
]}

Wan, Lipeng ^{[1
]}

Lan, Xuguang ^{[1
]}

Zheng, Nanning ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Shaanxi, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 10期

关键词：

Semantics; Knowledge graphs; Zero-shot learning; Circuits and systems; Visualization; Training; Task analysis; Fine-grained zero-shot learning; knowledge graph; graph convolutional neural network; DATABASE;

D O I：

10.1109/TCSVT.2024.3396215

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Fine-grained Zero-shot Learning on the large-scale dataset ImageNet21K is an important task that has promising perspectives in many real-world scenarios. One typical solution is to explicitly model the knowledge passing using a Knowledge Graph (KG) to transfer knowledge from seen to unseen instances. By analyzing the hierarchical structure and the word descriptions on ImageNet21K, we find that the noisy semantic information, the sparseness of seen classes, and the lack of supervision of unseen classes make the knowledge passing insufficient, which limits the KG-based fine-grained ZSL. To resolve this problem, in this paper, we enhance the knowledge passing from three aspects. First, we use more powerful models such as the Large Language Model and Vision-Language Model to get more reliable semantic embeddings. Then we propose a strategy that globally enhances the knowledge graph based on the convex combination relationship of the semantic embeddings. It effectively connects the edges between the non-kinship seen and unseen classes that have strong correlations while assigning an importance score to each edge. Based on the enhanced knowledge graph, we further present a novel regularizer that locally enhances the knowledge passing during training. We extensively conducted comparative evaluations to demonstrate the advantages of our method over state-of-the-art approaches.

引用

页码：9090 / 9101

页数：12

共 50 条

[31] UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Sun, Rui
Wang, Zhecan
You, Haoxuan
Codella, Noel
Chang, Kai-Wei
Chang, Shih-Fu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 778 - 793
[32] Content-Aware Rectified Activation for Zero-Shot Fine-Grained Image Retrieval
Wang, Shijie
Chang, Jianlong
Wang, Zhihui
Li, Haojie
Ouyang, Wanli
Tian, Qi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4366 - 4380
[33] Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics (vol 16, pg 563, 2019)
Li, Ao-Xue
Zhang, Ke-Xin
Wang, Li-Wei
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (06) : 1045 - 1045
[34] A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning
Song, Lingyun
Shang, Xuequn
Zhou, Ruizhi
Liu, Jun
Ma, Jie
Li, Zhanhuai
Sun, Mingxuan
NEURAL NETWORKS, 2024, 179
[35] Semantic-visual shared knowledge graph for zero-shot learning
Yu, Beibei
Xie, Cheng
Tang, Peng
Li, Bin
PEERJ COMPUTER SCIENCE, 2023, 9
[36] Semantic-visual shared knowledge graph for zero-shot learning
Yu B.
Xie C.
Tang P.
Li B.
PeerJ Computer Science, 2023, 9
[37] A Zero-shot Learning Method with a Multi-modal Knowledge Graph
Zhang, Yuhong
Shu, Haitao
Bu, Chenyang
Hu, Xuegang
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 391 - 395
[38] CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not
Sain, Aneeshan
Bhunia, Ayan Kumar
Chowdhury, Pinaki Nath
Koley, Subhadeep
Xiang, Tao
Song, Yi-Zhe
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2765 - 2775
[39] RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction
Zhao, Jun
Zhan, Wenyu
Zhao, Xin
Zhang, Qi
Gui, Tao
Wei, Zhongyu
Wang, Junzhe
Peng, Minlong
Sun, Mingming
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6680 - 6691
[40] Learning Graph Embeddings for Compositional Zero-shot Learning
Naeem, Muhammad Ferjad
Xian, Yongqin
Tombari, Federico
Akata, Zeynep
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 953 - 962

← 1 2 3 4 5 →