Knowledge Graph Enhancement for Fine-Grained Zero-Shot Learning on ImageNet21K

被引:1
|
作者
Chen, Xingyu [1 ]
Liu, Jiaxu [1 ]
Liu, Zeyang [1 ]
Wan, Lipeng [1 ]
Lan, Xuguang [1 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Shaanxi, Peoples R China
关键词
Semantics; Knowledge graphs; Zero-shot learning; Circuits and systems; Visualization; Training; Task analysis; Fine-grained zero-shot learning; knowledge graph; graph convolutional neural network; DATABASE;
D O I
10.1109/TCSVT.2024.3396215
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fine-grained Zero-shot Learning on the large-scale dataset ImageNet21K is an important task that has promising perspectives in many real-world scenarios. One typical solution is to explicitly model the knowledge passing using a Knowledge Graph (KG) to transfer knowledge from seen to unseen instances. By analyzing the hierarchical structure and the word descriptions on ImageNet21K, we find that the noisy semantic information, the sparseness of seen classes, and the lack of supervision of unseen classes make the knowledge passing insufficient, which limits the KG-based fine-grained ZSL. To resolve this problem, in this paper, we enhance the knowledge passing from three aspects. First, we use more powerful models such as the Large Language Model and Vision-Language Model to get more reliable semantic embeddings. Then we propose a strategy that globally enhances the knowledge graph based on the convex combination relationship of the semantic embeddings. It effectively connects the edges between the non-kinship seen and unseen classes that have strong correlations while assigning an importance score to each edge. Based on the enhanced knowledge graph, we further present a novel regularizer that locally enhances the knowledge passing during training. We extensively conducted comparative evaluations to demonstrate the advantages of our method over state-of-the-art approaches.
引用
收藏
页码:9090 / 9101
页数:12
相关论文
共 50 条
  • [21] Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification
    Hong, Mingyao
    Zhang, Xinfeng
    Li, Guorong
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1599 - 1612
  • [22] Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition
    Chen Y.-H.
    Yeh M.-C.
    Multimedia Tools and Applications, 2025, 84 (10) : 7915 - 7930
  • [23] Rethinking Knowledge Graph Propagation for Zero-Shot Learning
    Kampffmeyer, Michael
    Chen, Yinbo
    Liang, Xiaodan
    Wang, Hao
    Zhang, Yujia
    Xing, Eric P.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11479 - 11488
  • [24] Fine-Grained Generalized Zero-Shot Learning via Dense Attribute-Based Attention
    Dat Huynh
    Elhamifar, Ehsan
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4482 - 4492
  • [25] Zero-shot fine-grained entity typing in information security based on ontology
    Zhang, Han
    Zhu, Jiaxian
    Chen, Jicheng
    Liu, Junxiu
    Ji, Lixia
    KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [26] Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
    Lin, Haoqiang
    Wen, Haokun
    Song, Xuemeng
    Liu, Meng
    Hu, Yupeng
    Nie, Liqiang
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 240 - 250
  • [27] Towards Fine-grained Open Zero-shot Learning: Inferring Unseen Visual Features from Attributes
    Long, Yang
    Liu, Li
    Shao, Ling
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 944 - 952
  • [28] Transductive semantic knowledge graph propagation for zero-shot learning
    Zhang, Hai-gang
    Que, Hao-yi
    Ren, Jin
    Wu, Zheng-guang
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (17): : 13108 - 13125
  • [29] Fine-grained relation contrast enhancement of knowledge graph for recommendation
    Zhang, Junsan
    Wang, Te
    Wu, Sini
    Ding, Fengmei
    Zhu, Jie
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, : 485 - 505
  • [30] An Empirical Study on Multiple Information Sources for Zero-Shot Fine-Grained Entity Typing
    Chen, Yi
    Jiang, Haiyun
    Liu, Lemao
    Shi, Shuming
    Fan, Chuang
    Yang, Min
    Xu, Ruifeng
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2668 - 2678