Knowledge Graph Enhancement for Fine-Grained Zero-Shot Learning on ImageNet21K

被引:1
|
作者
Chen, Xingyu [1 ]
Liu, Jiaxu [1 ]
Liu, Zeyang [1 ]
Wan, Lipeng [1 ]
Lan, Xuguang [1 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, Xian 710049, Shaanxi, Peoples R China
关键词
Semantics; Knowledge graphs; Zero-shot learning; Circuits and systems; Visualization; Training; Task analysis; Fine-grained zero-shot learning; knowledge graph; graph convolutional neural network; DATABASE;
D O I
10.1109/TCSVT.2024.3396215
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fine-grained Zero-shot Learning on the large-scale dataset ImageNet21K is an important task that has promising perspectives in many real-world scenarios. One typical solution is to explicitly model the knowledge passing using a Knowledge Graph (KG) to transfer knowledge from seen to unseen instances. By analyzing the hierarchical structure and the word descriptions on ImageNet21K, we find that the noisy semantic information, the sparseness of seen classes, and the lack of supervision of unseen classes make the knowledge passing insufficient, which limits the KG-based fine-grained ZSL. To resolve this problem, in this paper, we enhance the knowledge passing from three aspects. First, we use more powerful models such as the Large Language Model and Vision-Language Model to get more reliable semantic embeddings. Then we propose a strategy that globally enhances the knowledge graph based on the convex combination relationship of the semantic embeddings. It effectively connects the edges between the non-kinship seen and unseen classes that have strong correlations while assigning an importance score to each edge. Based on the enhanced knowledge graph, we further present a novel regularizer that locally enhances the knowledge passing during training. We extensively conducted comparative evaluations to demonstrate the advantages of our method over state-of-the-art approaches.
引用
收藏
页码:9090 / 9101
页数:12
相关论文
共 50 条
  • [31] UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
    Sun, Rui
    Wang, Zhecan
    You, Haoxuan
    Codella, Noel
    Chang, Kai-Wei
    Chang, Shih-Fu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 778 - 793
  • [32] Content-Aware Rectified Activation for Zero-Shot Fine-Grained Image Retrieval
    Wang, Shijie
    Chang, Jianlong
    Wang, Zhihui
    Li, Haojie
    Ouyang, Wanli
    Tian, Qi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4366 - 4380
  • [33] Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics (vol 16, pg 563, 2019)
    Li, Ao-Xue
    Zhang, Ke-Xin
    Wang, Li-Wei
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (06) : 1045 - 1045
  • [34] A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning
    Song, Lingyun
    Shang, Xuequn
    Zhou, Ruizhi
    Liu, Jun
    Ma, Jie
    Li, Zhanhuai
    Sun, Mingxuan
    NEURAL NETWORKS, 2024, 179
  • [35] Semantic-visual shared knowledge graph for zero-shot learning
    Yu, Beibei
    Xie, Cheng
    Tang, Peng
    Li, Bin
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [36] Semantic-visual shared knowledge graph for zero-shot learning
    Yu B.
    Xie C.
    Tang P.
    Li B.
    PeerJ Computer Science, 2023, 9
  • [37] A Zero-shot Learning Method with a Multi-modal Knowledge Graph
    Zhang, Yuhong
    Shu, Haitao
    Bu, Chenyang
    Hu, Xuegang
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 391 - 395
  • [38] CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Chowdhury, Pinaki Nath
    Koley, Subhadeep
    Xiang, Tao
    Song, Yi-Zhe
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2765 - 2775
  • [39] RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction
    Zhao, Jun
    Zhan, Wenyu
    Zhao, Xin
    Zhang, Qi
    Gui, Tao
    Wei, Zhongyu
    Wang, Junzhe
    Peng, Minlong
    Sun, Mingming
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6680 - 6691
  • [40] Learning Graph Embeddings for Compositional Zero-shot Learning
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Tombari, Federico
    Akata, Zeynep
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 953 - 962