Concept-Aware Graph Convolutional Network for Compositional Zero-Shot Learning

Authors
Liu, Yang [1, 2]
Wang, Xinshuo [1]
Gao, Xinbo [3]
Han, Jungong [4]
Shao, Ling [5]
Affiliations
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[4] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, Yorkshire, England
[5] Univ Chinese Acad Sci, UCAS Terminus AI Lab, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Visualization; Transformers; Feature extraction; Dogs; Zero shot learning; Object recognition; Computational modeling; Training; Telecommunications; Compositional zero-shot learning (CZSL); concept-aware; cross-attentions; Earth mover's distance (EMD); graph convolutional network (GCN);
DOI
10.1109/TNNLS.2025.3528885
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Compositional zero-shot learning (CZSL) aims to recognize unseen compositional concepts using prior knowledge of known primitives (attributes and objects). Because of distribution differences between seen and unseen components, existing CZSL methods often ignore intrinsic variations between primitives and suffer from domain bias. To address this challenge, we propose a concept-aware graph convolutional network (GCN) that uses cross-attention to extract attribute-specific and object-specific features from paired concept-sharing inputs. The proposed model uses the cosine similarity between visual features and synthetic embeddings to estimate a feasibility score for each unseen composition. This score is then used as a weight in the graph adjacency matrix. Additionally, the proposed model incorporates the Earth mover's distance (EMD) to further constrain the concepts of interest learned by the disentanglers. Experimental results on three challenging benchmark datasets, UT-Zappos 50K, C-GQA, and MIT-States, demonstrate that the proposed model outperforms prior work in both closed-world and open-world CZSL (OW-CZSL).
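The feasibility-weighted adjacency matrix mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the averaging of similarities over visual features, the rescaling of cosine similarity to [0, 1], and the node ordering (attributes, then objects, then compositions) are all assumptions made for illustration.

```python
import numpy as np

def cosine_similarity(a, b):
    """Pairwise cosine similarity between rows of a (n, d) and rows of b (m, d)."""
    a_norm = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_norm = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_norm @ b_norm.T

def feasibility_scores(visual_feats, synthetic_embs):
    """One score per composition embedding.

    Assumption: the score is the mean cosine similarity to all visual
    features, rescaled from [-1, 1] to [0, 1]. The paper may aggregate
    differently.
    """
    sims = cosine_similarity(synthetic_embs, visual_feats)  # (m, n)
    return (sims.mean(axis=1) + 1.0) / 2.0

def build_adjacency(n_attrs, n_objs, pairs, weights):
    """Symmetric adjacency over attribute, object, and composition nodes.

    Assumption: each composition node is linked to its attribute and its
    object, and the edge weight is the composition's feasibility score.
    """
    n = n_attrs + n_objs + len(pairs)
    adj = np.eye(n)  # self-loops, as is common for GCN inputs
    for k, ((attr, obj), w) in enumerate(zip(pairs, weights)):
        comp = n_attrs + n_objs + k
        for prim in (attr, n_attrs + obj):
            adj[comp, prim] = adj[prim, comp] = w
    return adj

# Toy data (hypothetical): 2 visual features, 2 candidate compositions.
visual_feats = np.array([[1.0, 0.0], [0.0, 1.0]])
synthetic_embs = np.array([[1.0, 0.0], [1.0, 1.0]])
pairs = [(0, 0), (1, 0)]  # (attribute index, object index)

scores = feasibility_scores(visual_feats, synthetic_embs)
adj = build_adjacency(n_attrs=2, n_objs=1, pairs=pairs, weights=scores)
```

In this sketch a composition with a low feasibility score contributes weakly to message passing, which is one plausible reading of how such a score could down-weight implausible unseen pairs.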
Pages: 13