Concept-Aware Graph Convolutional Network for Compositional Zero-Shot Learning

被引:0
|
作者
Liu, Yang [1 ,2 ]
Wang, Xinshuo [1 ]
Gao, Xinbo [3 ]
Han, Jungong [4 ]
Shao, Ling [5 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[4] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, Yorkshire, England
[5] Univ Chinese Acad Sci, UCAS Terminus AI Lab, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Transformers; Feature extraction; Dogs; Zero shot learning; Object recognition; Computational modeling; Training; Telecommunications; Compositional zero-shot learning (CZSL); concept-aware; cross-attentions; Earth mover's distance (EMD); graph convolutional network (GCN);
D O I
10.1109/TNNLS.2025.3528885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compositional zero-shot learning (CZSL) aims to identify unobservable compositional concepts with prior knowledge of known primitives (attributes and objects). Due to distribution differences between seen and unseen components, existing methods for CZSL often ignore intrinsic variations between primitives and suffer from domain bias problems. To address this challenge, we proposed a concept-aware graph convolutional network (GCN) that utilizes cross-attentions to extract features unique to attributes and objects from paired concept-sharing inputs. The proposed model utilizes the cosine similarity between visual features and synthetic embeddings to estimate the feasibility score for each unseen composition. This score is then employed as a weight in the graph adjacency matrix. Additionally, the proposed model incorporates the Earth mover's distance (EMD) to further limit the concept of learning interest in disentanglers. Experimental results on three challenging dataset benchmarks, including UT-Zappos 50K, C-GQA, and MIT-States, demonstrate that the proposed model outperforms prior work in both closed-and open-world CZSL (OW-CZSL).
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Convolutional prototype learning for zero-shot recognition
    Liu, Zhizhe
    Zhang, Xingxing
    Zhu, Zhenfeng
    Zheng, Shuai
    Zhao, Yao
    Cheng, Jian
    IMAGE AND VISION COMPUTING, 2020, 98
  • [22] GNDAN: Graph Navigated Dual Attention Network for Zero-Shot Learning
    Chen, Shiming
    Hong, Ziming
    Xie, Guosen
    Peng, Qinmu
    You, Xinge
    Ding, Weiping
    Shao, Ling
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4516 - 4529
  • [23] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
    Zhang, Tian
    Liang, Kongming
    Du, Ruoyi
    Sun, Xian
    Ma, Zhanyu
    Guo, Jun
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
  • [24] Hierarchical Prompt Learning for Compositional Zero-Shot Recognition
    Wang, Henan
    Yang, Muli
    Wei, Kun
    Deng, Cheng
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1470 - 1478
  • [25] Reference-Limited Compositional Zero-Shot Learning
    Huang, Siteng
    Wei, Qiyao
    Wang, Donglin
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 443 - 451
  • [26] Environment Generation for Zero-Shot Compositional Reinforcement Learning
    Gur, Izzeddin
    Jaques, Natasha
    Miao, Yingjie
    Choi, Jongwook
    Tiwari, Manoj
    Lee, Honglak
    Faust, Aleksandra
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Adaptive Fusion Learning for Compositional Zero-Shot Recognition
    Min, Lingtong
    Fan, Ziman
    Wang, Shunzhou
    Dou, Feiyang
    Li, Xin
    Wang, Binglu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1193 - 1204
  • [28] A Decomposable Causal View of Compositional Zero-Shot Learning
    Yang, Muli
    Xu, Chenghao
    Wu, Aming
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5892 - 5902
  • [29] An Entropy-Guided Reinforced Partial Convolutional Network for Zero-Shot Learning
    Li, Yun
    Liu, Zhe
    Yao, Lina
    Wang, Xianzhi
    McAuley, Julian
    Chang, Xiaojun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5175 - 5186
  • [30] Relation-Aware Compositional Zero-Shot Learning for Attribute-Object Pair Recognition
    Xu, Ziwei
    Wang, Guangzhi
    Wong, Yongkang
    Kankanhalli, Mohan S.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3652 - 3664