Learning Graph Embeddings for Open World Compositional Zero-Shot Learning

被引:22
|
作者
Mancini, Massimiliano [1 ]
Naeem, Muhammad Ferjad [2 ]
Xian, Yongqin [2 ,3 ]
Akata, Zeynep [4 ,5 ]
机构
[1] Univ Tubingen, D-72076 Tubingen, Germany
[2] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[3] Max Planck Inst MPI Informat, Saarbrucken, Germany
[4] MPI Intelligent Syst, MPI Informat, D-72076 Tubingen, Germany
[5] Univ Tubingen, D-72076 Tubingen, Germany
基金
欧洲研究理事会;
关键词
Visualization; Training; Standards; Task analysis; Dogs; Convolutional neural networks; Smoothing methods; Compositional zero-shot learning; graph neural networks; open-world recognition; scene understanding; CLASSIFICATION; NETWORKS;
D O I
10.1109/TPAMI.2022.3163667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compositional Zero-Shot learning (CZSL) aims to recognize unseen compositions of state and object visual primitives seen during training. A problem with standard CZSL is the assumption of knowing which unseen compositions will be available at test time. In this work, we overcome this assumption operating on the open world setting, where no limit is imposed on the compositional space at test time, and the search space contains a large number of unseen compositions. To address this problem, we propose a new approach, Compositional Cosine Graph Embeddings (Co-CGE), based on two principles. First, Co-CGE models the dependency between states, objects and their compositions through a graph convolutional neural network. The graph propagates information from seen to unseen concepts, improving their representations. Second, since not all unseen compositions are equally feasible, and less feasible ones may damage the learned representations, Co-CGE estimates a feasibility score for each unseen composition, using the scores as margins in a cosine similarity-based loss and as weights in the adjacency matrix of the graphs. Experiments show that our approach achieves state-of-the-art performances in standard CZSL while outperforming previous methods in the open world scenario.
引用
收藏
页码:1545 / 1560
页数:16
相关论文
共 50 条
  • [1] On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning
    Anwaar, Muhammad Umer
    Pan, Zhihui
    Kleinsteuber, Martin
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4645 - 4654
  • [2] Learning Graph Embeddings for Compositional Zero-shot Learning
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Tombari, Federico
    Akata, Zeynep
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 953 - 962
  • [3] Open World Compositional Zero-Shot Learning
    Mancini, Massimiliano
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Akata, Zeynep
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5218 - 5226
  • [4] Zero-Shot Compositional Concept Learning
    Xu, Guangyue
    Kordjamshidi, Parisa
    Chai, Joyce Y.
    1ST WORKSHOP ON META LEARNING AND ITS APPLICATIONS TO NATURAL LANGUAGE PROCESSING (METANLP 2021), 2021, : 19 - 27
  • [5] Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning
    Li, Yun
    Liu, Zhe
    Jha, Saurav
    Yao, Lina
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1782 - 1791
  • [6] Learning the Compositional Domains for Generalized Zero-shot Learning
    Dong, Hanze
    Fu, Yanwei
    Hwang, Sung Ju
    Sigal, Leonid
    Xue, Xiangyang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
  • [7] Learning Attention Propagation for Compositional Zero-Shot Learning
    Khan, Muhammad Gul Zain Ali
    Naeem, Muhammad Ferjad
    Van Gool, Luc
    Pagani, A.
    Stricker, Didier
    Afzal, Muhammad Zeshan
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3817 - 3826
  • [8] Learning Attention as Disentangler for Compositional Zero-shot Learning
    Hao, Shaozhe
    Han, Kai
    Wong, Kwan-Yee K.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15315 - 15324
  • [9] Learning Conditional Attributes for Compositional Zero-Shot Learning
    Wang, Qingsheng
    Liu, Lingqiao
    Jing, Chenchen
    Chen, Hao
    Liang, Guoqiang
    Wang, Peng
    Shen, Chunhua
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11197 - 11206
  • [10] Learning adversarial semantic embeddings for zero-shot recognition in open worlds
    Li, Tianqi
    Pang, Guansong
    Bai, Xiao
    Zheng, Jin
    Zhou, Lei
    Ning, Xin
    PATTERN RECOGNITION, 2024, 149