Learning Graph Embeddings for Open World Compositional Zero-Shot Learning

Cited by: 22

Authors:

Mancini, Massimiliano ^{[1]}

Naeem, Muhammad Ferjad ^{[2]}

Xian, Yongqin ^{[2,3]}

Akata, Zeynep ^{[4,5]}

Affiliations:

[1] Univ Tubingen, D-72076 Tubingen, Germany

[2] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[3] Max Planck Inst MPI Informat, Saarbrucken, Germany

[4] MPI Intelligent Syst, MPI Informat, D-72076 Tubingen, Germany

[5] Univ Tubingen, D-72076 Tubingen, Germany

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024, Vol. 46, No. 3

Funding:

European Research Council;

Keywords:

Visualization; Training; Standards; Task analysis; Dogs; Convolutional neural networks; Smoothing methods; Compositional zero-shot learning; graph neural networks; open-world recognition; scene understanding; CLASSIFICATION; NETWORKS;

DOI:

10.1109/TPAMI.2022.3163667

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions of state and object visual primitives seen during training. A problem with standard CZSL is the assumption of knowing which unseen compositions will be available at test time. In this work, we overcome this assumption by operating in the open-world setting, where no limit is imposed on the compositional space at test time and the search space contains a large number of unseen compositions. To address this problem, we propose a new approach, Compositional Cosine Graph Embeddings (Co-CGE), based on two principles. First, Co-CGE models the dependency between states, objects, and their compositions through a graph convolutional neural network. The graph propagates information from seen to unseen concepts, improving their representations. Second, since not all unseen compositions are equally feasible, and less feasible ones may damage the learned representations, Co-CGE estimates a feasibility score for each unseen composition, using the scores as margins in a cosine similarity-based loss and as weights in the adjacency matrix of the graphs. Experiments show that our approach achieves state-of-the-art performance in standard CZSL while outperforming previous methods in the open-world scenario.
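
The abstract's idea of using per-composition scores as margins in a cosine similarity-based loss can be illustrated with a minimal NumPy sketch. This is an illustration under stated assumptions, not the paper's formulation: the function names, the temperature value, the sign and scale of the margins, and the toy data are all hypothetical, and the feasibility estimation and the graph convolutional network are not modeled here.

```python
import numpy as np

def cosine_logits(image_feat, comp_embeds):
    """Cosine similarity between one image feature and every composition embedding."""
    x = image_feat / np.linalg.norm(image_feat)
    c = comp_embeds / np.linalg.norm(comp_embeds, axis=1, keepdims=True)
    return c @ x

def margin_cross_entropy(image_feat, comp_embeds, target, margins, temperature=0.05):
    """Softmax cross-entropy over cosine logits with a per-composition margin.

    Assumed convention (hypothetical): margins[y] is added to the logit of
    composition y before the softmax; the target logit is left unshifted.
    """
    logits = cosine_logits(image_feat, comp_embeds)
    shifted = logits + margins
    shifted[target] = logits[target]   # no margin on the ground-truth composition
    z = shifted / temperature
    z = z - z.max()                    # subtract the max for numerical stability
    log_prob = z[target] - np.log(np.exp(z).sum())
    return -log_prob

# Toy data: 4 compositions with 8-dimensional embeddings (values are arbitrary).
rng = np.random.default_rng(0)
image_feat = rng.normal(size=8)
comp_embeds = rng.normal(size=(4, 8))

# Hypothetical margins: zero for two "seen" compositions, positive for two
# "unseen" ones (in the paper these would be derived from feasibility scores).
margins = np.array([0.0, 0.0, 0.1, 0.3])

loss_plain = margin_cross_entropy(image_feat, comp_embeds, target=0, margins=np.zeros(4))
loss_margin = margin_cross_entropy(image_feat, comp_embeds, target=0, margins=margins)
```

With this convention, a positive margin on a non-target composition raises its logit during training, so the loss grows and the model is pushed to separate the image from that composition more strongly.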


Pages: 1545-1560

Page count: 16
