Graph embedding based multi-label Zero-shot Learning

被引:6
|
作者
Zhang, Haigang [1 ]
Meng, Xianglong [1 ]
Cao, Weipeng [2 ]
Liu, Ye [2 ]
Ming, Zhong [2 ]
Yang, Jinfeng [1 ]
机构
[1] Shenzhen Polytech, Inst Appl Artificial Intelligence, Guangdong Hong Kong Macao Greater Bay Area, Shenzhen 518055, Peoples R China
[2] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot Learning; Knowledge graph; Multi-label classification; Feature embedding; NETWORKS;
D O I
10.1016/j.neunet.2023.08.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label Zero-shot Learning (ZSL) is more reasonable and realistic than standard single-label ZSL because several objects can co-exist in a natural image in real scenarios. Intra-class feature entanglement is a significant factor influencing the alignment of visual and semantic features, resulting in the model's inability to recognize unseen samples comprehensively and completely. We observe that existing multi-label ZSL methods place a greater emphasis on attention-based refinement and decoupling of visual features, while ignoring the relationship between label semantics. Relying on label correlations to solve multi-label ZSL tasks has not been deeply studied. In this paper, we make full use of the co-occurrence relationship between category labels and build a directed weighted semantic graph based on statistics and prior knowledge, in which node features represent category semantics and weighted edges represent conditional probabilities of label co-occurrence. To guide the targeted extraction of visual features, node features and edge set weights are simultaneously updated and refined, and embedded into the visual feature extraction network from a global and local perspective. The proposed method's effectiveness was demonstrated by simulation results on two challenging multi-label ZSL benchmarks: NUS-WIDE and Open Images. In comparison to stateof-the-art models, our model achieves an absolute gain of 2.4% mAP on NUS-WIDE and 2.1% mAP on Open Images respectively.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [21] Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label Graphs
    Lu, Jueqing
    Du, Lan
    Liu, Ming
    Dipnall, Joanna
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2935 - 2943
  • [22] Multi-label zero-shot human action recognition via joint latent ranking embedding
    Wang, Qian
    Chen, Ke
    NEURAL NETWORKS, 2020, 122 (122) : 1 - 23
  • [23] MULTI-LABEL ZERO-SHOT AUDIO CLASSIFICATION WITH TEMPORAL ATTENTION
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 250 - 254
  • [24] Reformulating Zero-shot Action Recognition for Multi-label Actions
    Kerrigan, Alec
    Duarte, Kevin
    Rawat, Yogesh S.
    Shah, Mubarak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] Deep Ranking for Image Zero-Shot Multi-Label Classification
    Ji, Zhong
    Cui, Biying
    Li, Huihui
    Jiang, Yu-Gang
    Xiang, Tao
    Hospedales, Timothy
    Fu, Yanwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 6549 - 6560
  • [26] MULTI-LABEL AUDIO CLASSIFICATION WITH A NOISY ZERO-SHOT TEACHER
    Braun, Sebastian
    Gamper, Hannes
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 240 - 244
  • [27] Pairnorm based Graphical Convolution Network for zero-shot multi-label classification
    Chauhan, Vikas
    Tiwari, Aruna
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [28] Efficient max-margin multi-label classification with applications to zero-shot learning
    Bharath Hariharan
    S. V. N. Vishwanathan
    Manik Varma
    Machine Learning, 2012, 88 : 127 - 155
  • [29] Leveraging Pre-Trained Extreme Multi-Label Classifiers for Zero-Shot Learning
    Ostapuk, Natalia
    Dolamic, Ljiljana
    Mermoud, Alain
    Cudre-Mauroux, Philippe
    2024 11TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS 2024, 2024, : 233 - 236
  • [30] Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification
    Zhang, Yu
    Shen, Zhihong
    Wu, Chieh-Han
    Xie, Boya
    Hao, Junheng
    Wang, Ye-Yi
    Wang, Kuansan
    Han, Jiawei
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3162 - 3173