Graph embedding based multi-label Zero-shot Learning

被引:6
|
作者
Zhang, Haigang [1 ]
Meng, Xianglong [1 ]
Cao, Weipeng [2 ]
Liu, Ye [2 ]
Ming, Zhong [2 ]
Yang, Jinfeng [1 ]
机构
[1] Shenzhen Polytech, Inst Appl Artificial Intelligence, Guangdong Hong Kong Macao Greater Bay Area, Shenzhen 518055, Peoples R China
[2] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot Learning; Knowledge graph; Multi-label classification; Feature embedding; NETWORKS;
D O I
10.1016/j.neunet.2023.08.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label Zero-shot Learning (ZSL) is more reasonable and realistic than standard single-label ZSL because several objects can co-exist in a natural image in real scenarios. Intra-class feature entanglement is a significant factor influencing the alignment of visual and semantic features, resulting in the model's inability to recognize unseen samples comprehensively and completely. We observe that existing multi-label ZSL methods place a greater emphasis on attention-based refinement and decoupling of visual features, while ignoring the relationship between label semantics. Relying on label correlations to solve multi-label ZSL tasks has not been deeply studied. In this paper, we make full use of the co-occurrence relationship between category labels and build a directed weighted semantic graph based on statistics and prior knowledge, in which node features represent category semantics and weighted edges represent conditional probabilities of label co-occurrence. To guide the targeted extraction of visual features, node features and edge set weights are simultaneously updated and refined, and embedded into the visual feature extraction network from a global and local perspective. The proposed method's effectiveness was demonstrated by simulation results on two challenging multi-label ZSL benchmarks: NUS-WIDE and Open Images. In comparison to stateof-the-art models, our model achieves an absolute gain of 2.4% mAP on NUS-WIDE and 2.1% mAP on Open Images respectively.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [41] Expanding Semantic Knowledge for Zero-Shot Graph Embedding
    Wang, Zheng
    Shao, Ruihang
    Wang, Changping
    Hu, Changjun
    Wang, Chaokun
    Gong, Zhiguo
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 394 - 402
  • [42] Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
  • [43] Transductive Unbiased Embedding for Zero-Shot Learning
    Song, Jie
    Shen, Chengchao
    Yang, Yezhou
    Liu, Yang
    Song, Mingli
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1024 - 1033
  • [44] Generalised Zero-shot Learning with Multi-modal Embedding Spaces
    Felix, Rafael
    Sasdelli, Michele
    Harwood, Ben
    Carneiro, Gustavo
    2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [45] Disentangled Ontology Embedding for Zero-shot Learning
    Geng, Yuxia
    Chen, Jiaoyan
    Zhang, Wen
    Xu, Yajing
    Chen, Zhuo
    Pan, Jeff Z.
    Huang, Yufeng
    Xiong, Feiyu
    Chen, Huajun
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 443 - 453
  • [46] Learning a Deep Embedding Model for Zero-Shot Learning
    Zhang, Li
    Xiang, Tao
    Gong, Shaogang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
  • [47] Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations
    Huynh, Dat
    Elhamifar, Ehsan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8452 - 8463
  • [48] Graph and Autoencoder Based Feature Extraction for Zero-shot Learning
    Liu, Yang
    Xie, Deyan
    Gao, Quanxue
    Han, Jungong
    Wang, Shujian
    Gao, Xinbo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3038 - 3044
  • [49] A Zero-shot Learning Method with a Multi-modal Knowledge Graph
    Zhang, Yuhong
    Shu, Haitao
    Bu, Chenyang
    Hu, Xuegang
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 391 - 395
  • [50] Inductive Zero-Shot Image Annotation via Embedding Graph
    Wang, Fangxin
    Liu, Jie
    Zhang, Shuwu
    Zhang, Guixuan
    Li, Yuejun
    Yuan, Fei
    IEEE ACCESS, 2019, 7 : 107816 - 107830