IGCN: A Provably Informative GCN Embedding for Semi-Supervised Learning With Extremely Limited Labels

被引:0
|
作者
Zhang, Lin [1 ]
Song, Ran [1 ]
Tan, Wenhao [1 ]
Ma, Lin [2 ]
Zhang, Wei [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Meituan, Beijing 100102, Peoples R China
基金
中国国家自然科学基金;
关键词
Mutual information; Semisupervised learning; Symmetric matrices; Convolution; Training; Laplace equations; Task analysis; Graph algorithms; graph neural networks; information theory; limited labels;
D O I
10.1109/TPAMI.2024.3404655
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Networks (GNNs) have gained much more attention in the representation learning for the graph-structured data. However, the labels are always limited in the graph, which easily leads to the overfitting problem and causes the poor performance. To solve this problem, we propose a new framework called IGCN, short for Informative Graph Convolutional Network, where the objective of IGCN is designed to obtain the informative embeddings via discarding the task-irrelevant information of the graph data based on the mutual information. As the mutual information for irregular data is intractable to compute, our framework is optimized via a surrogate objective, where two terms are derived to approximate the original objective. For the former term, it demonstrates that the mutual information between the learned embeddings and the ground truth should be high, where we utilize the semi-supervised classification loss and the prototype based supervised contrastive learning loss for optimizing it. For the latter term, it requires that the mutual information between the learned node embeddings and the initial embeddings should be high and we propose to minimize the reconstruction loss between them to achieve the goal of maximizing the latter term from the feature level and the layer level, which contains the graph encoder-decoder module and a novel architecture GCN(Info). Moreover, we provably show that the designed GCN(Info) can better alleviate the information loss and preserve as much useful information of the initial embeddings as possible. Experimental results show that the IGCN outperforms the state-of-the-art methods on 7 popular datasets.
引用
收藏
页码:8396 / 8409
页数:14
相关论文
共 50 条
  • [41] Towards Semi-supervised Learning with Non-random Missing Labels
    Duan, Yue
    Zhao, Zhen
    Qi, Lei
    Zhou, Luping
    Wang, Lei
    Shi, Yinghuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16075 - 16085
  • [42] Semi-supervised learning with pseudo-negative labels for image classification
    Xu, Hao
    Xiao, Hui
    Hao, Huazheng
    Dong, Li
    Qiu, Xiaojie
    Peng, Chengbin
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [43] Training query filtering for semi-supervised learning to rank with pseudo labels
    Xin Zhang
    Ben He
    Tiejian Luo
    World Wide Web, 2016, 19 : 833 - 864
  • [44] Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels
    Hailat, Zeyad
    Chen, Xue-Wen
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 907 - 912
  • [45] FocalMatch: Mitigating Class Imbalance of Pseudo Labels in Semi-Supervised Learning
    Deng, Yongkun
    Zhang, Chenghao
    Yang, Nan
    Chen, Huaming
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [46] The helpfulness of category labels in semi-supervised learning depends on category structure
    Wai Keen Vong
    Daniel J. Navarro
    Andrew Perfors
    Psychonomic Bulletin & Review, 2016, 23 : 230 - 238
  • [47] CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy Labels
    Higashimoto, Ryota
    Yoshida, Soh
    Muneyasu, Mitsuji
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [48] Instance-Proxy Loss for Semi-supervised Learning with Coarse Labels
    Wu, Hongyan
    Miao, Qinghai
    Guo, Haiyun
    Huang, Min
    Wang, Jinqiao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 238 - 250
  • [49] Nuclear-norm based semi-supervised multiple labels learning
    Liu, Yang
    Nie, Feiping
    Gao, QuanXue
    NEUROCOMPUTING, 2018, 275 : 940 - 947
  • [50] Data Based Construction of Kernels for Semi-Supervised Learning With Less Labels
    Mhaskar, Hrushikesh
    Pereverzyev, Sergei V.
    Semenov, Vasyl Yu.
    Semenova, Evgeniya V.
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 5