Rates of convergence for Laplacian semi-supervised learning with low labeling rates

被引:7
|
作者
Calder, Jeff [1 ]
Slepcev, Dejan [2 ]
Thorpe, Matthew [3 ,4 ]
机构
[1] Univ Minnesota, Sch Math, Minneapolis, MN 55455 USA
[2] Carnegie Mellon Univ, Dept Math Sci, Pittsburgh, PA USA
[3] Univ Manchester, Dept Math, Manchester, England
[4] Alan Turing Inst, London NW1 2DB, England
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
Semi-supervised learning; Regression; Asymptotic consistency; Gamma-convergence; PDEs on graphs; Non-local variational problems; Random walks on graphs; CONSISTENCY; GRAPH; REGULARIZATION;
D O I
10.1007/s40687-022-00371-x
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We investigate graph-based Laplacian semi-supervised learning at low labeling rates (ratios of labeled to total number of data points) and establish a threshold for the learning to be well posed. Laplacian learning uses harmonic extension on a graph to propagate labels. It is known that when the number of labeled data points is finite while the number of unlabeled data points tends to infinity, the Laplacian learning becomes degenerate and the solutions become roughly constant with a spike at each labeled data point. In this work, we allow the number of labeled data points to grow to infinity as the total number of data points grows. We show that for a random geometric graph with length scale epsilon > 0, if the labeling rate / << epsilon(2), then the solution becomes degenerate and spikes form. On the other hand, if beta >> epsilon(2), then Laplacian learning is well-posed and consistent with a continuum Laplace equation. Furthermore, in the well-posed setting we prove quantitative error estimates of O( epsilon beta-(1/2)) for the difference between the solutions of the discrete problem and continuum PDE, up to logarithmic factors. We also study p-Laplacian regularization and show the same degeneracy result when / << epsilon( p). The proofs of our well-posedness results use the random walk interpretation of Laplacian learning and PDE arguments, while the proofs of the ill-posedness results use gamma -convergence tools from the calculus of variations. We also present numerical results on synthetic and real data to illustrate our results.
引用
收藏
页数:42
相关论文
共 50 条
  • [31] Asymptotic Bayes risk of semi-supervised learning with uncertain labeling
    Leger, Victor
    Couillet, Romain
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 1831 - 1835
  • [32] Towards Semi-Supervised Learning for Deep Semantic Role Labeling
    Mehta, Sanket Vaibhav
    Lee, Jay Yoon
    Carbonell, Jaime
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4958 - 4963
  • [33] Unsupervised Selective Labeling for More Effective Semi-supervised Learning
    Wang, Xudong
    Lian, Long
    Yu, Stella X.
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 427 - 445
  • [34] Semi-supervised Learning Using a Constrained Labeling LDA Model
    Guzman, Rel
    Ochoa-Luna, Eduardo
    PROCEEDINGS OF THE 2016 IEEE ANDESCON, 2016,
  • [35] Semi-supervised learning using constrained laplacian regularized least squares
    Sousa, Celso A. R.
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [36] Laplacian pair-weight vector projection for semi-supervised learning
    Xue, Yangtao
    Zhang, Li
    INFORMATION SCIENCES, 2021, 573 : 1 - 19
  • [37] A Lie group Laplacian Support Vector Machine for semi-supervised learning
    Zhang, Yue
    Liu, Li
    Qiao, Qian
    Li, Fanzhang
    NEUROCOMPUTING, 2025, 630
  • [38] Time-Series Laplacian Semi-Supervised Learning for Indoor Localization
    Yoo, Jaehyun
    SENSORS, 2019, 19 (18)
  • [39] Semi-supervised Learning
    Adams, Niall
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 : 530 - 530
  • [40] Semi-supervised Mesh Segmentation and Labeling
    Lv, Jiajun
    Chen, Xinlei
    Huang, Jin
    Bao, Hujun
    COMPUTER GRAPHICS FORUM, 2012, 31 (07) : 2241 - 2248