Large deviations in the perceptron model and consequences for active learning

被引:0
|
作者
Cui H. [1 ]
Saglietti L. [1 ]
Zdeborov'A L. [1 ]
机构
[1] Institute of Physics, École Polytechnique Fédérale de Lausanne, Lausanne
来源
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
Active learning; Large deviations; Perceptron model;
D O I
10.1088/2632-2153/abfbbb
中图分类号
学科分类号
摘要
Active learning (AL) is a branch of machine learning that deals with problems where unlabeled data is abundant yet obtaining labels is expensive. The learning algorithm has the possibility of querying a limited number of samples to obtain the corresponding labels, subsequently used for supervised learning. In this work, we consider the task of choosing the subset of samples to be labeled from a fixed finite pool of samples. We assume the pool of samples to be a random matrix and the ground truth labels to be generated by a single-layer teacher random neural network. We employ replica methods to analyze the large deviations for the accuracy achieved after supervised learning on a subset of the original pool. These large deviations then provide optimal achievable performance boundaries for any AL algorithm. We show that the optimal learning performance can be efficiently approached by simple message-passing AL algorithms. We also provide a comparison with the performance of some other popular active learning strategies. © 2021 The Author(s).
引用
收藏
相关论文
共 50 条
  • [21] Large deviations for the stochastic shell model of turbulence
    Manna, U.
    Sritharan, S. S.
    Sundar, P.
    NODEA-NONLINEAR DIFFERENTIAL EQUATIONS AND APPLICATIONS, 2009, 16 (04): : 493 - 521
  • [22] Large deviations for the stochastic shell model of turbulence
    U. Manna
    S. S. Sritharan
    P. Sundar
    Nonlinear Differential Equations and Applications NoDEA, 2009, 16 : 493 - 521
  • [23] Large deviations for the Ginzburg–Landau ∇φ interface model
    T. Funaki
    T. Nishikawa
    Probability Theory and Related Fields, 2001, 120 : 535 - 568
  • [24] Current Large Deviations in a Driven Dissipative Model
    Bodineau, T.
    Lagouge, M.
    JOURNAL OF STATISTICAL PHYSICS, 2010, 139 (02) : 201 - 218
  • [25] Static large deviations for a reaction–diffusion model
    J. Farfán
    C. Landim
    K. Tsunoda
    Probability Theory and Related Fields, 2019, 174 : 49 - 101
  • [26] Large deviations of the ballistic Levy walk model
    Wang, Wanli
    Holl, Marc
    Barkai, Eli
    PHYSICAL REVIEW E, 2020, 102 (05)
  • [27] AN OPTIMAL PARALLEL PERCEPTRON LEARNING ALGORITHM FOR A LARGE TRAINING SET
    HONG, TP
    TSENG, SS
    PARALLEL COMPUTING, 1994, 20 (03) : 347 - 352
  • [28] LARGE DEVIATIONS FOR A REACTION-DIFFUSION MODEL
    JONALASINIO, G
    LANDIM, C
    VARES, ME
    PROBABILITY THEORY AND RELATED FIELDS, 1993, 97 (03) : 339 - 361
  • [29] Spherical Model on a Cayley Tree: Large Deviations
    Patrick, A. E.
    JOURNAL OF STATISTICAL PHYSICS, 2017, 166 (01) : 45 - 71
  • [30] Yielding and large deviations in micellar gels: a model
    Nandi, Saroj Kumar
    Chakraborty, Bulbul
    Sood, A. K.
    Ramaswamy, Sriram
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2013,