Statistical guarantees for sparse deep learning

被引:2
|
作者
Lederer, Johannes [1 ]
机构
[1] Ruhr Univ Bochum, Dept Math, Bochum, Germany
关键词
Sparsity; Regularization; Oracle inequalities; High-dimensionality; NEURAL-NETWORKS; INEQUALITIES; CLASSIFICATION; PERFORMANCE; SELECTION; ERROR;
D O I
10.1007/s10182-022-00467-3
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Neural networks are becoming increasingly popular in applications, but our math-ematical understanding of their potential and limitations is still limited. In this paper, we further this understanding by developing statistical guarantees for sparse deep learning. In contrast to previous work, we consider different types of sparsity, such as few active connections, few active nodes, and other norm-based types of sparsity. Moreover, our theories cover important aspects that previous theories have neglected, such as multiple outputs, regularization, and t'2-loss. The guarantees have a mild dependence on network widths and depths, which means that they support the application of sparse but wide and deep networks from a statistical perspective. Some of the concepts and tools that we use in our derivations are uncommon in deep learning and, hence, might be of additional interest.
引用
收藏
页码:231 / 258
页数:28
相关论文
共 50 条
  • [31] Deep learning optoacoustic tomography with sparse data
    Davoudi, Neda
    Dean-Ben, Xose Luis
    Razansky, Daniel
    NATURE MACHINE INTELLIGENCE, 2019, 1 (10) : 453 - 460
  • [32] Consistent Sparse Deep Learning: Theory and Computation
    Sun, Yan
    Song, Qifan
    Liang, Faming
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (540) : 1981 - 1995
  • [33] CONNECTIONS BETWEEN SPARSE ESTIMATION AND ROBUST STATISTICAL LEARNING
    Tsakonas, Efthymios
    Jalden, Joakim
    Sidiropoulos, Nicholas D.
    Ottersten, Bjorn
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 5489 - 5493
  • [34] Task-Driven Detection of Distribution Shifts With Statistical Guarantees for Robot Learning
    Farid, Alec
    Veer, Sushant
    Pachisia, Divyanshu
    Majumdar, Anirudha
    IEEE TRANSACTIONS ON ROBOTICS, 2025, 41 : 926 - 945
  • [35] α-VARIATIONAL INFERENCE WITH STATISTICAL GUARANTEES
    Yang, Yun
    Pati, Debdeep
    Bhattacharya, Anirban
    ANNALS OF STATISTICS, 2020, 48 (02): : 886 - 905
  • [36] Deep learning from a statistical perspective
    Yuan, Yubai
    Deng, Yujia
    Zhang, Yanqing
    Qu, Annie
    STAT, 2020, 9 (01):
  • [37] Understanding Deep Learning with Statistical Relevance
    Raez, Tim
    PHILOSOPHY OF SCIENCE, 2022, 89 (01) : 20 - 41
  • [38] On the Feasibility Guarantees of Deep Reinforcement Learning Solutions for Distribution System Operation
    Hosseini, Mohammad Mehdi
    Parvania, Masood
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (02) : 954 - 964
  • [39] Functional Gradient Boosting for Learning Residual-like Networks with Statistical Guarantees
    Nitanda, Atsushi
    Suzuki, Taiji
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2981 - 2990
  • [40] Learning Deep Neural Network Controllers for Dynamical Systems with Safety Guarantees
    Deshmukh, Jyotirmoy V.
    Kapinski, James P.
    Yamaguchi, Tomoya
    Prokhorov, Danil
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,