Provable Guarantees for Neural Networks via Gradient Feature Learning

被引:0
|
作者
Shi, Zhenmei [1 ]
Wei, Junyi [1 ]
Liang, Yingyu [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have achieved remarkable empirical performance, while the current theoretical analysis is not adequate for understanding their success, e.g., the Neural Tangent Kernel approach fails to capture their key feature learning ability, while recent analyses on feature learning are typically problem-specific. This work proposes a unified analysis framework for two-layer networks trained by gradient descent. The framework is centered around the principle of feature learning from gradients, and its effectiveness is demonstrated by applications in several prototypical problems such as mixtures of Gaussians and parity functions. The framework also sheds light on interesting network learning phenomena such as feature learning beyond kernels and the lottery ticket hypothesis.
引用
收藏
页数:71
相关论文
共 50 条
  • [41] Enhanced gradient learning for deep neural networks
    Yan, Ming
    Yang, Jianxi
    Chen, Cen
    Zhou, Joey Tianyi
    Pan, Yi
    Zeng, Zeng
    IET IMAGE PROCESSING, 2022, 16 (02) : 365 - 377
  • [42] The natural gradient learning algorithm for neural networks
    Amari, S
    THEORETICAL ASPECTS OF NEURAL COMPUTATION: A MULTIDISCIPLINARY PERSPECTIVE, 1998, : 1 - 15
  • [43] Functional Gradient Boosting for Learning Residual-like Networks with Statistical Guarantees
    Nitanda, Atsushi
    Suzuki, Taiji
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2981 - 2990
  • [44] Random Feature Amplification: Feature Learning and Generalization in Neural Networks
    Frei, Spencer
    Chatterji, Niladri S.
    Bartlett, Peter L.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [45] Deciphering Raw Data in Neuro-Symbolic Learning with Provable Guarantees
    Tao, Lue
    Huang, Yu-Xuan
    Dai, Wang-Zhou
    Jiang, Yuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15310 - 15318
  • [46] AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks
    Song, Weiping
    Shi, Chence
    Xiao, Zhiping
    Duan, Zhijian
    Xu, Yewen
    Zhang, Ming
    Tang, Jian
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1161 - 1170
  • [47] Provable Learning of Noisy-or Networks
    Arora, Sanjeev
    Ge, Rong
    Ma, Tengyu
    Risteski, Andrej
    STOC'17: PROCEEDINGS OF THE 49TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2017, : 1057 - 1066
  • [48] Sensorimotor Learning With Stability Guarantees via Autonomous Neural Dynamic Policies
    Totsila, Dionis
    Chatzilygeroudis, Konstantinos
    Modugno, Valerio
    Hadjivelichkov, Denis
    Kanoulas, Dimitrios
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02): : 1760 - 1767
  • [49] Provable Identifiability of Two-Layer ReLU Neural Networks via LASSO Regularization
    Li G.
    Wang G.
    Ding J.
    IEEE Transactions on Information Theory, 2023, 69 (09) : 5921 - 5935
  • [50] Learning Student Networks via Feature Embedding
    Chen, Hanting
    Wang, Yunhe
    Xu, Chang
    Xu, Chao
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (01) : 25 - 35