Provable Guarantees for Neural Networks via Gradient Feature Learning

被引:0
|
作者
Shi, Zhenmei [1 ]
Wei, Junyi [1 ]
Liang, Yingyu [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have achieved remarkable empirical performance, while the current theoretical analysis is not adequate for understanding their success, e.g., the Neural Tangent Kernel approach fails to capture their key feature learning ability, while recent analyses on feature learning are typically problem-specific. This work proposes a unified analysis framework for two-layer networks trained by gradient descent. The framework is centered around the principle of feature learning from gradients, and its effectiveness is demonstrated by applications in several prototypical problems such as mixtures of Gaussians and parity functions. The framework also sheds light on interesting network learning phenomena such as feature learning beyond kernels and the lottery ticket hypothesis.
引用
收藏
页数:71
相关论文
共 50 条
  • [21] Learning neural connectivity from firing activity: efficient algorithms with provable guarantees on topology
    Amin Karbasi
    Amir Hesam Salavati
    Martin Vetterli
    Journal of Computational Neuroscience, 2018, 44 : 253 - 272
  • [22] Online Learning for Predictive Control with Provable Regret Guarantees
    Muthirayan, Deepan
    Yuan, Jianjun
    Kalathil, Dileep
    Khargonekar, Pramod P.
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 6666 - 6671
  • [23] Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks
    Awasthi, Pranjal
    Frank, Natalie S.
    Mohri, Mehryar
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [24] Provable Gradient Variance Guarantees for Black-Box Variational Inference
    Domke, Justin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Model-based Reinforcement Learning with Provable Safety Guarantees via Control Barrier Functions
    Zhang, Hongchao
    Li, Zhouchi
    Clark, Andrew
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 792 - 798
  • [26] Supervised Feature Selection via Ensemble Gradient Information from Sparse Neural Networks
    Liu, Kaiting
    Atashgahi, Zahra
    Sokar, Ghada
    Pechenizkiy, Mykola
    Mocanu, Decebal Constantin
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [27] Post-training Quantization for Neural Networks with Provable Guarantees (vol 5, pg 373, 2023)
    Zhang, Jinjie
    Zhou, Yixuan
    Saab, Rayan
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2024, 6 (03): : 842 - 846
  • [28] Online Constrained Meta-Learning: Provable Guarantees for Generalization
    Xu, Siyuan
    Zhu, Minghui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
    Li, Yuanzhi
    Liang, Yingyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] Provable Repair of Deep Neural Networks
    Sotoudeh, Matthew
    Thakur, Aditya, V
    PROCEEDINGS OF THE 42ND ACM SIGPLAN INTERNATIONAL CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '21), 2021, : 588 - 603