The Loss Surface of Deep and Wide Neural Networks

Cited by: 0
Authors
Nguyen, Quynh [1]
Hein, Matthias [1]
Affiliations
[1] Saarland Univ, Dept Math & Comp Sci, Saarbrucken, Germany
Funding
European Research Council;
Keywords
LOCAL MINIMA;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While the optimization problem behind deep neural networks is highly non-convex, it is frequently observed in practice that training deep networks seems possible without getting stuck in suboptimal points. It has been argued that this is the case because all local minima are close to being globally optimal. We show that this is (almost) true: in fact, almost all local minima are globally optimal for a fully connected network with squared loss and an analytic activation function, given that the number of hidden units in one layer of the network is larger than the number of training points and the network structure from this layer on is pyramidal.
Pages: 10
Related papers
50 in total
  • [11] CONTRASTIVE-CENTER LOSS FOR DEEP NEURAL NETWORKS
    Qi, Ce
    Su, Fei
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2851 - 2855
  • [12] Wide Hidden Expansion Layer for Deep Convolutional Neural Networks
    Wang, Min
    Liu, Baoyuan
    Foroosh, Hassan
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 923 - 931
  • [13] WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING
    Gao, Zhan
    Ribeiro, Alejandro
    Gama, Fernando
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5270 - 5274
  • [14] Experimental exploration on loss surface of deep neural network
    Yuan, Qunyong
    Xiao, Nanfeng
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2020, 30 (04) : 860 - 873
  • [15] Loss Function Dynamics and Landscape for Deep Neural Networks Trained with Quadratic Loss
    M. S. Nakhodnov
    M. S. Kodryan
    E. M. Lobacheva
    D. S. Vetrov
    Doklady Mathematics, 2022, 106 : S43 - S62
  • [16] Loss Function Dynamics and Landscape for Deep Neural Networks Trained with Quadratic Loss
    Nakhodnov, M. S.
    Kodryan, M. S.
    Lobacheva, E. M.
    Vetrov, D. S.
    DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S43 - S62
  • [17] Understanding the Loss Surface of Neural Networks for Binary Classification
    Liang, Shiyu
    Sun, Ruoyu
    Li, Yixuan
    Srikant, R.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [18] Triplet Deep Hashing with Joint Supervised Loss Based on Deep Neural Networks
    Li, Mingyong
    An, Ziye
    Wei, Qinmin
    Xiang, Kaiyue
    Ma, Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [19] Deep feature loss to denoise OCT images using deep neural networks
    Mehdizadeh, Maryam
    MacNish, Cara
    Xiao, Di
    Alonso-Caneiro, David
    Kugelman, Jason
    Bennamoun, Mohammed
    JOURNAL OF BIOMEDICAL OPTICS, 2021, 26 (04)
  • [20] Training Deep Neural Networks via Direct Loss Minimization
    Song, Yang
    Schwing, Alexander G.
    Zemel, Richard S.
    Urtasun, Raquel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48