The Loss Surface of Deep and Wide Neural Networks

被引：0

作者：

Quynh Nguyen ^{[1
]}

Hein, Matthias ^{[1
]}

机构：

[1] Saarland Univ, Dept Math & Comp Sci, Saarbrucken, Germany

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

基金：

欧洲研究理事会;

关键词：

LOCAL MINIMA;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While the optimization problem behind deep neural networks is highly non-convex, it is frequently observed in practice that training deep networks seems possible without getting stuck in suboptimal points. It has been argued that this is the case as all local minima are close to being globally optimal. We show that this is (almost) true, in fact almost all local minima are globally optimal, for a fully connected network with squared loss and analytic activation function given that the number of hidden units of one layer of the network is larger than the number of training points and the network structure from this layer on is pyramidal.

引用

页数：10

共 50 条

[11] CONTRASTIVE-CENTER LOSS FOR DEEP NEURAL NETWORKS
Qi, Ce
Su, Fei
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2851 - 2855
[12] Wide Hidden Expansion Layer for Deep Convolutional Neural Networks
Wang, Min
Liu, Baoyuan
Foroosh, Hassan
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 923 - 931
[13] WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING
Gao, Zhan
Ribeiro, Alejandro
Gama, Fernando
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5270 - 5274
[14] Experimental exploration on loss surface of deep neural network
Yuan, Qunyong
Xiao, Nanfeng
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2020, 30 (04) : 860 - 873
[15] Loss Function Dynamics and Landscape for Deep Neural Networks Trained with Quadratic Loss
M. S. Nakhodnov
M. S. Kodryan
E. M. Lobacheva
D. S. Vetrov
Doklady Mathematics, 2022, 106 : S43 - S62
[16] Loss Function Dynamics and Landscape for Deep Neural Networks Trained with Quadratic Loss
Nakhodnov, M. S.
Kodryan, M. S.
Lobacheva, E. M.
Vetrov, D. S.
DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S43 - S62
[17] Understanding the Loss Surface of Neural Networks for Binary Classification
Liang, Shiyu
Sun, Ruoyu
Li, Yixuan
Srikant, R.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[18] Triplet Deep Hashing with Joint Supervised Loss Based on Deep Neural Networks
Li, Mingyong
An, Ziye
Wei, Qinmin
Xiang, Kaiyue
Ma, Yan
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
[19] Deep feature loss to denoise OCT images using deep neural networks
Mehdizadeh, Maryam
MacNish, Cara
Xiao, Di
Alonso-Caneiro, David
Kugelman, Jason
Bennamoun, Mohammed
JOURNAL OF BIOMEDICAL OPTICS, 2021, 26 (04)
[20] Training Deep Neural Networks via Direct Loss Minimization
Song, Yang
Schwing, Alexander G.
Zemel, Richard S.
Urtasun, Raquel
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48

← 1 2 3 4 5 →