How regularization affects the critical points in linear networks

被引:0
|
作者
Taghvaei, Amirhossein [1 ]
Kim, Jin W. [1 ]
Mehta, Prashant G. [1 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
关键词
NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with the problem of representing and learning a linear transformation using a linear neural network. In recent years, there is a growing interest in the study of such networks, in part due to the successes of deep learning. The main question of this body of research (and also of our paper) is related to the existence and optimality properties of the critical points of the mean-squared loss function. An additional primary concern of our paper pertains to the robustness of these critical points in the face of (a small amount of) regularization. An optimal control model is introduced for this purpose and a learning algorithm (backprop with weight decay) derived for the same using the Hamilton's formulation of optimal control. The formulation is used to provide a complete characterization of the critical points in terms of the solutions of a nonlinear matrix-valued equation, referred to as the characteristic equation. Analytical and numerical tools from bifurcation theory are used to compute the critical points via the solutions of the characteristic equation.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Understanding how learning affects agreement process in social networks
    Maity, Suman Kalyan
    Porwal, Abhishek
    Mukherjee, Animesh
    2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 228 - 235
  • [42] How the reversible change of contact networks affects the epidemic spreading
    Shu, Xincheng
    Ruan, Zhongyuan
    NONLINEAR DYNAMICS, 2024, 112 (01) : 731 - 739
  • [43] How clustering affects the bond percolation threshold in complex networks
    Gleeson, James P.
    Melnik, Sergey
    Hackett, Adam
    PHYSICAL REVIEW E, 2010, 81 (06)
  • [44] Understanding How Image Quality Affects Transformer Neural Networks
    Varga, Domonkos
    SIGNALS, 2024, 5 (03): : 562 - 579
  • [45] LINEAR AND NONLINEAR RELAXATION AND CLUSTER DYNAMICS NEAR CRITICAL-POINTS
    KRETSCHMER, R
    BINDER, K
    STAUFFER, D
    JOURNAL OF STATISTICAL PHYSICS, 1976, 15 (04) : 267 - 297
  • [46] SIMPLE TESTS FOR CLASSIFYING CRITICAL-POINTS OF QUADRATICS WITH LINEAR CONSTRAINTS
    BINDING, P
    AMERICAN MATHEMATICAL MONTHLY, 1991, 98 (10): : 949 - 954
  • [47] Critical points of the linear entropy for pure L-qubit states
    Maciazek, Tomasz
    Sawicki, Adam
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2015, 48 (04)
  • [48] How Centrality of Driver Nodes Affects Controllability of Complex Networks
    Song, Guang-Hua
    Li, Xin-Feng
    Lu, Zhe-Ming
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (08) : 1340 - 1348
  • [49] How the reversible change of contact networks affects the epidemic spreading
    Xincheng Shu
    Zhongyuan Ruan
    Nonlinear Dynamics, 2024, 112 : 731 - 739
  • [50] Understanding How Image Quality Affects Deep Neural Networks
    Dodge, Samuel
    Karam, Lina
    2016 EIGHTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2016,