How regularization affects the critical points in linear networks

被引:0
|
作者
Taghvaei, Amirhossein [1 ]
Kim, Jin W. [1 ]
Mehta, Prashant G. [1 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
关键词
NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with the problem of representing and learning a linear transformation using a linear neural network. In recent years, there is a growing interest in the study of such networks, in part due to the successes of deep learning. The main question of this body of research (and also of our paper) is related to the existence and optimality properties of the critical points of the mean-squared loss function. An additional primary concern of our paper pertains to the robustness of these critical points in the face of (a small amount of) regularization. An optimal control model is introduced for this purpose and a learning algorithm (backprop with weight decay) derived for the same using the Hamilton's formulation of optimal control. The formulation is used to provide a complete characterization of the critical points in terms of the solutions of a nonlinear matrix-valued equation, referred to as the characteristic equation. Analytical and numerical tools from bifurcation theory are used to compute the critical points via the solutions of the characteristic equation.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Detection and classification of critical points in piecewise linear vector fields
    Wang, Wentao
    Wang, Wenke
    Li, Sikun
    JOURNAL OF VISUALIZATION, 2018, 21 (01) : 147 - 161
  • [22] Detection and classification of critical points in piecewise linear vector fields
    Wentao Wang
    Wenke Wang
    Sikun Li
    Journal of Visualization, 2018, 21 : 147 - 161
  • [23] THE NUMBER OF CRITICAL-POINTS OF A PRODUCT OF POWERS OF LINEAR FUNCTIONS
    ORLIK, P
    TERAO, H
    INVENTIONES MATHEMATICAE, 1995, 120 (01) : 1 - 14
  • [24] Manifolds of Critical Points in a Quasi linear Model for Phase Transitions
    Drabek, Pavel
    Manasevich, Raul F.
    Takac, Peter
    NONLINEAR ELLIPTIC PARTIAL DIFFERENTIAL EQUATIONS, 2011, 540 : 95 - 134
  • [25] Invertible residual networks in the context of regularization theory for linear inverse problems
    Arndt, Clemens
    Denker, Alexander
    Dittmer, Soeren
    Heilenkoetter, Nick
    Iske, Meira
    Kluth, Tobias
    Maass, Peter
    Nickel, Judith
    INVERSE PROBLEMS, 2023, 39 (12)
  • [26] Comparison of Critical Adsorption Points of Ring Polymers with Linear Polymers
    Ziebarth, Jesse D.
    Gardiner, Abigail Anne
    Wang, Yongmei
    Jeong, Youncheol
    Ahn, Junyoung
    Jin, Ye
    Chang, Taihyun
    MACROMOLECULES, 2016, 49 (22) : 8780 - 8788
  • [27] Comparison of critical adsorption points of ring polymers with linear polymers
    Wang, Yongmei
    Chang, Taihyun
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [28] Rectified Linear Neural Networks with Tied-Scalar Regularization for LVCSR
    Zhang, Shiliang
    Jiang, Hui
    Wei, Si
    Dai, Li-Rong
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2635 - 2639
  • [29] Methodology for Detecting Critical Points in Pressurized Irrigation Networks with Multiple Water Supply Points
    Fernandez Garcia, I.
    Montesinos, P.
    Camacho Poyato, E.
    Rodriguez Diaz, J. A.
    WATER RESOURCES MANAGEMENT, 2014, 28 (04) : 1095 - 1109
  • [30] Methodology for Detecting Critical Points in Pressurized Irrigation Networks with Multiple Water Supply Points
    I. Fernández García
    P. Montesinos
    E. Camacho Poyato
    J. A. Rodríguez Díaz
    Water Resources Management, 2014, 28 : 1095 - 1109