Neural Network Identifiability for a Family of Sigmoidal Nonlinearities

被引:5
|
作者
Vlacic, Verner [1 ]
Bolcskei, Helmut [1 ]
机构
[1] Swiss Fed Inst Technol, Chair Math Informat Sci, Zurich, Switzerland
关键词
Deep neural networks; Identifiability; Sigmoidal nonlinearities; OPTIMAL APPROXIMATION;
D O I
10.1007/s00365-021-09544-3
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper addresses the following question of neural network identifiability: Does the input-output map realized by a feed-forward neural network with respect to a given nonlinearity uniquely specify the network architecture, weights, and biases? The existing literature on the subject (Sussman in Neural Netw 5(4):589-593, 1992; Albertini et al. in Artificial neural networks for speech and vision, 1993; Fefferman in Rev Mat Iberoam 10(3):507-555, 1994) suggests that the answer should be yes, up to certain symmetries induced by the nonlinearity, and provided that the networks under consideration satisfy certain "genericity conditions." The results in Sussman (1992) and Albertini et al. (1993) apply to networks with a single hidden layer and in Fefferman (1994) the networks need to be fully connected. In an effort to answer the identifiability question in greater generality, we derive necessary genericity conditions for the identifiability of neural networks of arbitrary depth and connectivity with an arbitrary nonlinearity. Moreover, we construct a family of nonlinearities for which these genericity conditions are minimal, i.e., both necessary and sufficient. This family is large enough to approximate many commonly encountered nonlinearities to within arbitrary precision in the uniform norm.
引用
收藏
页码:173 / 224
页数:52
相关论文
共 50 条
  • [21] LANGEVIN MACHINE - A NEURAL NETWORK BASED ON STOCHASTICALLY JUSTIFIABLE SIGMOIDAL FUNCTION
    NEELAKANTA, PS
    SUDHAKAR, R
    DEGROFF, D
    BIOLOGICAL CYBERNETICS, 1991, 65 (05) : 331 - 338
  • [22] Construction and approximation rate for feedforward neural network operators with sigmoidal functions
    Yu, Dansheng
    Cao, Feilong
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2025, 453
  • [23] Sigmoidal Approximations of a Nonautonomous Neural Network with Infinite Delay and Heaviside Function
    Kloeden, Peter E.
    Villarragut, Victor M.
    JOURNAL OF DYNAMICS AND DIFFERENTIAL EQUATIONS, 2022, 34 (01) : 721 - 745
  • [24] Modeling power amplifier nonlinearities with artifical neural network
    Pochmara, J.
    MIXDES 2007: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS:, 2007, : 449 - 453
  • [25] Orthogonal Neural Network for Nonlinearities Reduction of Multicarrier Transmitters
    Rodriguez, Nibaldo
    Duran, Orlando
    Cubillos, Claudio
    MICAI 2007: SIXTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, : 305 - 310
  • [26] Nonlinear Multiview Analysis: Identifiability and Neural Network-based Implementation
    Lyu, Qi
    Fu, Xiao
    2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
  • [27] Structural identifiability of generalized constraint neural network models for nonlinear regression
    Yang, Shuang-Hong
    Hu, Bao-Gang
    Cournede, Paul-Henry
    NEUROCOMPUTING, 2008, 72 (1-3) : 392 - 400
  • [28] Nonlinear Multiview Analysis: Identifiability and Neural Network-Assisted Implementation
    Lyu, Qi
    Fu, Xiao
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 2697 - 2712
  • [29] On the training error and generalization error of neural network regression without identifiability
    Hagiwara, K
    KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1575 - 1579
  • [30] Multiperiodicity of Periodically Oscillated Discrete-Time Neural Networks with Transient Excitatory Self-Connections and Sigmoidal Nonlinearities
    Huang, Zhenkun
    Wang, Xinghua
    Feng, Chunhua
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (10): : 1643 - 1655