Continuous limits of residual neural networks in case of large input data

被引:0
|
作者
Herty, Michael [1 ]
Thuenen, Anna [2 ]
Trimborn, Torsten [3 ]
Visconti, Giuseppe [4 ]
机构
[1] Rhein Westfal TH Aachen, Inst Geometrie & Prakt Math IGPM, Templergraben 55, D-52062 Aachen, Germany
[2] Tech Univ Clausthal, Inst Math, Erzstr 1, D-38678 Clausthal Zellerfeld, Germany
[3] NRW BANK, Kavalleriestr 22, D-40213 Dusseldorf, Germany
[4] Sapienza Univ Rome, Dept Math G Castelnuovo, Ple Aldo Moro 5, I-00185 Rome, Italy
关键词
Neural networks; mean-field limit; well-posedness; optimal control; controllability; LEARNING FRAMEWORK; CONVERGENCE;
D O I
10.2478/caim-2022-0008
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean-field limit and show well-posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.
引用
收藏
页码:96 / 120
页数:25
相关论文
共 50 条
  • [31] Large Networks and Graph Limits
    Balek, Martin
    Goodall, Andrew
    COMPUTER SCIENCE REVIEW, 2013, 10 : 35 - 46
  • [32] Rethinking residual connection in training large-scale spiking neural networks
    Li, Yudong
    Lei, Yunlin
    Yang, Xu
    NEUROCOMPUTING, 2025, 616
  • [33] Noisy recurrent neural networks: The continuous-time case
    Das, S
    Olurotimi, O
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (05): : 913 - 936
  • [34] Flush Air Data Sensing Based on Dimensionless Input and Output Neural Networks With Less Data
    Liu, Yang
    Zhang, Chen-an
    Yan, Xunshi
    Liu, Wen
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (02) : 1411 - 1425
  • [35] The Impact of Input Image Data Size on The Training Speed of Convolutional Neural Networks
    Lyu, Xinzhou
    2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 654 - 657
  • [36] BUILDING AN EXPERT SYSTEM FOR DEBUGGING FEM INPUT DATA WITH ARTIFICIAL NEURAL NETWORKS
    YEH, YC
    KUO, YH
    HSU, DS
    EXPERT SYSTEMS WITH APPLICATIONS, 1992, 5 (1-2) : 59 - 70
  • [37] Importance of input data normalization for the application of neural networks to complex industrial problems
    Sola, J.
    Sevilla, J.
    IEEE Transactions on Nuclear Science, 1997, 44 (3 Pt 3) : 1464 - 1468
  • [38] Input projection method for safe use of neural networks based on process data
    Hamalainen, JJ
    Jarvimaki, I
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 193 - 198
  • [39] An Input Data Set Compression Method for Improving the Training Ability of Neural Networks
    Tusor, Balazs
    Varkonyi-Koczy, Annamaria R.
    Rudas, Imre J.
    Klie, Gabor
    Kocsis, Gabor
    2012 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2012, : 1774 - 1779
  • [40] The influence of input data standardization method on prediction accuracy of artificial neural networks
    Anysz, Hubert
    Zbiciak, Artur
    Ibadov, Nabi
    XXV POLISH - RUSSIAN - SLOVAK SEMINAR -THEORETICAL FOUNDATION OF CIVIL ENGINEERING, 2016, 153 : 66 - 70