Continuous limits of residual neural networks in case of large input data

被引：0

作者：

Herty, Michael ^{[1
]}

Thuenen, Anna ^{[2
]}

Trimborn, Torsten ^{[3
]}

Visconti, Giuseppe ^{[4
]}

机构：

[1] Rhein Westfal TH Aachen, Inst Geometrie & Prakt Math IGPM, Templergraben 55, D-52062 Aachen, Germany

[2] Tech Univ Clausthal, Inst Math, Erzstr 1, D-38678 Clausthal Zellerfeld, Germany

[3] NRW BANK, Kavalleriestr 22, D-40213 Dusseldorf, Germany

[4] Sapienza Univ Rome, Dept Math G Castelnuovo, Ple Aldo Moro 5, I-00185 Rome, Italy

来源：

COMMUNICATIONS IN APPLIED AND INDUSTRIAL MATHEMATICS | 2022年 / 13卷 / 01期

关键词：

Neural networks; mean-field limit; well-posedness; optimal control; controllability; LEARNING FRAMEWORK; CONVERGENCE;

D O I：

10.2478/caim-2022-0008

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean-field limit and show well-posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.

引用

页码：96 / 120

页数：25

共 50 条

[31] Large Networks and Graph Limits
Balek, Martin
Goodall, Andrew
COMPUTER SCIENCE REVIEW, 2013, 10 : 35 - 46
[32] Rethinking residual connection in training large-scale spiking neural networks
Li, Yudong
Lei, Yunlin
Yang, Xu
NEUROCOMPUTING, 2025, 616
[33] Noisy recurrent neural networks: The continuous-time case
Das, S
Olurotimi, O
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (05): : 913 - 936
[34] Flush Air Data Sensing Based on Dimensionless Input and Output Neural Networks With Less Data
Liu, Yang
Zhang, Chen-an
Yan, Xunshi
Liu, Wen
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (02) : 1411 - 1425
[35] The Impact of Input Image Data Size on The Training Speed of Convolutional Neural Networks
Lyu, Xinzhou
2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 654 - 657
[36] BUILDING AN EXPERT SYSTEM FOR DEBUGGING FEM INPUT DATA WITH ARTIFICIAL NEURAL NETWORKS
YEH, YC
KUO, YH
HSU, DS
EXPERT SYSTEMS WITH APPLICATIONS, 1992, 5 (1-2) : 59 - 70
[37] Importance of input data normalization for the application of neural networks to complex industrial problems
Sola, J.
Sevilla, J.
IEEE Transactions on Nuclear Science, 1997, 44 (3 Pt 3) : 1464 - 1468
[38] Input projection method for safe use of neural networks based on process data
Hamalainen, JJ
Jarvimaki, I
IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 193 - 198
[39] An Input Data Set Compression Method for Improving the Training Ability of Neural Networks
Tusor, Balazs
Varkonyi-Koczy, Annamaria R.
Rudas, Imre J.
Klie, Gabor
Kocsis, Gabor
2012 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2012, : 1774 - 1779
[40] The influence of input data standardization method on prediction accuracy of artificial neural networks
Anysz, Hubert
Zbiciak, Artur
Ibadov, Nabi
XXV POLISH - RUSSIAN - SLOVAK SEMINAR -THEORETICAL FOUNDATION OF CIVIL ENGINEERING, 2016, 153 : 66 - 70

← 1 2 3 4 5 →