Controlled learning of pointwise nonlinearities in neural-network-like architectures

被引：0

作者：

Unser, Michael ^{[1
]}

Goujon, Alexis ^{[1
]}

Ducotterd, Stanislas ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne EPFL, Biomed Imaging Grp, Stn 17, CH-1015 Lausanne, Switzerland

来源：

APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS | 2025年 / 77卷

关键词：

LINEAR INVERSE PROBLEMS; CONVEX REGULARIZERS; IMAGE; RECONSTRUCTION; ALGORITHM; SPLINES; MODELS;

D O I：

10.1016/j.acha.2025.101764

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

We present a general variational framework for the training of freeform nonlinearities in layered computational architectures subject to some slope constraints. The regularization that we add to the traditional training loss penalizes the second-order total variation of each trainable activation. The slope constraints allow us to impose properties such as 1-Lipschitz stability, firm non-expansiveness, and monotonicity/invertibility. These properties are crucial to ensure the proper functioning of certain classes of signal-processing algorithms (e.g., plug-and-play schemes, unrolled proximal gradient, invertible flows). We prove that the global optimum of the stated constrained-optimization problem is achieved with nonlinearities that are adaptive nonuniform linear splines. We then show how to solve the resulting function-optimization problem numerically by representing the nonlinearities in a suitable (nonuniform) B-spline basis. Finally, we illustrate the use of our framework with the data-driven design of (weakly) convex regularizers for the denoising of images and the resolution of inverse problems.

引用

页数：19

共 50 条

[1] Neural network Architectures and learning
Wilamowski, BM
2003 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2003, : TU1 - TU12
[2] Neural network with deep learning architectures
Patel, Hima
Thakkar, Amit
Pandya, Mrudang
Makwana, Kamlesh
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2018, 39 (01): : 31 - 38
[3] Bayesian Learning of Neural Network Architectures
Dikov, Georgi
Bayer, Justin
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 730 - 738
[4] Neural Network Architectures and Learning Algorithms
Wilamowski, Bogdan M.
IEEE INDUSTRIAL ELECTRONICS MAGAZINE, 2009, 3 (04) : 56 - 63
[5] A Survey on Recurrent Neural Network Architectures for Sequential Learning
Prakash, B. Shiva
Sanjeev, K., V
Prakash, Ramesh
Chandrasekaran, K.
SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 57 - 66
[6] Learning the architectures and parameters of RBF neural network based on MDL
Liu, Meiqin
Chen, Jida
Cai, Zixing
2000, Shenyang Inst Comput Technol, China (21):
[7] Learning the architectures and parameters of RBF neural network based on MDL
Liu, Meiqin
Chen, Jida
Cai, Zixing
Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2000, 21 (04): : 379 - 382
[8] Some new neural network architectures with improved learning schemes
Sinha M.
Kumar K.
Kalra P.K.
Soft Computing, 2000, 4 (4) : 214 - 223
[9] An Intrinsically Irreversible, Neural-network-like Approach to the Schrodinger Equation and some Results of Application to Drive Nuclear Synthesis Research Work
Abundo, Ugo
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
[10] Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
Parhi, Rahul
Unser, Michael
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2025, 7 (01): : 110 - 135

← 1 2 3 4 5 →