Controlled learning of pointwise nonlinearities in neural-network-like architectures

被引:0
|
作者
Unser, Michael [1 ]
Goujon, Alexis [1 ]
Ducotterd, Stanislas [1 ]
机构
[1] Ecole Polytech Fed Lausanne EPFL, Biomed Imaging Grp, Stn 17, CH-1015 Lausanne, Switzerland
关键词
LINEAR INVERSE PROBLEMS; CONVEX REGULARIZERS; IMAGE; RECONSTRUCTION; ALGORITHM; SPLINES; MODELS;
D O I
10.1016/j.acha.2025.101764
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We present a general variational framework for the training of freeform nonlinearities in layered computational architectures subject to some slope constraints. The regularization that we add to the traditional training loss penalizes the second-order total variation of each trainable activation. The slope constraints allow us to impose properties such as 1-Lipschitz stability, firm non-expansiveness, and monotonicity/invertibility. These properties are crucial to ensure the proper functioning of certain classes of signal-processing algorithms (e.g., plug-and-play schemes, unrolled proximal gradient, invertible flows). We prove that the global optimum of the stated constrained-optimization problem is achieved with nonlinearities that are adaptive nonuniform linear splines. We then show how to solve the resulting function-optimization problem numerically by representing the nonlinearities in a suitable (nonuniform) B-spline basis. Finally, we illustrate the use of our framework with the data-driven design of (weakly) convex regularizers for the denoising of images and the resolution of inverse problems.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Neural network Architectures and learning
    Wilamowski, BM
    2003 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2003, : TU1 - TU12
  • [2] Neural network with deep learning architectures
    Patel, Hima
    Thakkar, Amit
    Pandya, Mrudang
    Makwana, Kamlesh
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2018, 39 (01): : 31 - 38
  • [3] Bayesian Learning of Neural Network Architectures
    Dikov, Georgi
    Bayer, Justin
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 730 - 738
  • [4] Neural Network Architectures and Learning Algorithms
    Wilamowski, Bogdan M.
    IEEE INDUSTRIAL ELECTRONICS MAGAZINE, 2009, 3 (04) : 56 - 63
  • [5] A Survey on Recurrent Neural Network Architectures for Sequential Learning
    Prakash, B. Shiva
    Sanjeev, K., V
    Prakash, Ramesh
    Chandrasekaran, K.
    SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 57 - 66
  • [6] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    2000, Shenyang Inst Comput Technol, China (21):
  • [7] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2000, 21 (04): : 379 - 382
  • [8] Some new neural network architectures with improved learning schemes
    Sinha M.
    Kumar K.
    Kalra P.K.
    Soft Computing, 2000, 4 (4) : 214 - 223
  • [9] An Intrinsically Irreversible, Neural-network-like Approach to the Schrodinger Equation and some Results of Application to Drive Nuclear Synthesis Research Work
    Abundo, Ugo
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
  • [10] Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
    Parhi, Rahul
    Unser, Michael
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2025, 7 (01): : 110 - 135