Solving the linear interval tolerance problem for weight initialization of neural networks

被引:29
|
作者
Adam, S. P. [1 ,2 ]
Karras, D. A. [3 ]
Magoulas, G. D. [4 ]
Vrahatis, M. N. [1 ]
机构
[1] Univ Patras, Dept Math, Computat Intelligence Lab, GR-26110 Patras, Greece
[2] Technol Educ Inst Epirus, Dept Comp Engn, Arta 47100, Greece
[3] Technol Educ Inst Sterea Hellas, Dept Automat, Psahna 34400, Evia, Greece
[4] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Neural networks; Weight initialization; Interval analysis; Linear interval tolerance problem; FEEDFORWARD NETWORKS; STATISTICAL TESTS; TRAINING SPEED; HIGH-DIMENSION; BACKPROPAGATION; ALGORITHM; INTELLIGENCE;
D O I
10.1016/j.neunet.2014.02.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining good initial conditions for an algorithm used to train a neural network is considered a parameter estimation problem dealing with uncertainty about the initial weights. Interval analysis approaches model uncertainty in parameter estimation problems using intervals and formulating tolerance problems. Solving a tolerance problem is defining lower and upper bounds of the intervals so that the system functionality is guaranteed within predefined limits. The aim of this paper is to show how the problem of determining the initial weight intervals of a neural network can be defined in terms of solving a linear interval tolerance problem. The proposed linear interval tolerance approach copes with uncertainty about the initial weights without any previous knowledge or specific assumptions on the input data as required by approaches such as fuzzy sets or rough sets. The proposed method is tested on a number of well known benchmarks for neural networks trained with the back-propagation family of algorithms. Its efficiency is evaluated with regards to standard performance measures and the results obtained are compared against results of a number of well known and established initialization methods. These results provide credible evidence that the proposed method outperforms classical weight initialization methods. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:17 / 37
页数:21
相关论文
共 50 条
  • [31] A Comparison of Metaheurisitics for the Problem of Solving Parametric Interval Linear Systems
    Skalna, Iwona
    Duda, Jerzy
    NUMERICAL METHODS AND APPLICATIONS, 2011, 6046 : 305 - 312
  • [32] A new class of interval projection neural networks for solving interval quadratic program
    Ding, Ke
    Huang, Nan-Jing
    CHAOS SOLITONS & FRACTALS, 2008, 35 (04) : 718 - 725
  • [33] Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks
    Magen, Roey
    Shamir, Ohad
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [34] Weight Initialization Possibilities for Feedforward Neural Network with Linear Saturated Activation Functions
    Dolezel, Petr
    Skrabanek, Pavel
    Gago, Lumir
    IFAC PAPERSONLINE, 2016, 49 (25): : 49 - 54
  • [35] Fault tolerance problem of the feedforward neural networks
    Zhang, L. (bhzhao@ustc.edu.cn), 1693, Chinese Academy of Sciences (12):
  • [36] Solving interval linear programming problem using generalized interval lu decomposition method
    Nirmala, K.
    Nirmala, T.
    Ganesan, K.
    3RD INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING (ICAME 2020), PTS 1-6, 2020, 912
  • [37] Interval methods for solving Cellular Neural Networks (CNNs) equations
    Mladenov, VM
    Kolev, LV
    ICECS 96 - PROCEEDINGS OF THE THIRD IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2, 1996, : 417 - 420
  • [38] Applications of the general nonlinear neural networks in solving the inverse optimal value problem with linear constraints
    Wu, Huaiqin
    Wang, Kewang
    Li, Ning
    Wu, Chongyang
    Guo, Qiangqiang
    Xu, Guohua
    Information Technology Journal, 2012, 11 (06) : 713 - 718
  • [39] Mutual information based weight initialization method for sigmoidal feedforward neural networks
    Qiao, Junfei
    Li, Sanyi
    Li, Wenjing
    NEUROCOMPUTING, 2016, 207 : 676 - 683
  • [40] Domain adaptation and weight initialization of neural networks for diagnosing interstitial lung diseases
    Thorat, Onkar
    Salvi, Siddharth
    Dedhia, Shrey
    Bhadane, Chetashri
    Dongre, Deepika
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (05) : 1535 - 1547