A constructive design method for two-layer perceptrons and its application to the design of modular neural networks

被引:2
|
作者
Moon, YJ
Oh, SY
机构
[1] Department of Electrical Engineering, Pohang Univ. of Sci. and Technology, Pohang
[2] Pusan University, Pusan
[3] Case Western Reserve University, Cleveland, OH
[4] Korea Atom. Ener. Research Institute, Seoul
[5] Dept. of Elec. Eng. and Comp. Sci., University of Illinois, Chicago, IL
[6] Department of Electrical Engineering, University of Florida, Gainesville, FL
[7] Department of Electrical Engineering, Pohang Univ. of Sci. and Technology, Pohang
关键词
constructive design method (CDM); modular neural network (MNN); feature vector;
D O I
10.1111/j.1468-0394.1996.tb00118.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A multilayer perceptron is known to be capable of approximating any smooth function to any desired accuracy if it has a sufficient number of hidden neurons. But its training, based on the gradient method, is usually a time consuming procedure that may converge toward a local minimum, and furthermore its performance is greatly influenced by the number of hidden neurons and their initial weights. Usually these crucial parameters are determined based on the trial and error procedure, requiring much experience on the designer's part. In this paper, a constructive design method (CDM) has been propose for a two-layer perceptron that can approximate a class of smooth functions whose feature vector classes are linearly separable. Based on the analysis of a given data set sampled from the target function, feature vectors that can characterize the function 'well' are extracted and used to determine the number of hidden neurons and the initial weights of the network. But when the classes of the feature vectors are not linearly separable, the network may not be trained easily, mainly due to the interference among the hyperplanes generated by hidden neurons. Next, to compensate for this interference, a refined version of the modular neural network (MNN) has been proposed where each network module is created by CDM. After the input space has been partitioned into many local regions, a two-layer perceptron constructed by CDM is assigned to each local region. By doing this, the feature vector classes are more likely to become linearly separable in each local region and as a result, the function may be approximated with greatly improved accuracy by MNN. An example simulation illustrates the improvements in learning speed using a smaller number of neurons.
引用
收藏
页码:183 / 194
页数:12
相关论文
共 50 条
  • [1] Constructive design method for two-layer perceptrons and its application to the design of modular neural networks
    Pohang Univ of Science and, Technology, Pohang, Korea, Republic of
    Expert Syst, 3 (183-194):
  • [2] Constructive neural networks and their application to ship multidisciplinary design optimization
    Mechanical and Aerospace Engineering, California State University, Long Beach, CA, United States
    J Ship Res, 2007, 4 (297-312):
  • [3] Constructive neural networks and their application to ship multidisciplinary design optimization
    Besnard, Eric
    Schmitz, Adeline
    Hefazi, Hamid
    Shinde, Rahul
    JOURNAL OF SHIP RESEARCH, 2007, 51 (04): : 297 - 312
  • [4] Design techniques of two-layer architectures for WDM optical networks
    Chiaraluce, F. (chifra@popcsi.unian.it), 1600, John Wiley and Sons Ltd (14):
  • [5] Tabu search algorithm for two-layer networks design problem
    Lenarski, Krzysztof
    Czajka, Bartosz
    Pozniak-Koszalka, Iwona
    Kasprzak, Andrzej
    ICSENG 2008: INTERNATIONAL CONFERENCE ON SYSTEMS ENGINEERING, 2008, : 319 - 324
  • [6] Design techniques of two-layer architectures for WDM optical networks
    Borella, A
    Cancellieri, G
    Chiaraluce, F
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2001, 14 (02) : 171 - 188
  • [7] An online gradient method with momentum for two-layer feedforward neural networks
    Zhang, Naimin
    APPLIED MATHEMATICS AND COMPUTATION, 2009, 212 (02) : 488 - 498
  • [8] Convergence of gradient method with momentum for two-layer feedforward neural networks
    Zhang, NM
    Wu, W
    Zheng, GF
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (02): : 522 - 525
  • [9] Plasticity of two-layer fast neural networks
    Alexeev, AA
    Dorogov, AY
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 1999, 38 (05) : 786 - 791
  • [10] Compressing neural networks with two-layer decoupling
    De Jonghe, Joppe
    Usevich, Konstantin
    Dreesen, Philippe
    Ishteva, Mariya
    2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 226 - 230