Differential and algebraic geometry of multilayer perceptrons

被引:0
|
作者
Amari, S [1 ]
Ozeki, T [1 ]
机构
[1] RIKEN, Brain Sci Inst, Wako, Saitama 3510198, Japan
来源
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2001年 / E84A卷 / 01期
关键词
information geometry; multilayer perceptron; singularities; learning; natural gradient;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Information geometry is applied to the manifold of neural networks called multilayer perceptrons. It is important to study a total family of networks as a geometrical manifold, because learning is represented by a trajectory in such a space. The manifold of perceptrons has a rich differential-geometrical structure represented by a Riemannian metric and singularities. An efficient learning method is proposed by using it. The parameter space of perceptrons includes a lot of algebraic singularities, which affect trajectories of learning. Such singularities are studied by using simple models. This poses an interesting problem of statistical inference and learning in hierarchical models including singularities.
引用
收藏
页码:31 / 38
页数:8
相关论文
共 50 条
  • [21] Efficiently learning multilayer perceptrons
    Bunzmann, C
    Biehl, M
    Urbanczik, R
    PHYSICAL REVIEW LETTERS, 2001, 86 (10) : 2166 - 2169
  • [22] An approach to encode Multilayer Perceptrons
    Korczak, J
    Blindauer, E
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 302 - 307
  • [23] MMLD Inference of Multilayer Perceptrons
    Makalic, Enes
    Allison, Lloyd
    ALGORITHMIC PROBABILITY AND FRIENDS: BAYESIAN PREDICTION AND ARTIFICIAL INTELLIGENCE, 2013, 7070 : 261 - 272
  • [24] ERROR SURFACES FOR MULTILAYER PERCEPTRONS
    HUSH, DR
    HORNE, B
    SALAS, JM
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (05): : 1152 - 1161
  • [25] Multilayer perceptrons and data compression
    Manger, Robert
    Puljic, Krunoslav
    COMPUTING AND INFORMATICS, 2007, 26 (01) : 45 - 62
  • [26] Representation and extrapolation in multilayer perceptrons
    Browne, A
    NEURAL COMPUTATION, 2002, 14 (07) : 1739 - 1754
  • [27] Clustering using multilayer perceptrons
    Charalampidis, Dimitrios
    Muldrey, Barry
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2009, 71 (12) : E2807 - E2813
  • [28] ON LANGEVIN UPDATING IN MULTILAYER PERCEPTRONS
    ROGNVALDSSON, T
    NEURAL COMPUTATION, 1994, 6 (05) : 916 - 926
  • [29] Active learning in multilayer perceptrons
    Fukumizu, K
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 295 - 301
  • [30] ON THE INITIALIZATION AND OPTIMIZATION OF MULTILAYER PERCEPTRONS
    WEYMAERE, N
    MARTENS, JP
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (05): : 738 - 751