Information geometry of the EM and em algorithms for neural networks

Cited by: 177
Author
Amari, SI
Keywords
EM algorithm; information geometry; stochastic model of neural networks; learning; identification of neural network; e-projection; m-projection; hidden variable
DOI
10.1016/0893-6080(95)00003-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To realize an input-output relation given by noise-contaminated examples, it is effective to use a stochastic model of neural networks. When the model network includes hidden units whose activation values are neither specified nor observed, it is useful to estimate the hidden variables from the observed or specified input-output data based on the stochastic model. Two algorithms, the EM and em algorithms, have so far been proposed for this purpose. The EM algorithm is an iterative statistical technique that uses the conditional expectation, and the em algorithm is a geometrical one given by information geometry. The em algorithm iteratively minimizes the Kullback-Leibler divergence in the manifold of neural networks. These two algorithms are equivalent in most cases. The present paper gives a unified information geometrical framework for studying stochastic models of neural networks, by focusing on the EM and em algorithms, and proves a condition that guarantees their equivalence. Examples include: (1) stochastic multilayer perceptron, (2) mixtures of experts, and (3) normal mixture model.
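As an illustration of the statistical EM algorithm mentioned in the abstract, the following is a minimal sketch for the normal mixture model, the third example listed. It is not taken from the paper: the function names (`normal_pdf`, `log_likelihood`, `em_step`, `fit_mixture`), the univariate setting, and the small variance floor are all assumptions made for this sketch. The hidden variable is the unobserved component label of each data point. The E-step computes its conditional expectation given the data, and the M-step re-estimates the parameters from those expectations. The geometric em algorithm (alternating e- and m-projections) is not implemented here.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def log_likelihood(data, weights, means, variances):
    """Log-likelihood of the observed data under the mixture."""
    return sum(
        math.log(sum(w * normal_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)))
        for x in data
    )


def em_step(data, weights, means, variances):
    """One EM iteration for a univariate normal mixture."""
    k = len(weights)
    # E-step: conditional expectation of the hidden component label
    # (posterior responsibility of each component for each point).
    resp = []
    for x in data:
        joint = [weights[j] * normal_pdf(x, means[j], variances[j]) for j in range(k)]
        total = sum(joint)
        resp.append([p / total for p in joint])
    # M-step: weighted maximum-likelihood re-estimation of the parameters.
    n = len(data)
    new_w, new_m, new_v = [], [], []
    for j in range(k):
        nj = sum(r[j] for r in resp)
        mj = sum(r[j] * x for r, x in zip(resp, data)) / nj
        vj = sum(r[j] * (x - mj) ** 2 for r, x in zip(resp, data)) / nj
        new_w.append(nj / n)
        new_m.append(mj)
        # Variance floor: an assumption of this sketch to avoid degenerate components.
        new_v.append(max(vj, 1e-6))
    return new_w, new_m, new_v


def fit_mixture(data, weights, means, variances, n_iter=50):
    """Run n_iter EM iterations and record the log-likelihood after each one."""
    history = [log_likelihood(data, weights, means, variances)]
    for _ in range(n_iter):
        weights, means, variances = em_step(data, weights, means, variances)
        history.append(log_likelihood(data, weights, means, variances))
    return weights, means, variances, history
```

A hypothetical usage would be to draw samples from two normal distributions with `random.Random(0).gauss`, call `fit_mixture(data, [0.5, 0.5], [-1.0, 1.0], [1.0, 1.0])`, and inspect the returned `history`, which should be non-decreasing up to rounding as long as the variance floor is never reached.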
Pages: 1379-1408
Number of pages: 30