Information geometry of the EM and em algorithms for neural networks

Cited by: 177
Author
Amari, SI
Keywords
EM algorithm; information geometry; stochastic model of neural networks; learning; identification of neural network; e-projection; m-projection; hidden variable;
DOI
10.1016/0893-6080(95)00003-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To realize an input-output relation given by noise-contaminated examples, it is effective to use a stochastic model of neural networks. When the model network includes hidden units whose activation values are neither specified nor observed, it is useful to estimate the hidden variables from the observed or specified input-output data based on the stochastic model. Two algorithms, the EM and em algorithms, have so far been proposed for this purpose. The EM algorithm is an iterative statistical technique that uses the conditional expectation, and the em algorithm is a geometrical one given by information geometry. The em algorithm iteratively minimizes the Kullback-Leibler divergence in the manifold of neural networks. These two algorithms are equivalent in most cases. The present paper gives a unified information-geometrical framework for studying stochastic models of neural networks, focusing on the EM and em algorithms, and proves a condition that guarantees their equivalence. Examples include: (1) the stochastic multilayer perceptron, (2) mixtures of experts, and (3) the normal mixture model.
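As a rough illustration of the EM iteration mentioned in the abstract, the sketch below fits a one-dimensional, two-component normal mixture, which corresponds to example (3). It is the standard textbook form of EM, not code from the paper, and it does not attempt the geometric em algorithm (alternating e- and m-projections). The function names `em_gaussian_mixture` and `make_demo_data`, the quartile-based initialisation, and the synthetic data are all assumptions made for this example.

```python
import numpy as np


def em_gaussian_mixture(x, n_iter=50):
    """Standard EM for a 1-D two-component normal mixture (illustrative sketch)."""
    # Crude initialisation (an assumption of this sketch): means at the quartiles.
    mu = np.percentile(x, [25.0, 75.0])
    var = np.array([x.var(), x.var()])
    pi = np.array([0.5, 0.5])
    log_liks = []
    for _ in range(n_iter):
        # E-step: conditional expectation of the hidden component label,
        # i.e. the posterior responsibility of each component for each point.
        dens = pi * np.exp(-0.5 * (x[:, None] - mu) ** 2 / var) / np.sqrt(2.0 * np.pi * var)
        total = dens.sum(axis=1, keepdims=True)
        resp = dens / total
        # Log-likelihood under the parameters used in this E-step.
        log_liks.append(float(np.log(total).sum()))
        # M-step: responsibility-weighted maximum-likelihood updates.
        nk = resp.sum(axis=0)
        pi = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk
    return pi, mu, var, log_liks


def make_demo_data(seed=0):
    """Synthetic data (hypothetical): 30% from N(-2, 1), 70% from N(3, 1)."""
    rng = np.random.default_rng(seed)
    return np.concatenate([rng.normal(-2.0, 1.0, 300), rng.normal(3.0, 1.0, 700)])


pi, mu, var, log_liks = em_gaussian_mixture(make_demo_data())
```

The component label of each data point plays the role of the hidden variable here: the E-step estimates it from the observed data, and the M-step refits the model given that estimate. The recorded `log_liks` should be non-decreasing across iterations, which is the usual sanity check for an EM implementation.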
Pages: 1379-1408
Page count: 30