Information geometry on hierarchy of probability distributions

Cited by: 270
Authors
Amari, S [1]
Affiliations
[1] RIKEN, Brain Sci Inst, Lab Math Neurosci, Wako, Saitama 3510198, Japan
Keywords
decomposition of entropy; e- and m-projections; extended Pythagoras theorem; higher order interactions; higher order Markov chain; information geometry; Kullback divergence
DOI
10.1109/18.930911
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
An exponential family or mixture family of probability distributions has a natural hierarchical structure. This paper gives an "orthogonal" decomposition of such a system based on information geometry. A typical example is the decomposition of stochastic dependency among a number of random variables. In general, they have a complex structure of dependencies. Pairwise dependency is easily represented by correlation, but it is more difficult to measure effects of pure triplewise or higher order interactions (dependencies) among these variables. Stochastic dependency is decomposed quantitatively into an "orthogonal" sum of pairwise, triplewise, and further higher order dependencies. This gives a new invariant decomposition of joint entropy. This problem is important for extracting intrinsic interactions in firing patterns of an ensemble of neurons and for estimating its functional connections. The orthogonal decomposition is given in a wide class of hierarchical structures including both exponential and mixture families. As an example, we decompose the dependency in a higher order Markov chain into a sum of those in various lower order Markov chains.
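The "orthogonal" sum described in the abstract can be illustrated numerically. The following is a minimal sketch, not the paper's own procedure: it assumes three binary variables with a strictly positive joint table, and it uses iterative proportional fitting (an assumption chosen here for illustration) to obtain the distribution that keeps all pairwise marginals but carries no triplewise interaction. The function names `independent_model`, `pairwise_model`, and `decompose_dependency` are hypothetical. Under these assumptions, the Kullback divergence from the joint distribution to the independent model should split into a pairwise part plus a triplewise part.

```python
import numpy as np


def kl(p, q):
    """Kullback divergence D[p : q] in nats; q must be positive where p is."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))


def independent_model(p):
    """Product of the three single-variable marginals of the joint table p."""
    p1 = p.sum(axis=(1, 2))
    p2 = p.sum(axis=(0, 2))
    p3 = p.sum(axis=(0, 1))
    return np.einsum("i,j,k->ijk", p1, p2, p3)


def pairwise_model(p, sweeps=500):
    """Fit all pairwise marginals of p by iterative proportional fitting.

    Starting from the uniform table means no third-order interaction term
    is ever introduced, so the limit has pairwise interactions only.
    Assumes p is strictly positive (illustrative restriction).
    """
    q = np.full(p.shape, 1.0 / p.size)
    for _ in range(sweeps):
        for axis in range(3):
            # Summing out one axis leaves the marginal of the other two.
            target = p.sum(axis=axis, keepdims=True)
            current = q.sum(axis=axis, keepdims=True)
            q = q * target / current
    return q


def decompose_dependency(p):
    """Return (total, pairwise, triplewise) dependency of p in nats."""
    p_ind = independent_model(p)
    p_pair = pairwise_model(p)
    total = kl(p, p_ind)          # all dependency in p
    pairwise = kl(p_pair, p_ind)  # part explained by pairwise interactions
    triplewise = kl(p, p_pair)    # remaining pure third-order part
    return total, pairwise, triplewise


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    p = rng.random((2, 2, 2)) + 0.1  # arbitrary positive table
    p /= p.sum()
    total, pairwise, triplewise = decompose_dependency(p)
    print("total     :", total)
    print("pairwise  :", pairwise)
    print("triplewise:", triplewise)
    print("sum       :", pairwise + triplewise)
```

In this sketch the total and the sum of the two parts agree up to the convergence error of the fitting loop, which is the Pythagorean relation the abstract alludes to, restricted to this small discrete case.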
Pages: 1701-1711
Number of pages: 11