Predictability, complexity, and learning

被引:318
作者
Bialek, W
Nemenman, I
Tishby, N
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
[2] Princeton Univ, Dept Phys, Princeton, NJ 08544 USA
[3] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel
[4] Hebrew Univ Jerusalem, Ctr Neural Computat, IL-91904 Jerusalem, Israel
关键词
D O I
10.1162/089976601753195969
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We define predictive information I-pred(T) as the mutual information between the past and the future of a time series. Three qualitatively different behaviors are found in the limit of large observation times T:I-pred(T) can remain finite, grow logarithmically, or grow as a fractional power law. If the time series allows us to learn a model with a finite number of parameters, then I-pred(T) grows logarithmically with a coefficient that counts the dimensionality of the model space. In contrast, power-law growth is associated, for example, with the learning of infinite parameter (or non-parametric) models such as continuous functions with smoothness constraints. There are connections between the predictive information and measures of complexity that have been defined both in learning theory and the analysis of physical systems through statistical mechanics and dynamical systems theory. Furthermore, in the same way that entropy provides the unique measure of available information consistent with some simple and plausible conditions, we argue that the divergent part of I-pred(T) provides the unique measure for the complexity of dynamics underlying a time series. Finally, we discuss how these ideas may be useful in problems in physics, statistics, and biology.
引用
收藏
页码:2409 / 2463
页数:55
相关论文
共 108 条
[1]   THE ANALYSIS OF OBSERVED CHAOTIC DATA IN PHYSICAL SYSTEMS [J].
ABARBANEL, HDI ;
BROWN, R ;
SIDOROWICH, JJ ;
TSIMRING, LS .
REVIEWS OF MODERN PHYSICS, 1993, 65 (04) :1331-1392
[2]   Physical complexity of symbolic sequences [J].
Adami, C ;
Cerf, NJ .
PHYSICA D, 2000, 137 (1-2) :62-69
[3]   Field theoretical analysis of on-line learning of probability distributions [J].
Aida, T .
PHYSICAL REVIEW LETTERS, 1999, 83 (17) :3554-3557
[4]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[5]  
AKAIKE H, 1974, 2 INT S INF THEOR BU
[6]  
[Anonymous], P S MECH THOUGHT PRO
[7]  
[Anonymous], 2018, TIME SERIES PREDICTI
[8]  
[Anonymous], 1985, EMERGING SYNTHESES S
[9]  
[Anonymous], COMPUTATIONAL MODELS
[10]  
ATICK JJ, 1992, PRINCETON LECTURES ON BIOPHYSICS, P223