A statistical learning theory approach of bloat

Cited: 0
Authors
Gelly, Sylvain [1]
Teytaud, Olivier [1]
Bredeche, Nicolas [1]
Schoenauer, Marc [1]
Affiliations
[1] Univ Paris 11, INRIA Futurs, Equipe TAO, LRI, F-91405 Orsay, France
Keywords
algorithms; reliability; theory; code bloat; code growth; Genetic Programming; Statistical Learning Theory
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Code bloat, the excessive increase of code size, is an important issue in Genetic Programming (GP). This paper proposes a theoretical analysis of code bloat in the framework of symbolic regression in GP, from the viewpoint of Statistical Learning Theory, a well-grounded mathematical toolbox for Machine Learning. Two kinds of bloat must be distinguished in that context, depending on whether or not the target function lies in the search space. Important mathematical results are then proved using classical results from Statistical Learning. Namely, the Vapnik-Chervonenkis dimension of programs is computed, and further results from Statistical Learning make it possible to prove that a parsimonious fitness ensures Universal Consistency: the solution minimizing the empirical error converges to the best possible error as the number of examples goes to infinity. However, it is proved that the standard method of choosing a maximal program size as a function of the number of examples may still produce programs whose size grows without bound as their accuracy improves; a more sophisticated modification of the fitness is proposed that provably avoids unnecessary bloat while nevertheless preserving Universal Consistency.
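The parsimonious fitness discussed in the abstract can be illustrated with a minimal sketch: empirical error plus a size penalty whose weight shrinks with the number of examples. The specific penalty schedule `c * size / sqrt(n)` below is a hypothetical illustration, not the exact modification proposed in the paper; `penalized_fitness` and the candidate programs are likewise assumptions made for this example.

```python
import math

def penalized_fitness(program, size, data, c=1.0):
    """Empirical mean squared error plus a parsimony penalty.

    The penalty c * size / sqrt(n) vanishes as the sample count n
    grows, so asymptotically the empirical error dominates, while at
    any finite n two equally accurate programs are ranked by size.
    """
    n = len(data)
    mse = sum((program(x) - y) ** 2 for x, y in data) / n
    return mse + c * size / math.sqrt(n)

# Two candidate programs for the target f(x) = x^2, with their
# (hypothetical) expression-tree sizes:
small = (lambda x: x * x, 3)             # exact, 3 nodes
big = (lambda x: x * x + 0.0 * x, 7)     # equally exact, but bloated

data = [(x / 10.0, (x / 10.0) ** 2) for x in range(1, 21)]
f_small = penalized_fitness(small[0], small[1], data)
f_big = penalized_fitness(big[0], big[1], data)
print(f_small < f_big)  # the penalty prefers the smaller program
```

Both programs have zero empirical error on this data, so the ranking is decided entirely by the size term; this is the sense in which a parsimony pressure discourages unnecessary bloat without changing which accuracy is ultimately reachable.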
Pages: 1783-1784 (2 pages)