Coupling weight elimination with genetic algorithms to reduce network size and preserve generalization

被引:45
作者
Bebis, G [1 ]
Georgiopoulos, M [1 ]
Kasparis, T [1 ]
机构
[1] UNIV CENT FLORIDA, DEPT ELECT & COMP ENGN, ORLANDO, FL 32816 USA
关键词
neural networks; genetic algorithms; weight elimination; pruning;
D O I
10.1016/S0925-2312(97)00050-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent theoretical results support that decreasing the number of free parameters in a neural network (i.e., weights) can improve generalization. These results have triggered the development of many approaches which try to determine an ''appropriate'' network size for a given problem. The main goal has been to find a network size just large enough to capture the general class properties of the data. In some cases, however, network size is not reduced significantly or the reduction is satisfactory but generalization is affected. In this paper, we propose the coupling of genetic algorithms with weight elimination. Our objective is not only to significantly reduce network size, by pruning larger size networks, but also to preserve generalization, that is, to come up with pruned networks which generalize as good or even better than their unpruned counterparts. The innovation of our work relies on a fitness function which uses an adaptive parameter to encourage reproduction of networks having small size and good generalization. The proposed approach has been tested using both artificial and real databases demonstrating good performance.
引用
收藏
页码:167 / 194
页数:28
相关论文
共 38 条
[1]  
Ash T., 1989, Connection Science, V1, P365, DOI 10.1080/09540098908915647
[2]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[3]  
BEBIS G, 1996, INT C NEUR NETW ICNN, V2, P1115
[4]  
BEBIS G, 1990, P NEURONET INT S NEU, P33
[5]  
CAUDILL M, 1991, AI EXPERT MAR, P29
[6]  
Chauvin Y., 1989, ADV NEURAL INFORMATI, P519
[7]  
DEBIS G, 1994, IEEE POTENTIALS OCT, P27
[8]  
DEPENAU J, 1994, P WORLD C NEUR NETW, V3, P504
[9]  
Fahlman S., 1990, ADV NEURAL INFORMATI, V2, P524
[10]  
Frean M, 1990, NEURAL COMPUT, V2, P198