Advances in the Replacement and Enhanced Replacement Method in QSAR and QSPR Theories

被引:60
|
作者
Mercader, Andrew G. [1 ,2 ]
Duchowicz, Pablo R. [1 ]
Fernandez, Francisco M. [1 ]
Castro, Eduardo A. [1 ]
机构
[1] UNLP, INIFTA, CCT La Plata CONICET, Inst Invest Fisicoquim Teor & Aplicadas, RA-1900 La Plata, Argentina
[2] Univ Buenos Aires, Fac Farm & Bioquim, PRALIB UBA CONICET, RA-1113 Buenos Aires, DF, Argentina
关键词
MOLECULAR DESCRIPTORS; VARIABLE SELECTION; OPTIMIZATION; ALGORITHM; PREDICTION; SEARCH; OXYGEN;
D O I
10.1021/ci200079b
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The selection of an optimal set of molecular descriptors from a much greater pool of such regression variables is a crucial step in the development of QSAR and QSPR models. The aim of this work is to further improve this important selection process. For this reason three different alternatives for the initial steps of our recently developed enhanced replacement method (ERM) and replacement method (RM) are proposed. These approaches had previously proven to yield near optimal results with a much smaller number of linear regressions than the full search. The algorithms were tested on four different experimental data sets, formed by collections of 116, 200, 78, and 100 experimental records from different compounds and 1268, 1338, 1187, and 1306 molecular descriptors, respectively. The comparisons showed that one of the new alternatives further improves the ERM, which has shown to be superior to genetic algorithms for the selection of an optimal set of molecular descriptors from a much greater pool. The new proposed alternative also improves the simpler and the lower computational demand algorithm RM.
引用
收藏
页码:1575 / 1581
页数:7
相关论文
共 50 条