A comparison of methods for modeling quantitative structure-activity relationships

被引:195
|
作者
Sutherland, JJ
O'Brien, LA
Weaver, DF [1 ]
机构
[1] Dalhousie Univ, Sch Biomed Engn, Halifax, NS B3H 4J3, Canada
[2] Queens Univ, Dept Chem & Pathol, Kingston, ON K7L 3N6, Canada
[3] Dalhousie Univ, Dept Med Neurol, Halifax, NS B3H 4J3, Canada
[4] Dalhousie Univ, Dept Chem, Halifax, NS B3H 4J3, Canada
关键词
D O I
10.1021/jm0497141
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A large number of methods are available for modeling quantitative structure-activity relationships (QSAR). We examine the predictive accuracy of several methods applied to data sets of inhibitors for angiotensin converting enzyme, acetylcholinesterase, benzodiazepine receptor, cyclooxygenase-2, dihydrofolate reductase, glycogen phosphorylase b, thermolysin, and thrombin. Descriptors calculated with CoMFA, CoMSIA, EVA, HQSAR, and traditional 2D and 2.5D descriptors were used for developing models with partial least squares (PLS). In addition, the genetic function approximation algorithm, genetic PLS, and back-propagation neural networks were used for deriving models from 2.5D descriptors (i.e., 2D descriptors and 3D descriptors calculated from CORINA structures and Gasteiger-Marsili charges). Predictive accuracy was assessed using designed test sets. It was found that HQSAR generally performs as well as CoMFA and CoMSIA; other descriptor sets performed less well. When 2.5D descriptors were used, only neural network ensembles were found to be similarly or more predictive than PLS models. In addition, we show that many cross-validation procedures yield similar estimates of the interpolative accuracy of methods. However, the lack of correspondence between cross-validated and test set predictive accuracy for four sets underscores the benefit of using designed test sets.
引用
收藏
页码:5541 / 5554
页数:14
相关论文
共 50 条