Exploration of linear modelling techniques and their combination with multivariate adaptive regression splines to predict gastro-intestinal absorption of drugs

被引:34
|
作者
Deconinck, E.
Coomans, D.
Heyden, Y. Vander
机构
[1] Vrije Univ Brussel VIB, Dept Analyt Chem & Pharmaceut Technol, Inst Pharmaceut, B-1090 Brussels, Belgium
[2] James Cook Univ N Queensland, Stat & Intelligent Data Anal Grp, Townsville, Qld 4814, Australia
关键词
QSAR; drug absorption; in silico prediction; TMARS;
D O I
10.1016/j.jpba.2006.06.022
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In general, linear modelling techniques such as multiple linear regression (MLR), principal component regression (PCR) and partial least squares (PLS), are used to model QSAR data. This type of data can be very complex and linear modelling techniques often model only a limited part of the information captured in the data. In this study, it was tried to combine linear techniques with the flexible non-linear technique multivariate adaptive regression splines (MARS). Models were built using an MLR model, combined with either a stepwise procedure or a genetic algorithm for variable selection, a PCR model or a PLS model as starting points for the MARS algorithm. The descriptive and predictive power of the models was evaluated in a QSAR context and compared to the performances of the individual linear models and the single MARS model. In general, the combined methods resulted in significant improvements compared to the linear models and can be considered valuable techniques in modelling complex QSAR data. For the used data set the best model was obtained using a combination of PLS and MARS. This combination resulted in a model with a Pearson correlation coefficient of 0.90 and a cross-validation error, evaluated with 10-fold cross-validation of 9.9%, pointing at good descriptive and high predictive properties. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 3 条
  • [1] Prediction of gastro-intestinal absorption using multivariate adaptive regression splines
    Deconinck, E
    Xu, QS
    Put, R
    Coomans, D
    Massart, DL
    Heyden, YV
    JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2005, 39 (05) : 1021 - 1030
  • [2] Modelling energy consumption of the Jordanian transportation sector: the application of multivariate linear regression and adaptive neuro-fuzzy techniques
    Obaidat, Mazin
    Obeidat, Amr M.
    Al-Ghandoor, Ahmed
    Gharaibeh, Mohammad A.
    Almomani, Hesham A.
    INTERNATIONAL JOURNAL OF SUSTAINABLE ENERGY, 2019, 38 (09) : 814 - 820
  • [3] Boosted regression trees, multivariate adaptive regression splines and their two-step combinations with multiple linear regression or partial least squares to predict blood-brain barrier passage: A case study
    Deconinck, E.
    Zhang, M. H.
    Petitet, F.
    Dubus, E.
    Ijjaali, I.
    Coomans, D.
    Heyden, Y. Vander
    ANALYTICA CHIMICA ACTA, 2008, 609 (01) : 13 - 23