Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR

Cited by: 40
Authors
Racz, Anita [1]
Bajusz, David [2]
Heberger, Karoly [1]
Affiliations
[1] Hungarian Acad Sci, Plasma Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[2] Hungarian Acad Sci, Med Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
Keywords
analysis of variance; correlation; descriptor; QSAR; regression; sum of ranking differences; QSAR models; derivatives; inhibition; validation; prediction; ranking
DOI
10.1002/minf.201800154
Chinese Library Classification (CLC) number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
QSAR/QSPR (quantitative structure-activity/property relationship) modeling has been a prevalent approach in various overlapping subfields of computational, medicinal, and environmental chemistry for decades. The generation and selection of molecular descriptors are an essential part of this process. In typical QSAR workflows, the starting pool of molecular descriptors is reduced by filtering out descriptors that are (i) constant throughout the whole dataset or (ii) very strongly correlated with another descriptor. While the former is fairly straightforward, the latter involves a degree of subjectivity in deciding what exactly counts as a strong correlation. Despite this, most QSAR modeling studies do not report on this step. In this study, we examine in detail the effect of various possible descriptor intercorrelation limits on the resulting QSAR models. Statistical comparisons are carried out on four case studies from contemporary QSAR literature, using a combined methodology based on sum of ranking differences (SRD) and analysis of variance (ANOVA).
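The two preselection filters described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the function names, the toy data, the greedy keep-first order, and the use of the absolute Pearson correlation coefficient are all assumptions made for illustration, and the `limit` values are arbitrary examples of possible intercorrelation limits.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def preselect_descriptors(descriptors, limit=0.95):
    """Return the names of the descriptors kept after both filters.

    descriptors: dict mapping a descriptor name to its values (one per molecule).
    limit: hypothetical intercorrelation limit on |r|; choosing it is the
           subjective step the abstract refers to.
    """
    # Filter (i): drop descriptors that are constant over the whole dataset.
    varying = {name: vals for name, vals in descriptors.items()
               if max(vals) != min(vals)}

    # Filter (ii): greedy pass in input order (an assumption). A descriptor is
    # dropped when |r| with any already-kept descriptor exceeds the limit.
    kept = []
    for name, vals in varying.items():
        if all(abs(pearson_r(vals, varying[k])) <= limit for k in kept):
            kept.append(name)
    return kept


# Toy descriptor pool (made-up numbers, five molecules).
toy = {
    "const": [1.0, 1.0, 1.0, 1.0, 1.0],
    "d1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "d2": [2.1, 3.9, 6.2, 8.0, 9.9],  # nearly collinear with d1
    "d3": [5.0, 1.0, 4.0, 2.0, 3.0],
}

for limit in (0.80, 0.95, 0.9999):
    print(limit, preselect_descriptors(toy, limit))
```

In this toy pool, `d2` is removed at limits of 0.80 and 0.95 but survives at 0.9999, which shows how the chosen limit changes the descriptor set passed on to model building. Which descriptor of a correlated pair is kept depends on the input order here; other tie-breaking rules are equally possible.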
Pages: 6