Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR

被引:40
|
作者
Racz, Anita [1 ]
Bajusz, David [2 ]
Heberger, Karoly [1 ]
机构
[1] Hungarian Acad Sci, Plasma Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[2] Hungarian Acad Sci, Med Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
关键词
analysis of variance; correlation; descriptor; QSAR; regression; sum of ranking differences; QSAR MODELS; DERIVATIVES; INHIBITION; VALIDATION; PREDICTION; RANKING;
D O I
10.1002/minf.201800154
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
QSAR/QSPR (quantitative structure-activity/property relationship) modeling has been a prevalent approach in various, overlapping sub-fields of computational, medicinal and environmental chemistry for decades. The generation and selection of molecular descriptors is an essential part of this process. In typical QSAR workflows, the starting pool of molecular descriptors is rationalized based on filtering out descriptors which are (i) constant throughout the whole dataset, or (ii) very strongly correlated to another descriptor. While the former is fairly straightforward, the latter involves a level of subjectivity when deciding what exactly is considered to be a strong correlation. Despite that, most QSAR modeling studies do not report on this step. In this study, we examine in detail the effect of various possible descriptor intercorrelation limits on the resulting QSAR models. Statistical comparisons are carried out based on four case studies from contemporary QSAR literature, using a combined methodology based on sum of ranking differences (SRD) and analysis of variance (ANOVA).
引用
收藏
页数:6
相关论文
共 50 条
  • [41] A Lu index for QSAR/QSPR studies
    Lu, CH
    Guo, WM
    Hu, XF
    Wang, Y
    Yin, CS
    CHEMICAL PHYSICS LETTERS, 2006, 417 (1-3) : 11 - 15
  • [42] QSPR/QSAR computations by PRECLAV software
    Laszlo, T
    REVISTA DE CHIMIE, 2005, 56 (06): : 639 - 648
  • [43] TOWARD BIG DATA IN QSAR/QSPR
    Duprat, A.
    Ploix, J. L.
    Dioury, F.
    Dreyfus, G.
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [44] Topological Indices in QSAR and QSPR - Foreword
    Devillers, J
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2001, 12 (1-2) : I - I
  • [45] Research of QSPR/QSAR for Ionic Liquids
    Zhao Yongsheng
    Zhang Xiangping
    Zhao Jihong
    Zhang Hongzhong
    Kang Xuejing
    Dong Feng
    PROGRESS IN CHEMISTRY, 2012, 24 (07) : 1236 - 1244
  • [46] Current Challenges in QSAR/QSPR Analysis
    Putz, Mihai V.
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2014, 10 (02) : 97 - 98
  • [47] Theory and applications of the integrated molecular transform and the normalized molecular moment structure descriptors: QSAR and QSPR paradigms
    Molnar, SP
    King, JW
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2001, 85 (06) : 662 - 675
  • [48] TOPOLOGICAL INDEXES BASED ON WEIGHTS OF THE MOLECULAR GRAPH VERTICES FOR INVESTIGATIONS IN THE FIELD OF QSAR AND QSPR
    PETELIN, DY
    PALYULIN, VA
    ZEFIROV, NS
    DOKLADY AKADEMII NAUK, 1992, 324 (05) : 1019 - 1022
  • [49] QSAR and QSPR molecular descriptors computed from the resistance distance and electrical conductance matrices
    Ivanciuc, O
    ACH-MODELS IN CHEMISTRY, 2000, 137 (5-6): : 607 - 631
  • [50] The use of electron density-derived TAE molecular descriptors in QSAR and QSPR.
    Breneman, CM
    Martinov, M
    Whitehead, C
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1998, 215 : U520 - U520