Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR

Cited by: 40
Authors
Racz, Anita [1]
Bajusz, David [2]
Heberger, Karoly [1]
Affiliations
[1] Hungarian Acad Sci, Plasma Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[2] Hungarian Acad Sci, Med Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
Keywords
analysis of variance; correlation; descriptor; QSAR; regression; sum of ranking differences; QSAR MODELS; DERIVATIVES; INHIBITION; VALIDATION; PREDICTION; RANKING
DOI
10.1002/minf.201800154
Chinese Library Classification number
R914 [Medicinal chemistry]
Subject classification code
100701
Abstract
QSAR/QSPR (quantitative structure-activity/property relationship) modeling has been a prevalent approach in various, overlapping sub-fields of computational, medicinal and environmental chemistry for decades. The generation and selection of molecular descriptors is an essential part of this process. In typical QSAR workflows, the starting pool of molecular descriptors is rationalized based on filtering out descriptors which are (i) constant throughout the whole dataset, or (ii) very strongly correlated to another descriptor. While the former is fairly straightforward, the latter involves a level of subjectivity when deciding what exactly is considered to be a strong correlation. Despite that, most QSAR modeling studies do not report on this step. In this study, we examine in detail the effect of various possible descriptor intercorrelation limits on the resulting QSAR models. Statistical comparisons are carried out based on four case studies from contemporary QSAR literature, using a combined methodology based on sum of ranking differences (SRD) and analysis of variance (ANOVA).
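The two preselection filters described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names `pearson_r` and `preselect`, the greedy keep-first rule, the default limit of 0.95, and the toy descriptor values are all assumptions made for illustration only.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length, non-constant sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def preselect(descriptors, limit=0.95):
    """Return the names of descriptors that survive both filters.

    descriptors: dict mapping descriptor name -> list of values (one per molecule).
    limit: hypothetical intercorrelation limit on |r|; the choice of this value
           is exactly the subjective step the abstract refers to.
    """
    kept = []
    for name, values in descriptors.items():
        # Filter (i): drop descriptors that are constant over the whole dataset.
        if max(values) == min(values):
            continue
        # Filter (ii): keep a descriptor only if its |r| with every descriptor
        # kept so far is below the limit (greedy rule, depends on column order).
        if all(abs(pearson_r(values, descriptors[k])) < limit for k in kept):
            kept.append(name)
    return kept


# Made-up toy data: five molecules, four hypothetical descriptors.
toy = {
    "d_const": [1.0, 1.0, 1.0, 1.0, 1.0],   # constant
    "d_a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "d_b": [2.1, 3.9, 6.2, 8.0, 9.9],       # almost proportional to d_a
    "d_c": [5.0, 1.0, 4.0, 2.0, 3.0],       # weakly correlated with d_a
}

kept = preselect(toy, limit=0.95)
print(kept)
```

Lowering `limit` removes more descriptors and raising it retains more, so the size of the descriptor pool passed on to model building depends directly on this setting.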
Pages: 6