Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR

被引:40
|
作者
Racz, Anita [1 ]
Bajusz, David [2 ]
Heberger, Karoly [1 ]
机构
[1] Hungarian Acad Sci, Plasma Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
[2] Hungarian Acad Sci, Med Chem Res Grp, Res Ctr Nat Sci, Magyar Tudosok Krt 2, H-1117 Budapest, Hungary
关键词
analysis of variance; correlation; descriptor; QSAR; regression; sum of ranking differences; QSAR MODELS; DERIVATIVES; INHIBITION; VALIDATION; PREDICTION; RANKING;
D O I
10.1002/minf.201800154
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
QSAR/QSPR (quantitative structure-activity/property relationship) modeling has been a prevalent approach in various, overlapping sub-fields of computational, medicinal and environmental chemistry for decades. The generation and selection of molecular descriptors is an essential part of this process. In typical QSAR workflows, the starting pool of molecular descriptors is rationalized based on filtering out descriptors which are (i) constant throughout the whole dataset, or (ii) very strongly correlated to another descriptor. While the former is fairly straightforward, the latter involves a level of subjectivity when deciding what exactly is considered to be a strong correlation. Despite that, most QSAR modeling studies do not report on this step. In this study, we examine in detail the effect of various possible descriptor intercorrelation limits on the resulting QSAR models. Statistical comparisons are carried out based on four case studies from contemporary QSAR literature, using a combined methodology based on sum of ranking differences (SRD) and analysis of variance (ANOVA).
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Additive inductive learning in QSAR/QSPR studies and molecular modeling
    II Baskin
    NI Zhokhova
    VA Palyulin
    NS Zefirov
    Chemistry Central Journal, 3 (Suppl 1)
  • [22] On QSPR Analysis of Molecular Descriptor and Thermodynamic Features of Narcotic Drugs
    Zhou, Hao
    Mahboob, Abid
    Rasheed, Muhammad Waheed
    Ovais, Ali
    Siddiqui, Muhammad Kamran
    Cheema, Imran Zulfiqar
    POLYCYCLIC AROMATIC COMPOUNDS, 2024, 44 (05) : 3079 - 3099
  • [23] Simplex representation of molecular structure as universal QSAR/QSPR tool
    Victor Kuz’min
    Anatoly Artemenko
    Luidmyla Ognichenko
    Alexander Hromov
    Anna Kosinskaya
    Sergij Stelmakh
    Zoe L. Sessions
    Eugene N. Muratov
    Structural Chemistry, 2021, 32 : 1365 - 1392
  • [24] QSAR/QSPR models to support fragrance ingredient molecular design
    Chekmarev, Dmitriy
    Kattas, Jeffrey
    Bikker, Jack
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [25] Simplex representation of molecular structure as universal QSAR/QSPR tool
    Kuz'min, Victor
    Artemenko, Anatoly
    Ognichenko, Luidmyla
    Hromov, Alexander
    Kosinskaya, Anna
    Stelmakh, Sergij
    Sessions, Zoe L.
    Muratov, Eugene N.
    STRUCTURAL CHEMISTRY, 2021, 32 (04) : 1365 - 1392
  • [26] Chemical Graph Theory for Property Modeling in QSAR and QSPR-Charming QSAR & QSPR
    Costa, Paulo C. S.
    Evangelista, Joel S.
    Leal, Igor
    Miranda, Paulo C. M. L.
    MATHEMATICS, 2021, 9 (01) : 1 - 19
  • [27] Molecular Descriptors, Structure Generation, and Inverse QSAR/QSPR Based on SELFIES
    Kaneko, Hiromasa
    ACS OMEGA, 2023, 8 (24): : 21781 - 21786
  • [28] Improving Molecular Design with Direct Inverse Analysis of QSAR/QSPR Model
    Shino, Yuto
    Kaneko, Hiromasa
    MOLECULAR INFORMATICS, 2025, 44 (01)
  • [29] QSAR/QSPR and proprietary data
    Jorgensen, William L.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) : 937 - 937
  • [30] The progress of QSAR/QSPR for surfactants
    Zhang, XY
    Zhu, HW
    Li, L
    Li, GZ
    Wang, ZW
    PROGRESS IN CHEMISTRY, 2003, 15 (05) : 351 - 360