Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity

被引:125
|
作者
Polishchuk, Pavel G. [1 ]
Muratov, Eugene N. [1 ,2 ]
Artemenko, Anatoly G. [1 ]
Kolumbin, Oleg G. [3 ]
Muratov, Nail N. [4 ]
Kuz'min, Victor E. [1 ]
机构
[1] AV Bogatsky Phys Chem Inst NAS Ukraine, Lab Theoret Chem, UA-65080 Odessa, Ukraine
[2] Univ N Carolina, Sch Pharm, Lab Mol Modeling, Chapel Hill, NC 27599 USA
[3] Pridnestrovskij State Univ, Dept Chem, MD-3300 Tiraspol, Moldova
[4] Odessa Natl Polytech Univ, Dept Chem Technol, UA-65000 Odessa, Ukraine
关键词
QUANTITATIVE STRUCTURE; VARIABLE SELECTION; SIMPLEX REPRESENTATION; APPLICABILITY DOMAIN; MODELS; PLS; NITROAROMATICS; DERIVATIVES; TECHNOLOGY; REGRESSION;
D O I
10.1021/ci900203n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This work is devoted to the application of the random forest approach to QSAR analysis of aquatic toxicity of chemical compounds tested on Tetrahymena pyriformis. The simplex representation of the molecular structure approach implemented in HiT QSAR Software was used for descriptors generation on a two-dimensional level. Adequate models based on simplex descriptors and the RF statistical approach were obtained on a modeling set of 644 compounds. Model predictivity was validated on two external test sets of 339 and 110 compounds. The high impact of lipophilicity and polarizability of investigated compounds on toxicity was determined. It was shown that RF models were tolerant for insertion of irrelevant descriptors as well as for randomization of some part of toxicity values that were representing a "noise". The fast procedure of optimization of the number of trees in the random forest has been proposed. The discussed RF model had comparable or better statistical characteristics than the corresponding PLS or KNN models.
引用
收藏
页码:2481 / 2488
页数:8
相关论文
共 50 条
  • [1] QSAR IN TOXICOLOGY .1. PREDICTION OF AQUATIC TOXICITY
    CRONIN, MTD
    DEARDEN, JC
    QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1995, 14 (01): : 1 - 7
  • [2] Prediction of Aquatic Toxicity Mode of Action Using Linear Discriminant and Random Forest Models
    Martin, Todd M.
    Grulke, Christopher M.
    Young, Douglas M.
    Russom, Christine L.
    Wang, Nina Y.
    Jackson, Crystal R.
    Barron, Mace G.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (09) : 2229 - 2239
  • [3] Prediction of aquatic toxicity mode of action using linear discriminant and random forest models
    Martin, Todd M.
    Grulke, Christopher M.
    Young, Douglas M.
    Russom, Christine L.
    Wang, Nina
    Jackson, Crystal R.
    Barron, Mace G.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
  • [4] A QSAR APPROACH FOR ESTIMATING THE AQUATIC TOXICITY OF SOFT ELECTROPHILES [QSAR FOR SOFT ELECTROPHILES]
    VEITH, GD
    MEKENYAN, OG
    QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1993, 12 (04): : 349 - 356
  • [5] In Silico Prediction of the Toxicity of Nitroaromatic Compounds: Application of Ensemble Learning QSAR Approach
    Daghighi, Amirreza
    Casanola-Martin, Gerardo M.
    Timmerman, Troy
    Milenkovic, Dejan
    Lucic, Bono
    Rasulev, Bakhtiyor
    TOXICS, 2022, 10 (12)
  • [6] Prediction of Toxicity of Nanomaterials Using QSAR Approach
    Singh, Dilpreet
    Chawla, Pooja A.
    CURRENT ANALYTICAL CHEMISTRY, 2023, 19 (06) : 436 - 439
  • [7] Prediction of aquatic toxicity of chemical mixtures by the QSAR approach using 2D structural descriptors
    Chatterjee, Mainak
    Roy, Kunal
    JOURNAL OF HAZARDOUS MATERIALS, 2021, 408
  • [8] PREDICTION OF AQUATIC TOXICITY USING QSAR TOOLBOX: AN IN SILICO APPROACH FOR CHEMICAL HAZARD ASSESSMENT IN HYDRAULIC FRACTURE
    Ensuncho, Adolfo E.
    Ramirez, Diana B.
    Lopez, Jesus M.
    QUIMICA NOVA, 2025, 48 (03):
  • [9] A QSAR for baseline toxicity:: Validation, domain of application, and prediction
    Öberg, T
    CHEMICAL RESEARCH IN TOXICOLOGY, 2004, 17 (12) : 1630 - 1637
  • [10] QSAR ISSUES IN AQUATIC TOXICITY OF SURFACTANTS
    ROBERTS, DW
    SCIENCE OF THE TOTAL ENVIRONMENT, 1991, 109 : 557 - 568