Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity

被引:125
|
作者
Polishchuk, Pavel G. [1 ]
Muratov, Eugene N. [1 ,2 ]
Artemenko, Anatoly G. [1 ]
Kolumbin, Oleg G. [3 ]
Muratov, Nail N. [4 ]
Kuz'min, Victor E. [1 ]
机构
[1] AV Bogatsky Phys Chem Inst NAS Ukraine, Lab Theoret Chem, UA-65080 Odessa, Ukraine
[2] Univ N Carolina, Sch Pharm, Lab Mol Modeling, Chapel Hill, NC 27599 USA
[3] Pridnestrovskij State Univ, Dept Chem, MD-3300 Tiraspol, Moldova
[4] Odessa Natl Polytech Univ, Dept Chem Technol, UA-65000 Odessa, Ukraine
关键词
QUANTITATIVE STRUCTURE; VARIABLE SELECTION; SIMPLEX REPRESENTATION; APPLICABILITY DOMAIN; MODELS; PLS; NITROAROMATICS; DERIVATIVES; TECHNOLOGY; REGRESSION;
D O I
10.1021/ci900203n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This work is devoted to the application of the random forest approach to QSAR analysis of aquatic toxicity of chemical compounds tested on Tetrahymena pyriformis. The simplex representation of the molecular structure approach implemented in HiT QSAR Software was used for descriptors generation on a two-dimensional level. Adequate models based on simplex descriptors and the RF statistical approach were obtained on a modeling set of 644 compounds. Model predictivity was validated on two external test sets of 339 and 110 compounds. The high impact of lipophilicity and polarizability of investigated compounds on toxicity was determined. It was shown that RF models were tolerant for insertion of irrelevant descriptors as well as for randomization of some part of toxicity values that were representing a "noise". The fast procedure of optimization of the number of trees in the random forest has been proposed. The discussed RF model had comparable or better statistical characteristics than the corresponding PLS or KNN models.
引用
收藏
页码:2481 / 2488
页数:8
相关论文
共 50 条
  • [21] Development of QSAR Model for Subchronic Inhalation Toxicity Using Random Forest Regression Method
    Shin, Jae Hong
    Lee, Byeong Hun
    Lee, Sung Kwang
    BULLETIN OF THE KOREAN CHEMICAL SOCIETY, 2019, 40 (08) : 819 - 825
  • [22] Prediction of the Toxicity of Binary Mixtures by QSAR Approach Using the Hypothetical Descriptors
    Wang, Ting
    Tang, Lili
    Luan, Feng
    Cordeiro, M. Natalia D. S.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (11)
  • [23] QSAR and mechanism of action for aquatic toxicity of cationic surfactants
    Roberts, DW
    Costello, J
    QSAR & COMBINATORIAL SCIENCE, 2003, 22 (02): : 220 - 225
  • [24] QSAR assessment of aquatic toxicity potential of diverse agrochemicals
    Nath, A.
    Ojha, P. K.
    Roy, K.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2023, 34 (11) : 923 - 942
  • [25] Early Prediction of University Dropouts - A Random Forest Approach
    Behr, Andreas
    Giese, Marco
    Teguim, Herve D. K.
    Theune, Katja
    JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2020, 240 (06): : 743 - 789
  • [26] Prediction of Preeclampsia by Using Random Forest Approach.
    Xie, Fagen
    Zhuang, Zimin
    Fassett, Michael J.
    Getahun, Darios
    REPRODUCTIVE SCIENCES, 2019, 26 : 179A - 179A
  • [27] Random forest algorithm-based classification model of pesticide aquatic toxicity to fishes
    Yu, Xinliang
    Zeng, Qun
    AQUATIC TOXICOLOGY, 2022, 251
  • [28] Computer-aided prediction of toxicity with substructure pattern and random forest
    Cao, Dong-Sheng
    Yang, Yan-Ning
    Zhao, Jian-Chao
    Yan, Jun
    Liu, Shao
    Hu, Qian-Nan
    Xu, Qing-Song
    Liang, Yi-Zeng
    JOURNAL OF CHEMOMETRICS, 2012, 26 (01) : 7 - 15
  • [29] A transfer learning approach based on random forest with application to breast cancer prediction in underrepresented populations
    Gu, Tian
    Han, Yi
    Duan, Rui
    BIOCOMPUTING 2023, PSB 2023, 2023, : 186 - 197
  • [30] Machine learning model for random forest acute oral toxicity prediction
    Elsayad, A. M.
    Elsayad, K. A.
    Zeghid, M.
    Khan, A. N.
    Baareh, A. K. M.
    Sadiq, A.
    Mukhtar, S. A.
    Ali, H. F.
    Abd El-kade, S.
    GLOBAL JOURNAL OF ENVIRONMENTAL SCIENCE AND MANAGEMENT-GJESM, 2025, 11 (01): : 21 - 38