Carcinogenicity Prediction of Noncongeneric Chemicals by a Support Vector Machine

被引:25
|
作者
Zhong, Min [1 ]
Nie, Xianglei [1 ]
Yan, Aixia [1 ]
Yuan, Qipeng [1 ]
机构
[1] Beijing Univ Chem Technol, Dept Pharmaceut Engn, State Key Lab Chem Resource Engn, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
ORBITAL ELECTRONEGATIVITY; QSAR;
D O I
10.1021/tx4000182
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The ability to identify carcinogenic compounds is of fundamental importance to the safe application of chemicals. In this study, we generated an array of in silk models allowing the classification of compounds into carcinogenic and noncarcinogenic agents based on a data set of 852 noncongeneric chemicals collected from the Carcinogenic Potency Database (CPDBAS). Twenty-four molecular descriptors were selected by Pearson correlation, F-score, and stepwise regression analysis. These descriptors cover a range of physicochemical properties, including electrophilicity, geometry, molecular weight, size, and solubility. The descriptor mutagenic showed the highest correlation coefficient with carcinogenicity. On the basis of these descriptors, a support vector machine-based (SVM) classification model was developed and fine-tuned by a 10-fold cross-validation approach. Both the SVM model (Model A1) and the best model from the 10-fold cross-validation (Model B3) runs gave good results on the test set with prediction accuracy over 80%, sensitivity over 76%, and specificity over 82%. In addition, extended connectivity fingerprints (ECFPs) and the Toxtree software were used to analyze the functional groups and substructures linked to carcinogenicity. It was found that the results of both methods are in good agreement.
引用
收藏
页码:741 / 749
页数:9
相关论文
共 50 条
  • [1] Prediction of Carcinogenicity of Noncongeneric Chemical Substances by a Support Vector Machine
    Tanabe, Kazutoshi
    Suzuki, Takahiro
    Kaihara, Mikio
    Onodera, Natsuo
    JOURNAL OF COMPUTER CHEMISTRY-JAPAN, 2008, 7 (03) : 93 - 101
  • [2] Carcinogenicity prediction of noncongeneric chemicals by augmented top priority fragment classification
    Casalegno, Mose
    Sello, Guido
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 61 : 145 - 154
  • [3] Prediction of carcinogenicity of diverse chemical substances by a support vector machine
    Tanabe, K.
    Suzuki, T.
    JOURNAL OF PHARMACY AND PHARMACOLOGY, 2008, 60 : A35 - A35
  • [4] Carcinogenicity modelling of diverse chemicals based on substructure grouping and support vector machines
    Tanabe, K.
    Lucic, B.
    Amic, D.
    Suzuki, T.
    JOURNAL OF PHARMACY AND PHARMACOLOGY, 2009, 61 : A65 - A66
  • [5] Prediction of rodent carcinogenicity for 30 chemicals
    Ashby, J
    ENVIRONMENTAL HEALTH PERSPECTIVES, 1996, 104 : 1101 - 1104
  • [6] Application of support vector machine for prediction and classification
    AKramar, V.
    Alchakov, V. V.
    Dushko, V. R.
    Kramar, T. V.
    INTERNATIONAL CONFERENCE INFORMATION TECHNOLOGIES IN BUSINESS AND INDUSTRY 2018, PTS 1-4, 2018, 1015
  • [7] Prediction in marketing using the support vector machine
    Cui, DP
    Curry, D
    MARKETING SCIENCE, 2005, 24 (04) : 595 - 615
  • [8] Support vector machine to criminal recidivism prediction
    Kovalchuk, Olha
    Shevchuk, Ruslan
    Babala, Ludmila
    Kasianchuk, Mykhailo
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (03) : 691 - 697
  • [9] The Application of Support Vector Machine to Operon Prediction
    Wang, Xiumei
    Du, Wei
    Wang, Yan
    Zhang, Chen
    Zhou, Chunguang
    Wang, Shuqin
    Liang, Yanchun
    FGCN: PROCEEDINGS OF THE 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING, VOLS 1 AND 2, 2008, : 1011 - 1014
  • [10] Prediction of Protein Thermostability with Support Vector Machine
    Ai, Haixin
    Zhang, Jikuan
    Zhang, Li
    Deng, Fangbo
    Zhao, Jian
    Liu, Hongsheng
    8TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2014), 2014, : 63 - 68