Multiclass Synthetic Accessibility Prediction

被引:0
|
作者
Li, Xinqi [1 ]
Walsh, Ryan [2 ,3 ]
Abbas, Waseem [1 ]
Pascual-Diaz, Sergio [1 ]
Hand, Calum [1 ]
Garland, Rory [1 ]
Khan, Faiz Mohammad [1 ]
Das, Nikhil Mohan [1 ]
Desai, Vedant [1 ]
Abouzleikha, Mohamed [1 ]
Clark, Matthew A. [3 ]
机构
[1] X Chem UK, Altrincham WA14 2DT, Cheshire, England
[2] X Chem Canada, Montreal, PQ H4R 2P1, Canada
[3] X Chem Global HQ, Waltham, MA 02453 USA
关键词
D O I
10.1021/acs.jcim.4c01663
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Evaluating synthetic accessibility of in silico molecules is an integral component of the drug discovery process. While the application of machine learning models to predict whether small molecules are easy or hard to synthesize has gained attention recently, predetermined thresholds and data set imbalances present challenges for these binary classification approaches. In this study, we introduce a novel multiclass fold-ensembled classification approach to predict the minimum number of steps needed to synthesize a small molecule. By ensembling the base models trained on multiple stratified subsampled folds, this approach effectively mitigates the impact of class imbalance through probability aggregation or voting aggregation strategies. Additionally, we propose fuzzy evaluation metrics that account for practical tolerances in predictions, providing a more flexible and realistic assessment of model performance. Through experimentation on two reaction benchmark data sets, we demonstrate the effectiveness of our model in a multiclass synthetic accessibility prediction task and the superiority of our proposed method over six existing models in binary synthetic accessibility prediction tasks.
引用
收藏
页码:1155 / 1165
页数:11
相关论文
共 50 条
  • [31] Model Evaluation Improvements for Multiclass Classification in Diagnosis Prediction
    Coroiu, Adriana Mihaela
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 782 - 783
  • [32] Mixtron: Bandit Online Multiclass Prediction with Implicit Feedback
    Feng, Wanjin
    Shi, Hailong
    Zhao, Peilin
    Gao, Xingyu
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1004 - 1012
  • [33] Degree of Differential Prioritization Prediction for Multiclass Molecular Classification
    Ooi, Chia Huey
    Chetty, Madhu
    Teng, Shyh Wei
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2009, 28 (04): : 45 - 51
  • [34] Empirical Analysis of Machine Learning Algorithms for Multiclass Prediction
    Ishfaq, Umar
    Shabbir, Danial
    Khan, Jumshaid
    Khan, Hikmat Ullah
    Naseer, Salman
    Irshad, Azeem
    Shafiq, Muhammad
    Hamam, Habib
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [35] Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
    Xuan, Jianhua
    Wang, Yue
    Dong, Yibin
    Feng, Yuanjian
    Wang, Bin
    Khan, Javed
    Bakay, Maria
    Wang, Zuyi
    Pachman, Lauren
    Winokur, Sara
    Chen, Yi-Wen
    Clarke, Robert
    Hoffman, Eric
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01):
  • [36] Empirical Analysis of Machine Learning Algorithms for Multiclass Prediction
    Ishfaq, Umar
    Shabbir, Danial
    Khan, Jumshaid
    Khan, Hikmat Ullah
    Naseer, Salman
    Irshad, Azeem
    Shafiq, Muhammad
    Hamam, Habib
    Wireless Communications and Mobile Computing, 2022, 2022
  • [37] A Realistic Seizure Prediction Study Based on Multiclass SVM
    Direito, Bruno
    Teixeira, Cesar A.
    Sales, Francisco
    Castelo-Branco, Miguel
    Dourado, Antonio
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2017, 27 (03)
  • [38] New 2-tier Multiclass Prediction Framework
    Awad, Mamoun
    2015 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ITS APPLICATIONS (ICCSA), 2015, : 77 - 81
  • [39] Decision trees to multiclass prediction for analysis of arecanut data
    Suresha, M.
    Danti, Ajit
    Narasimhamurthy, S. K.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2014, 29 (01): : 105 - 114
  • [40] Performance enhancement in AODV with accessibility prediction
    Rehman, Habib-ur
    Wolf, Lars
    2007 IEEE INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR SYSTEMS, VOLS 1-3, 2007, : 839 - 844