Evaluating synthetic accessibility of in silico molecules is an integral component of the drug discovery process. While the application of machine learning models to predict whether small molecules are easy or hard to synthesize has gained attention recently, predetermined thresholds and data set imbalances present challenges for these binary classification approaches. In this study, we introduce a novel multiclass fold-ensembled classification approach to predict the minimum number of steps needed to synthesize a small molecule. By ensembling the base models trained on multiple stratified subsampled folds, this approach effectively mitigates the impact of class imbalance through probability aggregation or voting aggregation strategies. Additionally, we propose fuzzy evaluation metrics that account for practical tolerances in predictions, providing a more flexible and realistic assessment of model performance. Through experimentation on two reaction benchmark data sets, we demonstrate the effectiveness of our model in a multiclass synthetic accessibility prediction task and the superiority of our proposed method over six existing models in binary synthetic accessibility prediction tasks.
机构:
Natl Inst Adv Ind Sci & Technol, Mol Profiling Res Ctr Drug Discovery Molprof, Koto Ku, Tokyo 1350064, Japan
Technol Res Assoc Next Generat Nat Prod Chem, Koto Ku, Tokyo 1350064, JapanNatl Inst Adv Ind Sci & Technol, Mol Profiling Res Ctr Drug Discovery Molprof, Koto Ku, Tokyo 1350064, Japan
Fukunishi, Yoshifumi
Kurosawa, Takashi
论文数: 0引用数: 0
h-index: 0
机构:
Technol Res Assoc Next Generat Nat Prod Chem, Koto Ku, Tokyo 1350064, Japan
Hitachi Solut East Japan, Kawasaki Ku, Kawasaki, Kanagawa 2100007, JapanNatl Inst Adv Ind Sci & Technol, Mol Profiling Res Ctr Drug Discovery Molprof, Koto Ku, Tokyo 1350064, Japan
Kurosawa, Takashi
Mikami, Yoshiaki
论文数: 0引用数: 0
h-index: 0
机构:
Technol Res Assoc Next Generat Nat Prod Chem, Koto Ku, Tokyo 1350064, Japan
Hitachi Solut East Japan, Kawasaki Ku, Kawasaki, Kanagawa 2100007, JapanNatl Inst Adv Ind Sci & Technol, Mol Profiling Res Ctr Drug Discovery Molprof, Koto Ku, Tokyo 1350064, Japan
Mikami, Yoshiaki
Nakamura, Haruki
论文数: 0引用数: 0
h-index: 0
机构:
Osaka Univ, Inst Prot Res, Suita, Osaka 5650871, JapanNatl Inst Adv Ind Sci & Technol, Mol Profiling Res Ctr Drug Discovery Molprof, Koto Ku, Tokyo 1350064, Japan
机构:
Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Univ Adelaide, Comp Vis Grp, Adelaide, SA 5005, AustraliaUniv Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Shi, Qinfeng
Reid, Mark
论文数: 0引用数: 0
h-index: 0
机构:
Australian Natl Univ, Canberra, ACT 0200, Australia
NICTA, Canberra, ACT, AustraliaUniv Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Reid, Mark
Caetano, Tiberio
论文数: 0引用数: 0
h-index: 0
机构:
Australian Natl Univ, Canberra, ACT 0200, Australia
NICTA, Canberra, ACT, AustraliaUniv Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Caetano, Tiberio
Van den Hengel, Anton
论文数: 0引用数: 0
h-index: 0
机构:
Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Univ Adelaide, Comp Vis Grp, Adelaide, SA 5005, AustraliaUniv Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Van den Hengel, Anton
Wang, Zhenhua
论文数: 0引用数: 0
h-index: 0
机构:
Univ Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
Univ Adelaide, Comp Vis Grp, Adelaide, SA 5005, AustraliaUniv Adelaide, Australian Ctr Visual Technol, Adelaide, SA 5005, Australia
机构:
Hunan Univ, Coll Informat Sci & Engn, 116 Lu Shan South Rd, Changsha, Hunan, Peoples R ChinaHunan Univ, Coll Informat Sci & Engn, 116 Lu Shan South Rd, Changsha, Hunan, Peoples R China
Zhu, Tuanfei
Lin, Yaping
论文数: 0引用数: 0
h-index: 0
机构:
Hunan Univ, Coll Informat Sci & Engn, 116 Lu Shan South Rd, Changsha, Hunan, Peoples R ChinaHunan Univ, Coll Informat Sci & Engn, 116 Lu Shan South Rd, Changsha, Hunan, Peoples R China
Lin, Yaping
Liu, Yonghe
论文数: 0引用数: 0
h-index: 0
机构:
Univ Texas Arlington, Dept Comp Sci & Engn, 701 S Nedderman Dr, Arlington, TX 76019 USAHunan Univ, Coll Informat Sci & Engn, 116 Lu Shan South Rd, Changsha, Hunan, Peoples R China
机构:
Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R ChinaChinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
Feng, Wanjin
Gao, Xingyu
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R ChinaChinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
Gao, Xingyu
Zhao, Peilin
论文数: 0引用数: 0
h-index: 0
机构:
Tencent AI Lab, Shenzhen 518000, Peoples R ChinaChinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China