Small dataset solves big problem: An outlier-insensitive binary classifier for inhibitory potency prediction

被引:13
|
作者
Zhou, Teng [1 ]
Dou, Haowen [1 ]
Tan, Jie [2 ]
Song, Youyi [3 ]
Wang, Fei [1 ,4 ,5 ]
Wang, Jiaqi [2 ]
机构
[1] Shantou Univ, Dept Comp Sci, Shantou, Peoples R China
[2] Sun Yat Sen Univ, Sch Pharmaceut Sci Shenzhen, Shenzhen, Peoples R China
[3] Hong Kong Polytech Univ, Ctr Smart Hlth, Sch Nursing, Hong Kong, Peoples R China
[4] Shantou Univ, MoE Key Lab Intelligent Mfg Technol, Shantou, Peoples R China
[5] Guangdong Prov Key Lab Infect Dis & Mol Immunopath, Shantou, Peoples R China
关键词
Drug screening; Inhibitory potency prediction; Machine learning; Outlier -insensitive learning; Feature selection; NAMPT; CORRENTROPY; DISCOVERY; ANTIBODY; VIRUS;
D O I
10.1016/j.knosys.2022.109242
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nicotinamide phosphoribosyltransferase (NAMPT) inhibitors show importance in cancer disease treatment while selecting compounds from a library according to inhibitory potency for further experiments is considered to be the main way for drug discovery. Meanwhile, computational methods have been widely used to accelerate the process of drug discovery. Hence, we propose a machine learning model that only needs to be trained on an extremely small dataset to predict the inhibition constant (Ki) and half maximal inhibitory concentration (IC50) for a compound. The key idea is to directly rank compounds according to inhibitory potency by solving a simpler binary classification problem since we only need the relative ranks of the inhibitors for drug screening. To this end, we develop an adaptive data augmentation method to consider and effectively capture the relative information between compounds in the original dataset. However, outliers in small samples can always be tricky to detect, and may severely affect the learned distribution of the classifier. In this regard, we propose an outlierinsensitive classifier with an effective feature selection module for the one-to-all classification task. Extensive experiments show that our model gains high and reliable accuracy in ranking compounds according to inhibitory potency. The current results demonstrate that the proposed model achieves reliability in prioritizing chemicals for experiment research and analysis through a ligand-based in silico approach.
引用
收藏
页数:10
相关论文
共 2 条
  • [1] Transfer inhibitory potency prediction to binary classification: A model only needs a small training set
    Dou, Haowen
    Tan, Jie
    Wei, Huiling
    Wang, Fei
    Yang, Jinzhu
    Ma, X-G
    Wang, Jiaqi
    Zhou, Teng
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 215
  • [2] SAE-SV: A Stacked-AutoEncoder and Soft Voting Joint Approach Based on Small Dataset with High Dimensions for Inhibitory Potency Prediction
    Zhang, Haotian
    Ma, Xiaoguang
    Lin, Zhizhe
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 1170 - 1175