MultiCon: A Semi-Supervised Approach for Predicting Drug Function from Chemical Structure Analysis

被引:17
|
作者
Sahoo, Pracheta [1 ]
Roy, Indranil [2 ]
Wang, Zhuoyi [1 ]
Mi, Feng [1 ]
Yu, Lin [1 ]
Balasubramani, Pradeep [1 ]
Khan, Latifur [1 ]
Stoddart, J. Fraser [2 ,3 ,4 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75080 USA
[2] Northwestern Univ, Dept Chem, Evanston, IL 60208 USA
[3] Tianjin Univ, Inst Mol Design & Synth, Tianjin 300072, Peoples R China
[4] Univ New South Wales, Sch Chem, Sydney, NSW 2052, Australia
关键词
DISCOVERY;
D O I
10.1021/acs.jcim.0c00801
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Semi-supervised learning has proved its efficacy in utilizing extensive unlabeled data to alleviate the use of a large amount of supervised data and improve model performance. Despite its tremendous potential, semi-supervised learning has yet to be implemented in the field of drug discovery. Empirical testing of drugs and their classification is costly and time-consuming. In contrast, predicting therapeutic applications of drugs from their structural formulas using semi-supervised learning would reduce costs and time significantly. Herein, we employ a new multicontrastive-based semi-supervised learning algorithm-MultiCon-for classifying drugs into 12 categories, according to therapeutic applications, on the basis of image analyses of their structural formulas. By rational use of data balancing, online augmentations of the drug image data during training, and the combined use of multicontrastive loss with consistency regularization, MultiCon achieves better class prediction accuracies when compared with the state-of-the-art machine learning methods across a variety of existing semi-supervised learning benchmarks. In particular, it performs exceptionally well with a limited number of labeled examples. For instance, with just 5000 labeled drugs in a PubChem (D-3) data set, MultiCon achieved a class prediction accuracy of 97.74%.
引用
收藏
页码:5995 / 6006
页数:12
相关论文
共 50 条
  • [41] A novel semi-supervised approach for feature extraction
    Qiu, Junyang
    Zhang, Yanyan
    Pan, Zhisong
    Yang, Haimin
    Ren, Huifeng
    Li, Xin
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3765 - 3770
  • [42] Semi-supervised approach to Romanian noun declension
    Octavia-Maria, Sulea
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 664 - 671
  • [43] An artificial life approach for semi-supervised learning
    Herrmann, Lutz
    Ultsch, Alfred
    DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 139 - 146
  • [44] A genetic algorithm approach for semi-supervised clustering
    Demiriz, Ayhan
    Bennett, Kristin P.
    Embrechts, Mark J.
    International Journal of Smart Engineering System Design, 2002, 4 (01): : 21 - 30
  • [45] A Semi-Supervised Approach to Message Stance Classification
    Giasemidis, Georgios
    Kaplis, Nikolaos
    Agrafiotis, Ioannis
    Nurse, Jason R. C.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (01) : 1 - 11
  • [46] Semi-supervised graph clustering: a kernel approach
    Brian Kulis
    Sugato Basu
    Inderjit Dhillon
    Raymond Mooney
    Machine Learning, 2009, 74 : 1 - 22
  • [47] A semi-supervised approach for the semantic segmentation of trajectories
    Soares Junior, Amilcar
    Times, Valeria Cesario
    Renso, Chiara
    Matwin, Stan
    Cabral, Lucidio A. F.
    2018 19TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2018), 2018, : 145 - 154
  • [48] An Evidential Semi-supervised Label Aggregation Approach
    Abassi, Lina
    Boukhris, Imen
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 676 - 686
  • [49] MixMatch: A Holistic Approach to Semi-Supervised Learning
    Berthelot, David
    Carlini, Nicholas
    Goodfellow, Ian
    Oliver, Avital
    Papernot, Nicolas
    Raffel, Colin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [50] A semi-supervised approach to growing classification trees
    Santhiappan, Sudarsun
    Ravindran, Balaraman
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 29 - 37