Comparative Studies on Resampling Techniques in Machine Learning and Deep Learning Models for Drug-Target Interaction Prediction

被引:8
|
作者
Azlim Khan, Azwaar Khan [1 ]
Ahamed Hassain Malim, Nurul Hashimah [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town 11800, Malaysia
来源
MOLECULES | 2023年 / 28卷 / 04期
关键词
drug-target interaction; data resampling; machine learning; deep learning; class imbalance; SMOTE; DISCOVERY; IDENTIFICATION; THERAPY; SMOTE;
D O I
10.3390/molecules28041663
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The prediction of drug-target interactions (DTIs) is a vital step in drug discovery. The success of machine learning and deep learning methods in accurately predicting DTIs plays a huge role in drug discovery. However, when dealing with learning algorithms, the datasets used are usually highly dimensional and extremely imbalanced. To solve this issue, the dataset must be resampled accordingly. In this paper, we have compared several data resampling techniques to overcome class imbalance in machine learning methods as well as to study the effectiveness of deep learning methods in overcoming class imbalance in DTI prediction in terms of binary classification using ten (10) cancer-related activity classes from BindingDB. It is found that the use of Random Undersampling (RUS) in predicting DTIs severely affects the performance of a model, especially when the dataset is highly imbalanced, thus, rendering RUS unreliable. It is also found that SVM-SMOTE can be used as a go-to resampling method when paired with the Random Forest and Gaussian Naive Bayes classifiers, whereby a high F1 score is recorded for all activity classes that are severely and moderately imbalanced. Additionally, the deep learning method called Multilayer Perceptron recorded high F1 scores for all activity classes even when no resampling method was applied.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A learning-based method for drug-target interaction prediction based on feature representation learning and deep neural network
    Jiajie Peng
    Jingyi Li
    Xuequn Shang
    BMC Bioinformatics, 21
  • [42] A Heterogeneous Cross Contrastive Learning Method for Drug-Target Interaction Prediction
    Wang, Qi
    Gu, Jiachang
    Zhang, Jiahao
    Liu, Mingming
    Jin, Xu
    Xie, Maoqiang
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT I, ICIC 2024, 2024, 14881 : 183 - 194
  • [43] HiGraphDTI: Hierarchical Graph Representation Learning for Drug-Target Interaction Prediction
    Liu, Bin
    Wu, Siqi
    Wang, Jin
    Deng, Xin
    Zhou, Ao
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK, PT VI, ECML PKDD 2024, 2024, 14946 : 354 - 370
  • [44] A learning-based method for drug-target interaction prediction based on feature representation learning and deep neural network
    Peng, Jiajie
    Li, Jingyi
    Shang, Xuequn
    BMC BIOINFORMATICS, 2020, 21 (Suppl 13)
  • [45] A Federated Learning Benchmark for Drug-Target Interaction
    Mittone, Gianluca
    Svoboda, Filip
    Aldinucci, Marco
    Lane, Nicholas D.
    Lio, Pietro
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1177 - 1181
  • [46] A Machine Learning Approach for Drug-Target Interaction Prediction using Wrapper Feature Selection and Class Balancing
    Redkar, Shweta
    Mondal, Sukanta
    Joseph, Alex
    Hareesha, K. S.
    MOLECULAR INFORMATICS, 2020, 39 (05)
  • [47] A Machine Learning-Based Biological Drug-Target Interaction Prediction Method for a Tripartite Heterogeneous Network
    Zheng, Ying
    Wu, Zheng
    ACS OMEGA, 2021, 6 (04): : 3037 - 3045
  • [48] A Robust Drug-Target Interaction Prediction Framework with Capsule Network and Transfer Learning
    Huang, Yixian
    Huang, Hsi-Yuan
    Chen, Yigang
    Lin, Yang-Chi-Dung
    Yao, Lantian
    Lin, Tianxiu
    Leng, Junlin
    Chang, Yuan
    Zhang, Yuntian
    Zhu, Zihao
    Ma, Kun
    Cheng, Yeong-Nan
    Lee, Tzong-Yi
    Huang, Hsien-Da
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (18)
  • [49] MINDG: A Drug-Target Interaction Prediction Method Based on an Integrated Learning Algorithm
    Yang, Hailong
    Chen, Yue
    Zuo, Yun
    Deng, Zhaohong
    Pan, Xiaoyong
    Shen, Hong-Bin
    Choi, Kup-Sze
    Yu, Dong-Jun
    BIOINFORMATICS, 2024, 40 (04)
  • [50] NegStacking: Drug-Target Interaction Prediction Based on Ensemble Learning and Logistic Regression
    Yang, Jie
    He, Song
    Zhang, Zhongnan
    Bo, Xiaochen
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2624 - 2634