Breaking the barriers of data scarcity in drug-target affinity prediction

被引:10
|
作者
Pei, Qizhi
Wu, Lijun
Zhu, Jinhua
Xia, Yingce
Xie, Shufang
Qin, Tao
Liu, Haiguang
Liu, Tie-Yan
Yan, Rui
机构
基金
中国国家自然科学基金;
关键词
Drug-Target Affinity Prediction; Data Scarcity; Multi-task Learning; Semi-supervised Learning; Masked Language Modeling; PROTEIN; DOCKING;
D O I
10.1093/bib/bbad386
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate prediction of drug-target affinity (DTA) is of vital importance in early-stage drug discovery, facilitating the identification of drugs that can effectively interact with specific targets and regulate their activities. While wet experiments remain the most reliable method, they are time-consuming and resource-intensive, resulting in limited data availability that poses challenges for deep learning approaches. Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue. To overcome this challenge, we present the Semi-Supervised Multi-task training (SSM) framework for DTA prediction, which incorporates three simple yet highly effective strategies: (1) A multi-task training approach that combines DTA prediction with masked language modeling using paired drug-target data. (2) A semi-supervised training method that leverages large-scale unpaired molecules and proteins to enhance drug and target representations. This approach differs from previous methods that only employed molecules or proteins in pre-training. (3) The integration of a lightweight cross-attention module to improve the interaction between drugs and targets, further enhancing prediction accuracy. Through extensive experiments on benchmark datasets such as BindingDB, DAVIS and KIBA, we demonstrate the superior performance of our framework. Additionally, we conduct case studies on specific drug-target binding activities, virtual screening experiments, drug feature visualizations and real-world applications, all of which showcase the significant potential of our work. In conclusion, our proposed SSM-DTA framework addresses the data limitation challenge in DTA prediction and yields promising results, paving the way for more efficient and accurate drug discovery processes.
引用
收藏
页数:11
相关论文
共 50 条
  • [11] Prediction of drug-target binding affinity based on deep learning models
    Zhang H.
    Liu X.
    Cheng W.
    Wang T.
    Chen Y.
    Computers in Biology and Medicine, 2024, 174
  • [12] MFFDTA: A Multimodal Feature Fusion Framework for Drug-Target Affinity Prediction
    Wang, Wei
    Su, Ziwen
    Liu, Dong
    Zhang, Hongjun
    Shang, Jiangli
    Zhou, Yun
    Wang, Xianfang
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 243 - 254
  • [13] AttentionDTA: prediction of drug-target binding affinity using attention model
    Zhao, Qichang
    Xiao, Fen
    Yang, Mengyun
    Li, Yaohang
    Wang, Jianxin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 64 - 69
  • [14] Multimodal contrastive representation learning for drug-target binding affinity prediction
    Zhang, Linlin
    Ouyang, Chunping
    Liu, Yongbin
    Liao, Yiming
    Gao, Zheng
    METHODS, 2023, 220 : 126 - 133
  • [15] Hierarchical graph representation learning for the prediction of drug-target binding affinity
    Chu, Zhaoyang
    Huang, Feng
    Fu, Haitao
    Quan, Yuan
    Zhou, Xionghui
    Liu, Shichao
    Zhang, Wen
    INFORMATION SCIENCES, 2022, 613 : 507 - 523
  • [16] Deep drug-target binding affinity prediction with multiple attention blocks
    Zeng, Yuni
    Chen, Xiangru
    Luo, Yujie
    Li, Xuedong
    Peng, Dezhong
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [17] Integrating sequence and graph information for enhanced drug-target affinity prediction
    He, Haohuai
    Chen, Guanxing
    Chen, Calvin Yu-Chian
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
  • [18] A Computational Software for Training Robust Drug-Target Affinity Prediction Models: pydebiaseddta
    Barsbey, Melih
    Ozcelik, Riza
    Bag, Alperen
    Atil, Berk
    Ozgur, Arzucan
    Ozkirimli, Elif
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (11) : 1240 - 1245
  • [19] DTITR: End-to-end drug-target binding affinity prediction with transformers
    Monteiro, Nelson R. C.
    Oliveira, Jose L.
    Arrais, Joel P.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 147
  • [20] Drug-target affinity prediction with extended graph learning-convolutional networks
    Qi, Haiou
    Yu, Ting
    Yu, Wenwen
    Liu, Chenxi
    BMC BIOINFORMATICS, 2024, 25 (01)