Breaking the barriers of data scarcity in drug-target affinity prediction

被引:10
|
作者
Pei, Qizhi
Wu, Lijun
Zhu, Jinhua
Xia, Yingce
Xie, Shufang
Qin, Tao
Liu, Haiguang
Liu, Tie-Yan
Yan, Rui
机构
基金
中国国家自然科学基金;
关键词
Drug-Target Affinity Prediction; Data Scarcity; Multi-task Learning; Semi-supervised Learning; Masked Language Modeling; PROTEIN; DOCKING;
D O I
10.1093/bib/bbad386
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate prediction of drug-target affinity (DTA) is of vital importance in early-stage drug discovery, facilitating the identification of drugs that can effectively interact with specific targets and regulate their activities. While wet experiments remain the most reliable method, they are time-consuming and resource-intensive, resulting in limited data availability that poses challenges for deep learning approaches. Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue. To overcome this challenge, we present the Semi-Supervised Multi-task training (SSM) framework for DTA prediction, which incorporates three simple yet highly effective strategies: (1) A multi-task training approach that combines DTA prediction with masked language modeling using paired drug-target data. (2) A semi-supervised training method that leverages large-scale unpaired molecules and proteins to enhance drug and target representations. This approach differs from previous methods that only employed molecules or proteins in pre-training. (3) The integration of a lightweight cross-attention module to improve the interaction between drugs and targets, further enhancing prediction accuracy. Through extensive experiments on benchmark datasets such as BindingDB, DAVIS and KIBA, we demonstrate the superior performance of our framework. Additionally, we conduct case studies on specific drug-target binding activities, virtual screening experiments, drug feature visualizations and real-world applications, all of which showcase the significant potential of our work. In conclusion, our proposed SSM-DTA framework addresses the data limitation challenge in DTA prediction and yields promising results, paving the way for more efficient and accurate drug discovery processes.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Breaking the barriers of data scarcity in drug-target affinity prediction (vol 24, bbad386, 2023)
    Pei, Qizhi
    Wu, Lijun
    Zhu, Jinhua
    Xia, Yingce
    Xie, Shufang
    Qin, Tao
    Liu, Haiguang
    Liu, Tie-Yan
    Yan, Rui
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (01)
  • [2] DeepDTA: deep drug-target binding affinity prediction
    Ozturk, Hakime
    Ozgur, Arzucan
    Ozkirimli, Elif
    BIOINFORMATICS, 2018, 34 (17) : 821 - 829
  • [3] Drug-target affinity prediction using applicability domain based on data density
    Sugita, Shunya
    Ohue, Masahito
    2021 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2021, : 224 - 229
  • [4] Explainable deep drug-target representations for binding affinity prediction
    Monteiro, Nelson R. C.
    Simoes, Carlos J. V.
    avila, Henrique V.
    Abbasi, Maryam
    Oliveira, Jose L.
    Arrais, Joel P.
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [5] Prediction of Drug-Target Affinity Using Attention Neural Network
    Tang, Xin
    Lei, Xiujuan
    Zhang, Yuchen
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (10)
  • [6] GEFA: Early Fusion Approach in Drug-Target Affinity Prediction
    Tri Minh Nguyen
    Thin Nguyen
    Thao Minh Le
    Truyen Tran
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 718 - 728
  • [7] ImageDTA: A Simple Model for Drug-Target Binding Affinity Prediction
    Han, Li
    Kang, Ling
    Guo, Quan
    ACS OMEGA, 2024, 9 (26): : 28485 - 28493
  • [8] Drug-target Affinity Prediction by Molecule Secondary Structure Representation Network
    Tang, Yuewei
    Li, Yunhai
    Li, Pengpai
    Liu, Zhi-Ping
    CURRENT MEDICINAL CHEMISTRY, 2024,
  • [9] Effective drug-target affinity prediction via generative active learning
    Liu, Yuansheng
    Zhou, Zhenran
    Cao, Xiaofeng
    Cao, Dongsheng
    Zeng, Xiangxiang
    INFORMATION SCIENCES, 2024, 679
  • [10] Integrating sequence and graph information for enhanced drug-target affinity prediction
    Haohuai HE
    Guanxing CHEN
    Calvin Yu-Chian CHEN
    Science China(Information Sciences), 2024, 67 (02) : 325 - 326