GraphsformerCPI: Graph Transformer for Compound-Protein Interaction Prediction

被引:5
|
作者
Ma, Jun [1 ,2 ]
Zhao, Zhili [1 ]
Li, Tongfeng [1 ,3 ]
Liu, Yunwu [1 ]
Ma, Jun [1 ,2 ]
Zhang, Ruisheng [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Peoples R China
[2] Lanzhou Univ Finance & Econ, Sch Informat Engn, Lanzhou 730020, Peoples R China
[3] Qinghai Normal Univ, Comp Coll, Xining 810016, Peoples R China
基金
中国国家自然科学基金;
关键词
CPI prediction; Deep learning; Molecular graph; Attention mechanism;
D O I
10.1007/s12539-024-00609-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurately predicting compound-protein interactions (CPI) is a critical task in computer-aided drug design. In recent years, the exponential growth of compound activity and biomedical data has highlighted the need for efficient and interpretable prediction approaches. In this study, we propose GraphsformerCPI, an end-to-end deep learning framework that improves prediction performance and interpretability. GraphsformerCPI treats compounds and proteins as sequences of nodes with spatial structures, and leverages novel structure-enhanced self-attention mechanisms to integrate semantic and graph structural features within molecules for deep molecule representations. To capture the vital association between compound atoms and protein residues, we devise a dual-attention mechanism to effectively extract relational features through .cross-mapping. By extending the powerful learning capabilities of Transformers to spatial structures and extensively utilizing attention mechanisms, our model offers strong interpretability, a significant advantage over most black-box deep learning methods. To evaluate GraphsformerCPI, extensive experiments were conducted on benchmark datasets including human, C. elegans, Davis and KIBA datasets. We explored the impact of model depth and dropout rate on performance and compared our model against state-of-the-art baseline models. Our results demonstrate that GraphsformerCPI outperforms baseline models in classification datasets and achieves competitive performance in regression datasets. Specifically, on the human dataset, GraphsformerCPI achieves an average improvement of 1.6% in AUC, 0.5% in precision, and 5.3% in recall. On the KIBA dataset, the average improvement in Concordance index (CI) and mean squared error (MSE) is 3.3% and 7.2%, respectively. Molecular docking shows that our model provides novel insights into the intrinsic interactions and binding mechanisms. Our research holds practical significance in effectively predicting CPIs and binding affinities, identifying key atoms and residues, enhancing model interpretability.
引用
收藏
页码:361 / 377
页数:17
相关论文
共 50 条
  • [31] Helix encoder: a compound-protein interaction prediction model specifically designed for class A GPCRs
    Yamane, Haruki
    Ishida, Takashi
    FRONTIERS IN BIOINFORMATICS, 2023, 3
  • [32] Improving compound-protein interaction prediction by building up highly credible negative samples
    Liu, Hui
    Sun, Jianjiang
    Guan, Jihong
    Zheng, Jie
    Zhou, Shuigeng
    BIOINFORMATICS, 2015, 31 (12) : 221 - 229
  • [33] Article Compound-protein interaction prediction based on heterogeneous network reveals potential antihepatoma agents
    Wang, Yong-Cui
    Li, Tian-Ze
    Chen, Ji-Jun
    ISCIENCE, 2024, 27 (08)
  • [34] Compound-Protein Interaction Prediction Within Chemogenomics: Theoretical Concepts, Practical Usage, and Future Directions
    Brown, J. B.
    Niijima, Satoshi
    Okuno, Yasushi
    MOLECULAR INFORMATICS, 2013, 32 (11-12) : 906 - 921
  • [35] Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences
    Tsubaki, Masashi
    Tomii, Kentaro
    Sese, Jun
    BIOINFORMATICS, 2019, 35 (02) : 309 - 318
  • [36] Identifying compound-protein interactions with knowledge graph embedding of perturbation transcriptomics
    Ni, Shengkun
    Kong, Xiangtai
    Zhang, Yingying
    Chen, Zhengyang
    Wang, Zhaokun
    Fu, Zunyun
    Huo, Ruifeng
    Tong, Xiaochu
    Qu, Ning
    Wu, Xiaolong
    Wang, Kun
    Zhang, Wei
    Zhang, Runze
    Zhang, Zimei
    Shi, Jiangshan
    Wang, Yitian
    Yang, Ruirui
    Li, Xutong
    Zhang, Sulin
    Zheng, Mingyue
    CELL GENOMICS, 2024, 4 (10):
  • [37] Improved compound-protein interaction site and binding affinity prediction using self-supervised protein embeddings
    Wu, Jialin
    Liu, Zhe
    Yang, Xiaofeng
    Lin, Zhanglin
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [38] Scalable prediction of compound-protein interactions using minwise hashing
    Tabei, Yasuo
    Yamanishi, Yoshihiro
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [39] Compound-Protein Interaction Analysis in Condition Following Cardiac Arrest
    Azodi, Mona Zamanian
    Tavirani, Mostafa Rezaei
    Tavirani, Majid Rezaei
    GALEN MEDICAL JOURNAL, 2018, 7 (01):
  • [40] MDL-CPI: Multi-view deep learning model for compound-protein interaction prediction
    Wei, Lesong
    Long, Wentao
    Wei, Leyi
    METHODS, 2022, 204 : 418 - 427