MuCoCP: a priori chemical knowledge-based multimodal contrastive learning pre-trained neural network for the prediction of cyclic peptide membrane penetration ability

被引:0
|
作者
Yu, Yunxiang [1 ]
Gu, Mengyun [1 ]
Guo, Hai [2 ]
Deng, Yabo [1 ]
Chen, Danna [1 ,3 ,4 ]
Wang, Jianwei [4 ]
Wang, Caixia
Liu, Xia [1 ]
Yan, Wenjin [1 ]
Huang, Jinqi [3 ,4 ]
机构
[1] Lanzhou Univ, Sch Basic Med Sci, 199 Donggang West Rd, Lanzhou 730000, Peoples R China
[2] Lanzhou Univ, Hosp 2, Clin Med Sch, Lanzhou 730000, Peoples R China
[3] Guangdong Med Univ, Affiliated Hosp, Zhanjiang 524000, Peoples R China
[4] South China Univ Technol, Guangzhou Peoples Hosp 1, 1 Panfu Rd, Guangzhou 510180, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/btae473
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation There has been a burgeoning interest in cyclic peptide therapeutics due to their various outstanding advantages and strong potential for drug formation. However, it is undoubtedly costly and inefficient to use traditional wet lab methods to clarify their biological activities. Using artificial intelligence instead is a more energy-efficient and faster approach. MuCoCP aims to build a complete pre-trained model for extracting potential features of cyclic peptides, which can be fine-tuned to accurately predict cyclic peptide bioactivity on various downstream tasks. To maximize its effectiveness, we use a novel data augmentation method based on a priori chemical knowledge and multiple unsupervised training objective functions to greatly improve the information-grabbing ability of the model.Results To assay the efficacy of the model, we conducted validation on the membrane-permeability of cyclic peptides which achieved an accuracy of 0.87 and R-squared of 0.503 on CycPeptMPDB using semi-supervised training and obtained an accuracy of 0.84 and R-squared of 0.384 using a model with frozen parameters on an external dataset. This result has achieved state-of-the-art, which substantiates the stability and generalization capability of MuCoCP. It means that MuCoCP can fully explore the high-dimensional information of cyclic peptides and make accurate predictions on downstream bioactivity tasks, which will serve as a guide for the future de novo design of cyclic peptide drugs and promote the development of cyclic peptide drugs.Availability and implementation All code used in our proposed method can be found at https://github.com/lennonyu11234/MuCoCP.
引用
收藏
页数:9
相关论文
共 1 条
  • [1] DeepAIP: Deep learning for anti-inflammatory peptide prediction using pre-trained protein language model features based on contextual self-attention network
    Zhu, Lun
    Yang, Qingguo
    Yang, Sen
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2024, 280