pLM4CPPs: Protein Language Model-Based Predictor for Cell Penetrating Peptides

被引:0
|
作者
Kumar, Nandan [1 ]
Du, Zhenjiao [1 ]
Li, Yonghui [1 ]
机构
[1] Kansas State Univ, Dept Grain Sci & Ind, Manhattan, KS 66506 USA
关键词
RICH ANTIMICROBIAL PEPTIDES; DELIVERY; MECHANISMS; VEHICLES;
D O I
10.1021/acs.jcim.4c01338
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Cell-penetrating peptides (CPPs) are short peptides capable of penetrating cell membranes, making them valuable for drug delivery and intracellular targeting. Accurate prediction of CPPs can streamline experimental validation in the lab. This study aims to assess pretrained protein language models (pLMs) for their effectiveness in representing CPPs and develop a reliable model for CPP classification. We evaluated peptide embeddings generated from BEPLER, CPCProt, SeqVec, various ESM variants (ESM, ESM-2 with expanded feature set, ESM-1b, and ESM-1v), ProtT5-XL UniRef50, ProtT5-XL BFD, and ProtBERT. We developed pLM4CCPs, a novel deep learning architecture using convolutional neural networks (CNNs) as the classifier for binary classification of CPPs. pLM4CCPs demonstrated superior performance over existing state-of-the-art CPP prediction models, achieving improvements in accuracy (ACC) by 4.9-5.5%, Matthews correlation coefficient (MCC) by 9.3-10.2%, and sensitivity (Sn) by 14.1-19.6%. Among all the tested models, ESM-1280 and ProtT5-XL BFD demonstrated the highest overall performance on the kelm data set. ESM-1280 achieved an ACC of 0.896, an MCC of 0.796, a Sn of 0.844, and a specificity (Sp) of 0.978. ProtT5-XL BFD exhibited superior performance with an ACC of 0.901, an MCC of 0.802, an Sn of 0.885, and an Sp of 0.917. pLM4CCPs combine predictions from multiple models to provide a consensus on whether a given peptide sequence is classified as a CPP or non-CPP. This approach will enhance prediction reliability by leveraging the strengths of each individual model. A user-friendly web server for bioactivity predictions, along with data sets, is available at https://ry2acnp6ep.us-east-1.awsapprunner.com. The source code and protocol for adapting pLM4CPPs can be accessed on GitHub at https://github.com/drkumarnandan/pLM4CPPs. This platform aims to advance CPP prediction and peptide functionality modeling, aiding researchers in exploring peptide functionality effectively.
引用
收藏
页码:1128 / 1139
页数:12
相关论文
共 44 条
  • [1] pLM4Alg: Protein Language Model-Based Predictors for Allergenic Proteins and Peptides
    Du, Zhenjiao
    Xu, Yixiang
    Liu, Changqi
    Li, Yonghui
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2023, 72 (01) : 752 - 760
  • [2] pLM4ACE: A protein language model based predictor for antihypertensive peptide screening
    Du, Zhenjiao
    Ding, Xingjian
    Hsu, William
    Munir, Arslan
    Xu, Yixiang
    Li, Yonghui
    FOOD CHEMISTRY, 2024, 431
  • [3] Proline-based γ-peptides as novel cell penetrating peptides (CPPs) in Leishmania
    Lopez-Sanchez, A.
    Carbajo, D.
    Angeles Abengozar, M.
    Albericio, F.
    Royo, M.
    Rivas, L.
    FEBS JOURNAL, 2012, 279 : 359 - 360
  • [4] Cholesterol Influence on the Interaction of Cell Penetrating Peptides (CPPs) with Model Membranes
    Silva, Viviana E.
    Guzman, Fanny
    Sotomayor, Patricio
    Aguilar, Luis F.
    BIOPHYSICAL JOURNAL, 2016, 110 (03) : 245A - 245A
  • [5] Understanding the mechanism of cell penetrating peptides (CPPs) by a new approach based on advanced model membranes
    Esposito, Cinzia
    Baumgart, Tobias
    D'Ursi, Anna Maria
    JOURNAL OF PEPTIDE SCIENCE, 2008, 14 (08) : 2 - 3
  • [6] A model-based solution for process modeling in practice environments: PLM4BS
    Alberto Garcia-Garcia, Julian
    Garcia-Borgonon, Laura
    Jose Escalona, Maria
    Mejias, Manuel
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (12)
  • [7] Flakify: A Black-Box, Language Model-Based Predictor for Flaky Tests
    Fatima, Sakina
    Ghaleb, Taher A.
    Briand, Lionel
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 1912 - 1927
  • [8] CPPred-RF: A Sequence-based Predictor for Identifying Cell Penetrating Peptides and Their Uptake Efficiency
    Wei, Leyi
    Xing, PengWei
    Su, Ran
    Shi, Gaotao
    Ma, Zhanshan Sam
    Zou, Quan
    JOURNAL OF PROTEOME RESEARCH, 2017, 16 (05) : 2044 - 2053
  • [9] Osmolality is a predictor for model-based real time monitoring of concentration in protein chromatography
    Felfoedi, Edith
    Scharl, Theresa
    Melcher, Michael
    Duerauer, Astrid
    Wright, Kristeena
    Jungbauer, Alois
    JOURNAL OF CHEMICAL TECHNOLOGY AND BIOTECHNOLOGY, 2020, 95 (04) : 1146 - 1152
  • [10] Graph-BERT and language model-based framework for protein–protein interaction identification
    Kanchan Jha
    Sourav Karmakar
    Sriparna Saha
    Scientific Reports, 13