Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method

Times cited: 1
Authors
Chu, Yanyan [1 ,2 ,3 ]
Zhang, Huanhuan [1 ]
Zhang, Lei [3 ]
Affiliations
[1] Ocean Univ China, Sch Med & Pharm, Qingdao 266003, Peoples R China
[2] Pilot Natl Lab Marine Sci & Technol Qingdao, Qingdao 266200, Peoples R China
[3] Ocean Univ China, Marine Biomed Res Inst Qingdao, Qingdao 266003, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
peptide toxin; active peptide; function prediction; PU learning; sequence-based; VENOM PEPTIDES; CHANNEL; THERAPEUTICS; CLASSIFIER; ZICONOTIDE; REVEAL; DESIGN;
DOI
10.3390/toxins14110811
Chinese Library Classification (CLC) number
TS2 [Food industry];
Discipline classification code
0832;
Abstract
Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from UniProt, and three positive-unlabeled (PU) learning schemes (adaptive base classifier, two-step method, and PU bagging) were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with the top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs.
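To make the PU bagging scheme named in the abstract more concrete, here is a minimal sketch of the general idea, not the authors' implementation. It assumes amino acid composition as the sequence feature and a nearest-centroid scorer as a stand-in for the 14 classifiers used in the study. The function names, parameters, and toy sequences are all hypothetical. In each round, a bootstrap sample of unlabeled sequences is treated as provisional negatives, and every unlabeled sequence left out of that sample receives an out-of-bag score. The scores are then averaged over rounds.

```python
import random
from collections import defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Fraction of each of the 20 standard residues (a simple sequence-based feature)."""
    n = max(len(seq), 1)
    return [seq.count(a) / n for a in AMINO_ACIDS]


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def sq_dist(u, v):
    """Squared Euclidean distance."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def pu_bagging_scores(positives, unlabeled, n_rounds=50, seed=0):
    """Average out-of-bag score per unlabeled sequence (higher = more positive-like).

    Assumption: a nearest-centroid scorer replaces the real base classifier.
    Returns None for a sequence that was never out of bag.
    """
    rng = random.Random(seed)
    pos_vecs = [composition(s) for s in positives]
    unl_vecs = [composition(s) for s in unlabeled]
    pos_centroid = centroid(pos_vecs)
    totals = defaultdict(float)
    counts = defaultdict(int)
    bag_size = min(len(pos_vecs), len(unl_vecs))
    for _ in range(n_rounds):
        # Bootstrap (with replacement) unlabeled indices as provisional negatives.
        bag = [rng.randrange(len(unl_vecs)) for _ in range(bag_size)]
        neg_centroid = centroid([unl_vecs[i] for i in bag])
        in_bag = set(bag)
        for i, vec in enumerate(unl_vecs):
            if i in in_bag:
                continue  # score only out-of-bag sequences
            # Positive when the sequence lies closer to the positive centroid.
            totals[i] += sq_dist(vec, neg_centroid) - sq_dist(vec, pos_centroid)
            counts[i] += 1
    return [totals[i] / counts[i] if counts[i] else None
            for i in range(len(unl_vecs))]


if __name__ == "__main__":
    # Made-up toy sequences, for illustration only.
    toy_positives = ["CCKCCRCC", "CKCCGCCK", "CCRCKCCC"]
    toy_unlabeled = ["CCKCCKCC", "AAAALLLL", "GGSGGSGG",
                     "LLVVAALL", "EEDDEEDD", "CRCCKCCC"]
    for seq, score in zip(toy_unlabeled, pu_bagging_scores(toy_positives, toy_unlabeled)):
        print(seq, score)
```

Averaging only out-of-bag scores keeps a sequence from being scored by a model that used it as a provisional negative. That matters in the PU setting because some unlabeled sequences may in fact be positives.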
Pages: 14
Related papers
50 in total
  • [31] Cell Penetrating Peptide: Sequence-Based Computational Prediction for Intercellular Delivery of Arginine Deiminase
    Zarei, Mahboubeh
    Rahbar, Mohammad Reza
    Negahdaripour, Manica
    Morowvat, Mohammad Hossein
    Nezafat, Navid
    Younes, Ghasemi
    CURRENT PROTEOMICS, 2020, 17 (02) : 117 - 131
  • [32] SETE: Sequence-based Ensemble learning approach for TCR Epitope binding prediction
    Tong, Yao
    Wang, Jiayin
    Zheng, Tian
    Zhang, Xuanping
    Xiao, Xiao
    Zhu, Xiaoyan
    Lai, Xin
    Liu, Xiang
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
  • [33] Sequence-based bacterial small RNAs prediction using ensemble learning strategies
    Guifeng Tang
    Jingwen Shi
    Wenjian Wu
    Xiang Yue
    Wen Zhang
    BMC Bioinformatics, 19
  • [34] Sequence-Based Deep Learning Frameworks on Enhancer-Promoter Interactions Prediction
    Min, Xiaoping
    Lu, Fengqing
    Li, Chunyan
    CURRENT PHARMACEUTICAL DESIGN, 2021, 27 (15) : 1847 - 1855
  • [35] Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach
    Nedyalkova, Miroslava
    Vasighi, Mahdi
    Azmoon, Amirreza
    Naneva, Ludmila
    Simeonov, Vasil
    ACS OMEGA, 2023, : 3698 - 3704
  • [36] Sequence-based bacterial small RNAs prediction using ensemble learning strategies
    Tang, Guifeng
    Shi, Jingwen
    Wu, Wenjian
    Yue, Xiang
    Zhang, Wen
    BMC BIOINFORMATICS, 2018, 19
  • [37] A Novel Sequence-Based Method for Phosphorylation Site Prediction with Feature Selection and Analysis
    He, Zhi-Song
    Shi, Xiao-He
    Kong, Xiang-Ying
    Zhu, Yu-Bei
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (01) : 70 - 78
  • [38] SCPSSMpred: A general sequence-based method for ligand-binding site prediction
    Fang, Chun
    Noguchi, Tamotsu
    Yamana, Hayato
    IPSJ Transactions on Bioinformatics, 2013, 6 : 35 - 42
  • [39] De novo sequence-based method for ncRPI prediction using structural information
    Leone, Michele
    Galvani, Marta
    Masseroli, Marco
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 146 - 151
  • [40] Multi-Sensor Data Annotation Using Sequence-based Active Learning
    Denzler, Patrick
    Ziegler, Markus
    Jacobs, Arne
    Eiselein, Volker
    Neumaier, Philipp
    Koeppel, Martin
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 258 - 263