Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method

被引:1
|
作者
Chu, Yanyan [1 ,2 ,3 ]
Zhang, Huanhuan [1 ]
Zhang, Lei [3 ]
机构
[1] Ocean Univ China, Sch Med & Pharm, Qingdao 266003, Peoples R China
[2] Pilot Natl Lab Marine Sci & Technol Qingdao, Qingdao 266200, Peoples R China
[3] Ocean Univ China, Marine Biomed Res Inst Qingdao, Qingdao 266003, Peoples R China
基金
中国国家自然科学基金;
关键词
peptide toxin; active peptide; function prediction; PU learning; sequence-based; VENOM PEPTIDES; CHANNEL; THERAPEUTICS; CLASSIFIER; ZICONOTIDE; REVEAL; DESIGN;
D O I
10.3390/toxins14110811
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabeled (PU) learning schemes, adaptive basis classifier, two-step method, and PU bagging were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning
    Hu, Jun
    Zhou, Xiao-Gen
    Zhu, Yi-Heng
    Yu, Dong-Jun
    Zhang, Gui-Jun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1419 - 1429
  • [42] Prediction of peptide drift time in ion mobility mass spectrometry from sequence-based features
    Bing Wang
    Jun Zhang
    Peng Chen
    Zhiwei Ji
    Shuping Deng
    Chi Li
    BMC Bioinformatics, 14
  • [43] Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine
    Taherzadeh, Ghazaleh
    Yang, Yuedong
    Zhang, Tuo
    Liew, Alan Wee-Chung
    Zhou, Yaoqi
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) : 1223 - 1229
  • [44] Prediction of peptide drift time in ion mobility mass spectrometry from sequence-based features
    Wang, Bing
    Zhang, Jun
    Chen, Peng
    Ji, Zhiwei
    Deng, Shuping
    Li, Chi
    BMC BIOINFORMATICS, 2013, 14
  • [45] A deep learning framework for sequence-based bacteria type IV secreted effectors prediction
    Xue, Li
    Tang, Bin
    Chen, Wei
    Luo, Jiesi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 183 : 134 - 139
  • [46] SEQUENCE-BASED PREDICTION OF PROTEIN B-FACTORS WITH DUAL SUPERVIZED LEARNING MODELS
    Liu, Yu-Cheng
    Hwang, Shien-Ching
    Huang, Yu-Feng
    Lin, Win-Li
    Oyang, Yen-Jen
    Huang, Chien-Kang
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2010, 22 (05): : 385 - 391
  • [47] Promoter analysis and prediction in the human genome using sequence-based deep learning models
    Umarov, Ramzan
    Kuwahara, Hiroyuki
    Li, Yu
    Gao, Xin
    Solovyev, Victor
    BIOINFORMATICS, 2019, 35 (16) : 2730 - 2737
  • [48] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Sun, Tanlin
    Zhou, Bo
    Lai, Luhua
    Pei, Jianfeng
    BMC BIOINFORMATICS, 2017, 18
  • [49] Ten quick tips for sequence-based prediction of protein properties using machine learning
    Hou, Qingzhen
    Waury, Katharina
    Gogishvili, Dea
    Feenstra, K. Anton
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [50] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18