Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties

被引:32
|
作者
Huang, Kai-Yao [1 ,2 ]
Tseng, Yi-Jhan [1 ]
Kao, Hui-Ju [1 ]
Chen, Chia-Hung [1 ]
Yang, Hsiao-Hsiang [1 ]
Weng, Shun-Long [2 ,3 ,4 ]
机构
[1] Hsinchu Mackay Mem Hosp, Dept Med Res, Hsinchu 300, Taiwan
[2] Mackay Med Coll, Dept Med, New Taipei 252, Taiwan
[3] Hsinchu Mackay Mem Hosp, Dept Obstet & Gynecol, Hsinchu 300, Taiwan
[4] Mackay Jr Coll Med, Med Nursing & Management Coll, Taipei 112, Taiwan
关键词
AMINO-ACID-COMPOSITION; HOST-DEFENSE PEPTIDES; ANTIMICROBIAL PEPTIDES; DRUG-RESISTANCE; CANCER-CELLS; PROTEIN; PAIRS;
D O I
10.1038/s41598-021-93124-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Anticancer peptides (ACPs) are a kind of bioactive peptides which could be used as a novel type of anticancer drug that has several advantages over chemistry-based drug, including high specificity, strong tumor penetration capacity, and low toxicity to normal cells. As the number of experimentally verified bioactive peptides has increased significantly, various of in silico approaches are imperative for investigating the characteristics of ACPs. However, the lack of methods for investigating the differences in physicochemical properties of ACPs. In this study, we compared the N- and C-terminal amino acid composition for each peptide, there are three major subtypes of ACPs that are defined based on the distribution of positively charged residues. For the first time, we were motivated to develop a two-step machine learning model for identification of the subtypes of ACPs, which classify the input data into the corresponding group before applying the classifier. Further, to improve the predictive power, the hybrid feature sets were considered for prediction. Evaluation by five-fold cross-validation showed that the two-step model trained with sequence-based features and physicochemical properties was most effective in discriminating between ACPs and non-ACPs. The two-step model trained with the hybrid features performed well, with a sensitivity of 86.75%, a specificity of 85.75%, an accuracy of 86.08%, and a Matthews Correlation Coefficient value of 0.703. Furthermore, the model also consistently provides the effective performance in independent testing set, with sensitivity of 77.6%, specificity of 94.74%, accuracy of 88.99% and the MCC value reached 0.75. Finally, the two-step model has been implemented as a web-based tool, namely iDACP, which is now freely available at http://mer.hc.mmh.org.tw/iDACP/.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties
    Kai-Yao Huang
    Yi-Jhan Tseng
    Hui-Ju Kao
    Chia-Hung Chen
    Hsiao-Hsiang Yang
    Shun-Long Weng
    Scientific Reports, 11
  • [2] Identification of amyloidogenic peptides via optimized integrated features space based on physicochemical properties and PSSM
    Zhou, Cong
    Liu, Sanyang
    Zhang, Shengli
    ANALYTICAL BIOCHEMISTRY, 2019, 583
  • [3] Identification of anticancer peptides based on Random Relevance Vector Machines
    Zhao, Tianyi
    Liang, Cheng
    Zang, Tianyi
    Hu, Yang
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1822 - 1827
  • [4] Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
    Deng, Yiting
    Ma, Shuhan
    Li, Jiayu
    Zheng, Bowen
    Lv, Zhibin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (13)
  • [5] Computational Identification of piRNAs Using Features Based on RNA Sequence, Structure, Thermodynamic and Physicochemical Properties
    Monga, Isha
    Banerjee, Indranil
    CURRENT GENOMICS, 2019, 20 (07) : 508 - 518
  • [6] PHYSICOCHEMICAL PROPERTIES OF PEPTIDES AND THEIR SOLUTIONS
    BADELIN, VG
    KULIKOV, OV
    VATAGIN, VS
    UDZIG, E
    ZIELENKIEWICZ, A
    ZIELENKIEWICZ, W
    KRESTOV, GA
    THERMOCHIMICA ACTA, 1990, 169 : 81 - 93
  • [7] DRACP: a novel method for identification of anticancer peptides
    Tianyi Zhao
    Yang Hu
    Tianyi Zang
    BMC Bioinformatics, 21
  • [8] DRACP: a novel method for identification of anticancer peptides
    Zhao, Tianyi
    Hu, Yang
    Zang, Tianyi
    BMC BIOINFORMATICS, 2020, 21 (Suppl 16)
  • [9] Effective identification and differential analysis of anticancer peptides
    Zhang, Lichao
    Hu, Xueli
    Xiao, Kang
    Kong, Liang
    BIOSYSTEMS, 2024, 241
  • [10] A Computational Predictor for Accurate Identification of Tumor Homing Peptides by Integrating Sequential and Deep BiLSTM Features
    Arif, Roha
    Kanwal, Sameera
    Ahmed, Saeed
    Kabir, Muhammad
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (02) : 503 - 518