Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties

Cited by: 32
Authors
Huang, Kai-Yao [1, 2]
Tseng, Yi-Jhan [1]
Kao, Hui-Ju [1]
Chen, Chia-Hung [1]
Yang, Hsiao-Hsiang [1]
Weng, Shun-Long [2, 3, 4]
Affiliations
[1] Hsinchu Mackay Mem Hosp, Dept Med Res, Hsinchu 300, Taiwan
[2] Mackay Med Coll, Dept Med, New Taipei 252, Taiwan
[3] Hsinchu Mackay Mem Hosp, Dept Obstet & Gynecol, Hsinchu 300, Taiwan
[4] Mackay Jr Coll Med, Med Nursing & Management Coll, Taipei 112, Taiwan
Keywords
AMINO-ACID-COMPOSITION; HOST-DEFENSE PEPTIDES; ANTIMICROBIAL PEPTIDES; DRUG-RESISTANCE; CANCER-CELLS; PROTEIN; PAIRS
DOI
10.1038/s41598-021-93124-9
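Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification codes
07; 0710; 09
Abstract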
Anticancer peptides (ACPs) are bioactive peptides that could serve as a novel type of anticancer drug with several advantages over chemistry-based drugs, including high specificity, strong tumor penetration capacity, and low toxicity to normal cells. As the number of experimentally verified bioactive peptides has increased significantly, in silico approaches have become imperative for investigating the characteristics of ACPs. However, methods for investigating the differences in physicochemical properties of ACPs are lacking. In this study, we compared the N- and C-terminal amino acid composition of each peptide and defined three major subtypes of ACPs based on the distribution of positively charged residues. We then developed, for the first time, a two-step machine learning model for identifying the subtypes of ACPs, which assigns the input data to the corresponding group before applying the classifier. To further improve predictive power, hybrid feature sets were considered. Evaluation by five-fold cross-validation showed that the two-step model trained with sequence-based features and physicochemical properties was the most effective in discriminating between ACPs and non-ACPs. The two-step model trained with the hybrid features performed well, with a sensitivity of 86.75%, a specificity of 85.75%, an accuracy of 86.08%, and a Matthews correlation coefficient (MCC) of 0.703. The model also performed consistently on an independent testing set, with a sensitivity of 77.6%, a specificity of 94.74%, an accuracy of 88.99%, and an MCC of 0.75. Finally, the two-step model has been implemented as a web-based tool, iDACP, which is freely available at http://mer.hc.mmh.org.tw/iDACP/.
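The two-step idea and the reported evaluation metrics can be illustrated with a minimal Python sketch. This is not the iDACP implementation: the grouping rule, the window length of 5, the choice of K, R, and H as positively charged residues, the group names, and all function names are assumptions made for illustration, since the abstract does not specify them. Only the metric formulas (sensitivity, specificity, accuracy, MCC) are standard definitions.

```python
import math

# Assumption: Lys (K), Arg (R), and His (H) are treated as positively charged.
POSITIVE = set("KRH")


def terminal_positive_counts(seq, window=5):
    """Count positively charged residues in the N- and C-terminal windows."""
    n_term, c_term = seq[:window], seq[-window:]
    n_pos = sum(residue in POSITIVE for residue in n_term)
    c_pos = sum(residue in POSITIVE for residue in c_term)
    return n_pos, c_pos


def assign_group(seq, window=5):
    """Step 1 (hypothetical rule): group a peptide by where its
    positive charge is concentrated."""
    n_pos, c_pos = terminal_positive_counts(seq, window)
    if n_pos > c_pos:
        return "N-rich"
    if c_pos > n_pos:
        return "C-rich"
    return "balanced"


def two_step_predict(seq, classifiers, window=5):
    """Step 2: apply the classifier that belongs to the assigned group.

    `classifiers` maps each group name to a callable taking a sequence
    and returning a prediction (hypothetical interface).
    """
    return classifiers[assign_group(seq, window)](seq)


def binary_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity, accuracy, and MCC from confusion counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when the denominator vanishes.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }
```

In a real pipeline, each entry in `classifiers` would be a model trained only on peptides of that group, using the hybrid sequence-based and physicochemical features the abstract mentions. Routing before classification lets each model specialize on one charge-distribution pattern.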
Pages: 13