A Transfer-Learning-Based Deep Convolutional Neural Network for Predicting Leukemia-Related Phosphorylation Sites from Protein Primary Sequences

被引:5
|
作者
He, Jian [1 ]
Wu, Yanling [1 ]
Pu, Xuemei [1 ]
Li, Menglong [1 ]
Guo, Yanzhi [1 ]
机构
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
基金
中国国家自然科学基金;
关键词
leukemia; protein phosphorylation site; protein primary sequences; machine-learning; deep-learning; transfer-learning; BACTERIAL; MODEL; LOGO;
D O I
10.3390/ijms23031741
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As one of the most important post-translational modifications (PTMs), phosphorylation refers to the binding of a phosphate group with amino acid residues like Ser (S), Thr (T) and Tyr (Y) thus resulting in diverse functions at the molecular level. Abnormal phosphorylation has been proved to be closely related with human diseases. To our knowledge, no research has been reported describing specific disease-associated phosphorylation sites prediction which is of great significance for comprehensive understanding of disease mechanism. In this work, focusing on three types of leukemia, we aim to develop a reliable leukemia-related phosphorylation site prediction models by combing deep convolutional neural network (CNN) with transfer-learning. CNN could automatically discover complex representations of phosphorylation patterns from the raw sequences, and hence it provides a powerful tool for improvement of leukemia-related phosphorylation site prediction. With the largest dataset of myelogenous leukemia, the optimal models for S/T/Y phosphorylation sites give the AUC values of 0.8784, 0.8328 and 0.7716 respectively. When transferred learning on the small size datasets, the models for T-cell and lymphoid leukemia also give the promising performance by common sharing the optimal parameters. Compared with other five machine-learning methods, our CNN models reveal the superior performance. Finally, the leukemia-related pathogenesis analysis and distribution analysis on phosphorylated proteins along with K-means clustering analysis and position-specific conversation profiles on the phosphorylation site all indicate the strong practical feasibility of our easy-to-use CNN models.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep convolutional neural networks for predicting leukemia-related transcription factor binding sites from DNA sequence data
    He, Jian
    Pu, Xuemei
    Li, Menglong
    Li, Chuan
    Guo, Yanzhi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 199
  • [2] Predicting Protein Phosphorylation Sites Based on Deep Learning
    Long, Haixia
    Sun, Zhao
    Li, Manzhi
    Fu, Hai Yan
    Lin, Ming Cai
    CURRENT BIOINFORMATICS, 2020, 15 (04) : 300 - 308
  • [3] A Novel Prediction Method for ATP-Binding Sites From Protein Primary Sequences Based on Fusion of Deep Convolutional Neural Network and Ensemble Learning
    Song, Jiazhi
    Liang, Yanchun
    Liu, Guixia
    Wang, Rongquan
    Sun, Liyan
    Zhang, Ping
    IEEE ACCESS, 2020, 8 : 21485 - 21495
  • [4] DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites
    Zhang, Jidong
    Liu, Bo
    Wang, Zhihan
    Lehnert, Klaus
    Gahegan, Mark
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [5] DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites
    Jidong Zhang
    Bo Liu
    Zhihan Wang
    Klaus Lehnert
    Mark Gahegan
    BMC Bioinformatics, 23
  • [6] Predicting protein-peptide binding sites with a deep convolutional neural network
    Wardah, Wafaa
    Dehzangi, Abdollah
    Taherzadeh, Ghazaleh
    Rashid, Mahmood A.
    Khan, M. G. M.
    Tsunoda, Tatsuhiko
    Sharma, Alok
    JOURNAL OF THEORETICAL BIOLOGY, 2020, 496
  • [7] A novel graph convolutional neural network for predicting interaction sites on protein kinase inhibitors in phosphorylation
    Wang, Feiqi
    Chen, Yun-Ti
    Yang, Jinn-Moon
    Akutsu, Tatsuya
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [8] A novel graph convolutional neural network for predicting interaction sites on protein kinase inhibitors in phosphorylation
    Feiqi Wang
    Yun-Ti Chen
    Jinn-Moon Yang
    Tatsuya Akutsu
    Scientific Reports, 12
  • [9] DeepMPSF: A Deep Learning Network for Predicting General Protein Phosphorylation Sites Based on Multiple Protein Sequence Features
    Xie, Jingxin
    Quan, Lijun
    Wang, Xuejiao
    Wu, Hongjie
    Jin, Zhi
    Pan, Deng
    Chen, Taoning
    Wu, Tingfang
    Lyu, Qiang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (22) : 7258 - 7271
  • [10] Fitness Movement Types and Completeness Detection Using a Transfer-Learning-Based Deep Neural Network
    Chen, Kuan-Yu
    Shin, Jungpil
    Hasan, Md Al Mehedi
    Liaw, Jiun-Jian
    Yuichi, Okuyama
    Tomioka, Yoichi
    SENSORS, 2022, 22 (15)