DisPredict3.0: Prediction of intrinsically disordered regions/ proteins using protein language model

被引:1
|
作者
UI Kabir, Md Wasi [1 ]
Hoque, Md Tamjidul [1 ]
机构
[1] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
基金
美国国家卫生研究院;
关键词
Protein language models; Intrinsically disordered proteins; Predict disordered protein; Machine learning; ACCURATE; DATABASE; DISPROT;
D O I
10.1016/j.amc.2024.128630
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Intrinsically disordered proteins (IDPs) or protein regions (IDRs) do not have a stable threedimensional structure, even though they exhibit important biological functions. They are structurally and functionally very different from ordered proteins and can cause many critical diseases. Accurate identification of disordered proteins/regions significantly impacts fields such as drug design, protein engineering, protein design, and related research. However, experimental identification of IDRs is complex and time-consuming, necessitating the development of an accurate and efficient computational method. The recent development of deep learning methods for protein language models shows the ability to learn evolutionary information from billions of protein sequences. This motivates us to develop a computational method, named DisPredict3.0, to predict proteins' disordered regions (IDRs) using evolutionary information from a protein language model. Compared to the state-of-the-art method in the CAID (2018) assessment, DisPredict3.0 has an improvement of 2.51 %, 16.13 %, 17.98 %, and 11.94 % in terms of AUC, F1score, MCC, and kappa, respectively. In addition, in the CAID-2 assessment (2022), DisPredict3.0 shows promising results and is ranked first for disorder residue prediction on the Disorder-NOX dataset. The DisPredict3.0 webserver is available at https://bmll.cs.uno.edu.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] EPR in Protein Science Intrinsically Disordered Proteins
    Drescher, Malte
    EPR SPECTROSCOPY: APPLICATIONS IN CHEMISTRY AND BIOLOGY, 2012, 321 : 91 - 119
  • [42] Intrinsically disordered proteins: dancing protein clouds
    Uversky, V. N.
    FEBS JOURNAL, 2014, 281 : 63 - 63
  • [43] Accurate and Fast Prediction of Intrinsically Disordered Protein by Multiple Protein Language Models and Ensemble Learning
    Xu, Shijie
    Onoda, Akira
    Journal of Chemical Information and Modeling, 2024, 64 (07) : 2901 - 2911
  • [44] Accurate and Fast Prediction of Intrinsically Disordered Protein by Multiple Protein Language Models and Ensemble Learning
    Xu, Shijie
    Onoda, Akira
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 64 (07) : 2901 - 2911
  • [45] Fuzzy Filtering in Large-Scale Prediction of Intrinsically Disordered Regions of Proteins on Apache Spark
    Malysiak-Mrozek, Bozena
    Bozek, Lukasz
    Mrozek, Dariusz
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 1020 - 1027
  • [46] Identification of Disordered Regions of Intrinsically Disordered Proteins by Multi-features Fusion
    Canzhuang, Sun
    Yonge, Feng
    CURRENT BIOINFORMATICS, 2021, 16 (09) : 1126 - 1132
  • [47] Intrinsically disordered protein regions at membrane contact sites
    Jamecna, Denisa
    Antonny, Bruno
    BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR AND CELL BIOLOGY OF LIPIDS, 2021, 1866 (11):
  • [48] INTRINSICALLY DISORDERED PROTEINS: ANALYSIS, PREDICTION, SIMULATION, AND BIOLOGY
    Chen, Jianhan
    Cheng, Jianlin
    Dunker, A. Keith
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2012, 2012, : 67 - 69
  • [49] Entropy and Information within Intrinsically Disordered Protein Regions
    Pritisanac, Iva
    Vernon, Robert M.
    Moses, Alan M.
    Kay, Julie D. Forman
    ENTROPY, 2019, 21 (07)
  • [50] Intrinsically disordered proteins: Analysis, prediction, simulation, and biology
    Chen, Jianhan
    Cheng, Jianlin
    Dunker, A. Keith
    Pacific Symposium on Biocomputing, 2012, : 67 - 69