Succinylation Site Prediction Based on Protein Sequences Using the IFS-LightGBM (BO) Model

被引:20
|
作者
Zhang, Lu [1 ]
Liu, Min [1 ]
Qin, Xinyi [1 ]
Liu, Guangzhong [1 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
基金
上海市自然科学基金;
关键词
LYSINE SUCCINYLATION; POSTTRANSLATIONAL MODIFICATION; UBIQUITINATION SITES; IDENTIFICATION; EXPRESSION; PATTERNS; SIRT5; TOOL;
D O I
10.1155/2020/8858489
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Succinylation is an important posttranslational modification of proteins, which plays a key role in protein conformation regulation and cellular function control. Many studies have shown that succinylation modification on protein lysine residue is closely related to the occurrence of many diseases. To understand the mechanism of succinylation profoundly, it is necessary to identify succinylation sites in proteins accurately. In this study, we develop a new model, IFS-LightGBM (BO), which utilizes the incremental feature selection (IFS) method, the LightGBM feature selection method, the Bayesian optimization algorithm, and the LightGBM classifier, to predict succinylation sites in proteins. Specifically, pseudo amino acid composition (PseAAC), position-specific scoring matrix (PSSM), disorder status, and Composition of k-spaced Amino Acid Pairs (CKSAAP) are firstly employed to extract feature information. Then, utilizing the combination of the LightGBM feature selection method and the incremental feature selection (IFS) method selects the optimal feature subset for the LightGBM classifier. Finally, to increase prediction accuracy and reduce the computation load, the Bayesian optimization algorithm is used to optimize the parameters of the LightGBM classifier. The results reveal that the IFS-LightGBM (BO)-based prediction model performs better when it is evaluated by some common metrics, such as accuracy, recall, precision, Matthews Correlation Coefficient (MCC), and F-measure.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] An Interpretable Depression Prediction Model for the Elderly Based on ISSA Optimized LightGBM
    Jie Wang
    Zitong Wang
    Jinze Li
    Yan Peng
    JournalofBeijingInstituteofTechnology, 2023, 32 (02) : 168 - 180
  • [22] Stock Trend Prediction Based on ARIMA-LightGBM Hybrid Model
    Zheng, Xiuyan
    Cai, Jiajing
    Zhang, Guangfu
    2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 227 - 231
  • [23] CWSI Prediction Model of Greenhouse Tomato Canopy Based on LightGBM Algorithm
    Sun Q.
    Geng L.
    Zhao Q.
    Yang J.
    Lu P.
    Li L.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2022, 53 : 270 - 276and308
  • [24] Research on Vehicle Fault Prediction Scheme Based on Improved LightGBM Model
    Yan S.
    Zhu P.
    Liu Z.
    Qiche Gongcheng/Automotive Engineering, 2020, 42 (06): : 815 - 819and825
  • [25] Global prediction model for indoor temperature based on CFD and LightGBM algorithm
    Shi X.
    Tian W.
    Leng Z.
    Lu H.
    Shi, Xin (meetshixin@gmail.com), 1600, Science Press (42): : 237 - 247
  • [26] Construction of a financial default risk prediction model based on the LightGBM algorithm
    Gao, Bo
    Balyan, Vipin
    JOURNAL OF INTELLIGENT SYSTEMS, 2022, 31 (01) : 767 - 779
  • [27] An Interpretable Depression Prediction Model for the Elderly Based on ISSA Optimized LightGBM
    Wang J.
    Wang Z.
    Li J.
    Peng Y.
    Journal of Beijing Institute of Technology (English Edition), 2023, 32 (02): : 168 - 180
  • [28] Multi-modal features-based human-herpesvirus protein-protein interaction prediction by using LightGBM
    Yang, Xiaodi
    Wuchty, Stefan
    Liang, Zeyin
    Ji, Li
    Wang, Bingjie
    Zhu, Jialin
    Zhang, Ziding
    Dong, Yujun
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [29] A novel approach to protein chemical shift prediction from sequences using a protein language model
    Zhu, He
    Hu, Lingyue
    Yang, Yu
    Chen, Zhong
    DIGITAL DISCOVERY, 2025, 4 (02): : 331 - 337
  • [30] A protein succinylation sites prediction method based on the hybrid architecture of LSTM network and CNN
    Zhang, Die
    Wang, Shunfang
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2022, 20 (02)