Succinylation Site Prediction Based on Protein Sequences Using the IFS-LightGBM (BO) Model

被引:20
|
作者
Zhang, Lu [1 ]
Liu, Min [1 ]
Qin, Xinyi [1 ]
Liu, Guangzhong [1 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
基金
上海市自然科学基金;
关键词
LYSINE SUCCINYLATION; POSTTRANSLATIONAL MODIFICATION; UBIQUITINATION SITES; IDENTIFICATION; EXPRESSION; PATTERNS; SIRT5; TOOL;
D O I
10.1155/2020/8858489
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Succinylation is an important posttranslational modification of proteins, which plays a key role in protein conformation regulation and cellular function control. Many studies have shown that succinylation modification on protein lysine residue is closely related to the occurrence of many diseases. To understand the mechanism of succinylation profoundly, it is necessary to identify succinylation sites in proteins accurately. In this study, we develop a new model, IFS-LightGBM (BO), which utilizes the incremental feature selection (IFS) method, the LightGBM feature selection method, the Bayesian optimization algorithm, and the LightGBM classifier, to predict succinylation sites in proteins. Specifically, pseudo amino acid composition (PseAAC), position-specific scoring matrix (PSSM), disorder status, and Composition of k-spaced Amino Acid Pairs (CKSAAP) are firstly employed to extract feature information. Then, utilizing the combination of the LightGBM feature selection method and the incremental feature selection (IFS) method selects the optimal feature subset for the LightGBM classifier. Finally, to increase prediction accuracy and reduce the computation load, the Bayesian optimization algorithm is used to optimize the parameters of the LightGBM classifier. The results reveal that the IFS-LightGBM (BO)-based prediction model performs better when it is evaluated by some common metrics, such as accuracy, recall, precision, Matthews Correlation Coefficient (MCC), and F-measure.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Spoofing attack recognition for GNSS-based train positioning using a BO-LightGBM method
    Bi, Jiaqi
    Liu, Jiang
    Cai, Baigen
    Wang, Jian
    SCIENCE PROGRESS, 2024, 107 (04)
  • [32] DegSampler3: Pairwise Dependency Model in Degradation Motif Site Prediction of Substrate Protein Sequences
    Maruyama, Osamu
    Matsuzaki, Fumiko
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 11 - 17
  • [33] Lidom: A Disease Risk Prediction Model Based on LightGBM Applied to Nursing Homes
    Zhou, Feng
    Hu, Shijing
    Du, Xin
    Wan, Xiaoli
    Lu, Zhihui
    Wu, Jie
    ELECTRONICS, 2023, 12 (04)
  • [34] Prediction of air quality index based on the SSA-BiLSTM-LightGBM model
    Zhang, Xiaowen
    Jiang, Xuchu
    Li, Ying
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [35] LightGBM Low-Temperature Prediction Model Based on LassoCV Feature Selection
    Duan, Shangqi
    Huang, Shuangde
    Bu, Wei
    Ge, Xingke
    Chen, Haidong
    Liu, Jing
    Luo, Jiqiang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [36] Prediction of air quality index based on the SSA-BiLSTM-LightGBM model
    Xiaowen Zhang
    Xuchu Jiang
    Ying Li
    Scientific Reports, 13
  • [37] Asymmetric Risk Injection Molding Product Size Prediction Model Based on LightGBM
    Liu Y.
    Tang X.
    Zhong J.
    Zhong Z.
    Zhou X.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (08): : 965 - 969
  • [38] Prediction of protein crotonylation sites through LightGBM classifier based on SMOTE and elastic net
    Liu, Yaning
    Yu, Zhaomin
    Chen, Cheng
    Han, Yu
    Yu, Bin
    ANALYTICAL BIOCHEMISTRY, 2020, 609
  • [39] Network security situation prediction method based on IFS-NARX model
    Han X.-L.
    Liu Y.
    Zhang Z.-J.
    Lyu X.
    Li Y.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2019, 49 (02): : 592 - 598
  • [40] PLMC: Language Model of Protein Sequences Enhances Protein Crystallization Prediction
    Xiong, Dapeng
    Kaicheng, U.
    Sun, Jianfeng
    Cribbs, Adam P.
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (04) : 802 - 813