Succinylation Site Prediction Based on Protein Sequences Using the IFS-LightGBM (BO) Model

Cited by: 20
Authors
Zhang, Lu [1]
Liu, Min [1]
Qin, Xinyi [1]
Liu, Guangzhong [1]
Affiliations
[1] Shanghai Maritime Univ, Coll Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
Funding
Natural Science Foundation of Shanghai
Keywords
LYSINE SUCCINYLATION; POSTTRANSLATIONAL MODIFICATION; UBIQUITINATION SITES; IDENTIFICATION; EXPRESSION; PATTERNS; SIRT5; TOOL;
DOI
10.1155/2020/8858489
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Succinylation is an important posttranslational modification of proteins that plays a key role in regulating protein conformation and controlling cellular function. Many studies have shown that succinylation of lysine residues is closely related to the occurrence of many diseases. To understand the mechanism of succinylation in depth, succinylation sites in proteins must be identified accurately. In this study, we develop a new model, IFS-LightGBM (BO), which combines the incremental feature selection (IFS) method, the LightGBM feature selection method, the Bayesian optimization algorithm, and the LightGBM classifier to predict succinylation sites in proteins. Specifically, pseudo amino acid composition (PseAAC), position-specific scoring matrix (PSSM), disorder status, and composition of k-spaced amino acid pairs (CKSAAP) are first employed to extract feature information. Then, the LightGBM feature selection method is combined with IFS to select the optimal feature subset for the LightGBM classifier. Finally, to increase prediction accuracy and reduce the computational load, the Bayesian optimization algorithm is used to tune the parameters of the LightGBM classifier. The results show that the IFS-LightGBM (BO)-based prediction model performs better when evaluated with common metrics such as accuracy, recall, precision, Matthews correlation coefficient (MCC), and F-measure.
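Of the four feature types named in the abstract, CKSAAP is the simplest to illustrate. The following is a minimal sketch of the general CKSAAP encoding, not the authors' implementation: the function name `cksaap`, the default `k_max=2`, the normalization by the number of residue pairs at each spacing, and the choice to skip pairs containing non-standard residues are all assumptions made for illustration.

```python
from itertools import product

# The 20 standard amino acids and all 400 ordered residue pairs.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def cksaap(sequence, k_max=2):
    """Return CKSAAP features for spacings k = 0..k_max (400 values per k).

    Hypothetical helper for illustration; the normalization is an assumption.
    """
    sequence = sequence.upper()
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        # Number of residue pairs separated by exactly k other residues.
        n_windows = len(sequence) - k - 1
        for i in range(max(n_windows, 0)):
            pair = sequence[i] + sequence[i + k + 1]
            # Pairs with non-standard residues (e.g. padding 'X') are skipped.
            if pair in counts:
                counts[pair] += 1
        denom = n_windows if n_windows > 0 else 1
        features.extend(counts[p] / denom for p in PAIRS)
    return features


# Toy peptide fragment; real inputs would be windows centred on a lysine.
vec = cksaap("AKAKA", k_max=1)
```

Each spacing contributes a 400-dimensional block, so the vector length grows linearly with `k_max`. A high-dimensional encoding of this kind is the sort of input that a feature selection step, such as the one described in the abstract, would then reduce.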
Pages: 15
Related papers
50 in total
  • [41] A parallel model of DenseCNN and ordered-neuron LSTM for generic and species-specific succinylation site prediction
    Wang, Huiqing
    Zhao, Hong
    Zhang, Jing
    Han, Jiale
    Liu, Zhihao
    BIOTECHNOLOGY AND BIOENGINEERING, 2022, 119 (07) : 1755 - 1767
  • [42] Online performance prediction using the fusion model of LightGBM and TabNet for large laser facilities
    He, Zizhou
    Shen, Wenwen
    Li, Suicheng
    Luo, Jun
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [43] Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
    Amos-Binks, Adam
    Patulea, Catalin
    Pitre, Sylvain
    Schoenrock, Andrew
    Gui, Yuan
    Green, James R.
    Golshani, Ashkan
    Dehne, Frank
    BMC BIOINFORMATICS, 2011, 12
  • [44] DeepBindPPI: Protein–Protein Binding Site Prediction Using Attention Based Graph Convolutional Network
    Sharon Sunny
    Pebbeti Bhanu Prakash
    G. Gopakumar
    P. B. Jayaraj
    The Protein Journal, 2023, 42 : 276 - 287
  • [45] Stock complex networks based on the GA-LightGBM model: The prediction of firm performance
    Huang, Chuangxia
    Cai, Yaqian
    Cao, Jinde
    Deng, Yanchen
    INFORMATION SCIENCES, 2025, 700
  • [46] Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences
    Adam Amos-Binks
    Catalin Patulea
    Sylvain Pitre
    Andrew Schoenrock
    Yuan Gui
    James R Green
    Ashkan Golshani
    Frank Dehne
    BMC Bioinformatics, 12
  • [47] Deep Splicer: A CNN Model for Splice Site Prediction in Genetic Sequences
    Fernandez-Castillo, Elisa
    Ibeth Barbosa-Santillan, Liliana
    Falcon-Morales, Luis
    Jaime Sanchez-Escobar, Juan
    GENES, 2022, 13 (05)
  • [48] Prediction of Protein Acetylation Sites using Kernel Naive Bayes Classifier Based on Protein Sequences Profiling
    Ahmed, Md. Shakil
    Shahjaman, Md.
    Kabir, Enamul
    Kamruzzaman, Md.
    BIOINFORMATION, 2018, 14 (05) : 213 - 218
  • [49] A Combined Prediction Model for Hog Futures Prices Based on WOA-LightGBM-CEEMDAN
    Wang, Xiang
    Gao, Shen
    Guo, Yibin
    Zhou, Shiyu
    Duan, Yonghui
    Wu, Daqing
    COMPLEXITY, 2022, 2022
  • [50] LightBBB: computational prediction model of blood-brain-barrier penetration based on LightGBM
    Shaker, Bilal
    Yu, Myeong-Sang
    Song, Jin Sook
    Ahn, Sunjoo
    Ryu, Jae Yong
    Oh, Kwang-Seok
    Na, Dokyun
    BIOINFORMATICS, 2021, 37 (08) : 1135 - 1139