XGBFEMF: An XGBoost-Based Framework for Essential Protein Prediction

被引:101
|
作者
Zhong, Jiancheng [1 ]
Sun, Yusui [1 ]
Peng, Wei [2 ]
Xie, Minzhu [1 ]
Yang, Jiahong [1 ]
Tang, Xiwei [3 ]
机构
[1] Hunan Normal Univ, Sch Informat Sci & Engn, Changsha 410081, Hunan, Peoples R China
[2] Kunming Univ Sci & Technol, Comp Ctr, Kunming 650050, Yunnan, Peoples R China
[3] Hunan First Normal Univ, Dept Informat Sci & Engn, Changsha 410205, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential protein; feature engineering; multi-model fusion; XGBoost; SUB-EXPAND-SHRINK; XGBFEMF; ESSENTIAL GENES; SUBCELLULAR-LOCALIZATION; CENTRALITY; NETWORKS; DATABASE; GENOME; IDENTIFICATION; BETWEENNESS;
D O I
10.1109/TNB.2018.2842219
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Essential proteins as a vital part of maintaining the cells' life play an important role in the study of biology and drug design. With the generation of large amounts of biological data related to essential proteins, an increasing number of computational methods have been proposed. Different from the methods which adopt a single machine learning method or an ensemble machine learning method, this paper proposes a predicting framework named by XGBFEMF for identifying essential proteins, which includes a SUB-EXPAND-SHRINK method for constructing the composite features with original features and obtaining the better subset of features for essential protein prediction, and also includes a model fusion method for getting a more effective prediction model. We carry out experiments on Yeast data to assess the performance of the XGBFEMF with ROC analysis, accuracy analysis, and top analysis. Meanwhile, we set up experiments on E. coli data for the validation of performance. The test results show that the XGBFEMF framework can effectively improve many essential indicators. In addition, we analyze each step in the XGBFEMF framework; our results show that both each step of the SUB-EXPAND-SHRINK method as well as the step of multi-model fusion can improve prediction performance.
引用
收藏
页码:243 / 250
页数:8
相关论文
共 50 条
  • [41] Data-driven XGBoost-based filter for target tracking
    Zhai, Bowen
    Yi, Wei
    Li, Ming
    Ju, Hao
    Kong, Lingjiang
    JOURNAL OF ENGINEERING-JOE, 2019, 2019 (20): : 6683 - 6687
  • [42] A CEEMDAN and XGBOOST-Based Approach to Forecast Crude Oil Prices
    Zhou, Yingrui
    Li, Taiyong
    Shi, Jiayi
    Qian, Zijie
    COMPLEXITY, 2019, 2019
  • [43] XGBoost-based model for predicting hydrogen content in electroslag remelting
    Yu-xiao Liu
    Yan-wu Dong
    Zhou-hua Jiang
    Yu-shuo Li
    Wei Zha
    Yao-xin Du
    Shu-yang Du
    Journal of Iron and Steel Research International, 2023, 30 : 887 - 896
  • [44] Speech-Based Parkinson’s Disease Prediction Using XGBoost-Based Features Selection and the Stacked Ensemble of Classifiers
    Karan B.
    Journal of The Institution of Engineers (India): Series B, 2023, 104 (02) : 475 - 483
  • [45] XGBoost-Based Algorithm Interpretation and Application on Post-Fault Transient Stability Status Prediction of Power System
    Chen, Minghua
    Liu, Qunying
    Chen, Shuheng
    Liu, Yicen
    Zhang, Chang-Hua
    Liu, Ruihua
    IEEE ACCESS, 2019, 7 : 13149 - 13158
  • [46] Enhanced XGBoost-Based Automatic Diagnosis System for Chronic Kidney Disease
    Ogunleye, Adeola
    Wang, Qing-Guo
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 805 - 810
  • [47] Pipeline Stress Test Simulation Under Freeze-Thaw Cycling via the XGBoost-Based Prediction Model
    Teng, Zhen-Chao
    Teng, Yun-Chao
    Li, Bo
    Liu, Xiao-Yan
    Liu, Yu
    Zhou, Ya-Dong
    FRONTIERS IN EARTH SCIENCE, 2022, 10
  • [48] XGBoost-Based Intelligent Decision Making of HVDC System with Knowledge Graph
    Li, Qiang
    Chen, Qian
    Wu, Jiyang
    Qiu, Youqiang
    Zhang, Changhong
    Huang, Yilong
    Guo, Jianbao
    Yang, Bo
    ENERGIES, 2023, 16 (05)
  • [49] Epileptic Seizure Detection in Clinical EEGs Using an XGboost-based Method
    Wei, L.
    Mooney, C.
    2020 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM, 2020,
  • [50] A XGBoost-Based Prediction Method for Meat Sheep Transport Stress Using Wearable Photoelectric Sensors and Infrared Thermometry
    Ma, Ruiqin
    Chen, Runqing
    Liang, Buwen
    Li, Xinxing
    SENSORS, 2024, 24 (23)