XGBFEMF: An XGBoost-Based Framework for Essential Protein Prediction

Cited by: 101
Authors
Zhong, Jiancheng [1]
Sun, Yusui [1]
Peng, Wei [2]
Xie, Minzhu [1]
Yang, Jiahong [1]
Tang, Xiwei [3]
Affiliations
[1] Hunan Normal Univ, Sch Informat Sci & Engn, Changsha 410081, Hunan, Peoples R China
[2] Kunming Univ Sci & Technol, Comp Ctr, Kunming 650050, Yunnan, Peoples R China
[3] Hunan First Normal Univ, Dept Informat Sci & Engn, Changsha 410205, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Essential protein; feature engineering; multi-model fusion; XGBoost; SUB-EXPAND-SHRINK; XGBFEMF; ESSENTIAL GENES; SUBCELLULAR-LOCALIZATION; CENTRALITY; NETWORKS; DATABASE; GENOME; IDENTIFICATION; BETWEENNESS;
DOI
10.1109/TNB.2018.2842219
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Essential proteins are vital to maintaining cellular life and play an important role in biological research and drug design. As large amounts of biological data related to essential proteins have become available, an increasing number of computational methods have been proposed. Unlike methods that adopt a single machine learning method or an ensemble machine learning method, this paper proposes a prediction framework named XGBFEMF for identifying essential proteins. The framework includes a SUB-EXPAND-SHRINK method, which constructs composite features from the original features and selects a better feature subset for essential protein prediction, and a model fusion method, which yields a more effective prediction model. We carry out experiments on yeast data to assess the performance of XGBFEMF with ROC analysis, accuracy analysis, and top analysis, and we run experiments on E. coli data to validate its performance. The test results show that the XGBFEMF framework effectively improves many important indicators. In addition, we analyze each step of the XGBFEMF framework; the results show that each step of the SUB-EXPAND-SHRINK method, as well as the multi-model fusion step, improves prediction performance.
Pages: 243-250
Page count: 8
Related papers
50 in total
  • [21] A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites
    Wang, Pan
    Zhang, Guiyang
    Yu, Zu-Guo
    Huang, Guohua
    FRONTIERS IN GENETICS, 2021, 12
  • [22] XGBoost-Based Travel Time Prediction between Bus Stations and Analysis of Influencing Factors
    Zhu, Lingxiang
    Shu, Sisi
    Zou, Liang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [23] An XGBoost-based multivariate deep learning framework for stock index futures price forecasting
    Wang, Jujie
    Cheng, Qian
    Dong, Ying
    KYBERNETES, 2023, 52 (10) : 4158 - 4177
  • [24] XGBoost-Based Instantaneous Drowsiness Detection Framework Using Multitaper Spectral Information of Electroencephalography
    Choi, Hyun-Soo
    Kim, Siwon
    Oh, Jung Eun
    Yoon, Jee Eun
    Park, Jung Ah
    Yun, Chang-Ho
    Yoon, Sungroh
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 111 - 121
  • [25] An XGBoost-Based Method for Improved Orbit Prediction With an Orbit-Separate Modeling Strategy
    Huang, Wenbin
    Tang, Rui
    Qu, Guangzhi
    Zhang, Feng
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (04) : 4887 - 4895
  • [26] XGBoost-based short-term prediction method for power system inertia and its interpretability
    Zhang, Lei
    Guo, Zhihao
    Tao, Qianhui
    Xiong, Zhizhi
    Ye, Jing
    ENERGY REPORTS, 2023, 9 : 1458 - 1469
  • [27] An investigation of XGBoost-based algorithm for breast cancer classification
    Liew, Xin Yu
    Hameed, Nazia
    Clos, Jeremie
    MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [28] XGBoost-based method for flash flood risk assessment
    Ma, Meihong
    Zhao, Gang
    He, Bingshun
    Li, Qing
    Dong, Haoyue
    Wang, Shenggang
    Wang, Zhongliang
    JOURNAL OF HYDROLOGY, 2021, 598
  • [29] iBLP: An XGBoost-Based Predictor for Identifying Bioluminescent Proteins
    Zhang, Dan
    Chen, Hua-Dong
    Zulfiqar, Hasan
    Yuan, Shi-Shi
    Huang, Qin-Lai
    Zhang, Zhao-Yue
    Deng, Ke-Jun
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021