A new sequence based encoding for prediction of host-pathogen protein interactions

Cited by: 13
Authors
Kosesoy, Irfan [1 ]
Gok, Murat [1 ]
Oz, Cemil [2 ]
Affiliations
[1] Yalova Univ, Dept Comp Engn, TR-77100 Merkez, Yalova, Turkey
[2] Sakarya Univ, Dept Comp & Informat Sci, TR-54050 Serdivan, Sakarya, Turkey
Keywords
Infectious diseases; Host-pathogen interactions; Protein-protein interactions; Protein networks; Machine learning; DATABASE;
DOI
10.1016/j.compbiolchem.2018.12.001
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Pathogen-host interactions are central to understanding the infection process at the molecular level, where pathogen proteins physically bind to human proteins to manipulate critical biological processes in the host cell. Data scarcity and data unavailability are two major problems for computational approaches to predicting pathogen-host interactions. A computational method that predicts these interactions with high accuracy from protein sequences alone is therefore of great importance, because it can eliminate both problems. In this study, we propose a novel and robust sequence-based feature extraction method, named Location Based Encoding, to predict pathogen-host interactions with machine learning algorithms. We use Bacillus anthracis and Yersinia pestis data sets as the pathogen organisms and human proteins as the host model, and compare our method with three sequence-based protein encoding methods that are widely used in the literature: amino acid composition, amino acid pair, and conjoint triad. We use these encoding methods with decision tree (Random Forest, J48), statistical (Bayesian Networks, Naive Bayes), and instance-based (kNN) classifiers to predict pathogen-host interactions. We conduct several experiments to evaluate the effectiveness of our method. Across all experiments, the Random Forest (RF) classifier gives the best results in terms of F1, accuracy, MCC, and AUC.
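As an illustration of the simplest baseline named in the abstract, amino acid composition can be sketched as the relative frequency of each of the 20 standard residues in a sequence. This is a minimal sketch and not the authors' code: the function names `amino_acid_composition` and `encode_pair` are hypothetical, and joining the pathogen and host vectors by concatenation is an assumption, since the abstract does not say how protein pairs are combined. Location Based Encoding itself is not shown, because the abstract does not define it.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so feature positions are stable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return a 20-dimensional vector of residue frequencies.

    Characters outside the 20 standard residues (for example 'X') are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        # Empty or fully non-standard sequence: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def encode_pair(pathogen_seq, host_seq):
    """Encode a pathogen-host protein pair as one 40-dimensional vector.

    Assumption: the two composition vectors are simply concatenated.
    """
    return amino_acid_composition(pathogen_seq) + amino_acid_composition(host_seq)
```

A vector produced by `encode_pair` could then be passed, together with an interaction label, to any of the classifier families listed in the abstract.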
Pages: 170-177
Page count: 8