A new sequence based encoding for prediction of host-pathogen protein interactions

被引:13
|
作者
Kosesoy, Irfan [1 ]
Gok, Murat [1 ]
Oz, Cemil [2 ]
机构
[1] Yalova Univ, Dept Comp Engn, TR-77100 Merkez, Yalova, Turkey
[2] Sakarya Univ, Dept Comp & Informat Sci, TR-54050 Serdivan, Sakarya, Turkey
关键词
Infectious diseases; Host-pathogen interactions; Protein-protein interactions; Protein networks; Machine learning; DATABASE;
D O I
10.1016/j.compbiolchem.2018.12.001
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pathogen-host interactions are very important to figure out the infection process at the molecular level, where pathogen proteins physically bind to human proteins to manipulate critical biological processes in the host cell. Data scarcity and data unavailability are two major problems for computational approaches in the prediction of pathogen-host interactions. Developing a computational method to predict pathogen-host interactions with high accuracy, based on protein sequences alone, is of great importance because it can eliminate these problems. In this study, we propose a novel and robust sequence based feature extraction method, named Location Based Encoding, to predict pathogen-host interactions with machine learning based algorithms. In this context, we use Bacillus Anthracis and Yersinia Pestis data sets as the pathogen organisms and human proteins as the host model to compare our method with sequence based protein encoding methods, which are widely used in the literature, namely amino acid composition, amino acid pair, and conjoint triad. We use these encoding methods with decision trees (Random Forest, j48), statistical (Bayesian Networks, Naive Bayes), and instance based (kNN) classifiers to predict pathogen-host interactions. We conduct different experiments to evaluate the effectiveness of our method. We obtain the best results among all the experiments with RF classifier in terms of F1, accuracy, MCC, and AUC.
引用
收藏
页码:170 / 177
页数:8
相关论文
共 50 条
  • [41] Jenner-predict server: prediction of protein vaccine candidates (PVCs) in bacteria based on host-pathogen interactions
    Varun Jaiswal
    Sree Krishna Chanumolu
    Ankit Gupta
    Rajinder S Chauhan
    Chittaranjan Rout
    BMC Bioinformatics, 14
  • [42] Jenner-predict server: prediction of protein vaccine candidates (PVCs) in bacteria based on host-pathogen interactions
    Jaiswal, Varun
    Chanumolu, Sree Krishna
    Gupta, Ankit
    Chauhan, Rajinder S.
    Rout, Chittaranjan
    BMC BIOINFORMATICS, 2013, 14
  • [43] On the principle of host evolution in host-pathogen interactions
    Martcheva, Maia
    Tuncer, Necibe
    Kim, Yena
    JOURNAL OF BIOLOGICAL DYNAMICS, 2017, 11 (01) : 102 - 119
  • [44] Computational predictions of host-pathogen interactions using domain and sequence signature
    Das, Dibyajyoti
    Krishnan, Sowmya Ramaswamy
    Bulusu, Gopalakrishnan
    Roy, Arijit
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 935 - 938
  • [45] Focusing on the Host Side of Host-Pathogen Interactions
    Jhaveri, Ravi
    CLINICAL THERAPEUTICS, 2019, 41 (10) : 1904 - 1906
  • [46] Evolutionary insights into host-pathogen interactions from mammalian sequence data
    Sironi, Manuela
    Cagliani, Rochele
    Forni, Diego
    Clerici, Mario
    NATURE REVIEWS GENETICS, 2015, 16 (04) : 224 - 236
  • [47] Bacterial Serine/Threonine Protein Kinases in Host-Pathogen Interactions*
    Canova, Marc J.
    Molle, Virginie
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2014, 289 (14) : 9473 - 9479
  • [48] Issues in performance evaluation for host-pathogen protein interaction prediction
    Abbasi, Wajid Arshad
    Minhas, Fayyaz Ul Amir Afsar
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (03)
  • [49] Host-pathogen Protein Interaction Prediction Based on Local Topology Structures of a Protein Interaction Network
    Jindalertudomdee, Jira
    Hayashida, Morihiro
    Song, Jiangning
    Akutsu, Tatsuya
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2016, : 7 - 12
  • [50] Protein prenylation: A new mode of host-pathogen interaction
    Amaya, Moushimi
    Baranova, Ancha
    van Hoek, Monique L.
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2011, 416 (1-2) : 1 - 6