Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields

被引:30
作者
Chang, Kuan Y. [1 ]
Lin, Tung-pei [1 ]
Shih, Ling-Yi [1 ]
Wang, Chien-Kuo [2 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung, Taiwan
[2] Asia Univ, Dept Biotechnol, Taichung, Taiwan
关键词
MESSENGER-RNA; WEB SERVER; LACTOFERRIN; PROTEINS; CATHELICIDIN; EXPRESSION; SEQUENCE; HCAP-18; LL-37; CDNA;
D O I
10.1371/journal.pone.0119490
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Antimicrobial peptides (AMPs) are potent drug candidates against microbes such as bacteria, fungi, parasites, and viruses. The size of AMPs ranges from less than ten to hundreds of amino acids. Often only a few amino acids or the critical regions of antimicrobial proteins matter the functionality. Accurately predicting the AMP critical regions could benefit the experimental designs. However, no extensive analyses have been done specifically on the AMP critical regions and computational modeling on them is either non-existent or settled to other problems. With a focus on the AMP critical regions, we thus develop a computational model AMPcore by introducing a state-of-the-art machine learning method, conditional random fields. We generate a comprehensive dataset of 798 AMPs cores and a low similarity dataset of 510 representative AMP cores. AMPcore could reach a maximal accuracy of 90% and 0.79 Matthew's correlation coefficient on the comprehensive dataset and a maximal accuracy of 83% and 0.66 MCC on the low similarity dataset. Our analyses of AMP cores follow what we know about AMPs: High in glycine and lysine, but low in aspartic acid, glutamic acid, and methionine; the abundance of a-helical structures; the dominance of positive net charges; the peculiarity of amphipathicity. Two amphipathic sequence motifs within the AMP cores, an amphipathic alpha-helix and an amphipathic p-helix, are revealed. In addition, a short sequence motif at the N-terminal boundary of AMP cores is reported for the first time: arginine at the P(-1) coupling with glycine at the P1 of AMP cores occurs the most, which might link to microbial cell adhesion.
引用
收藏
页数:16
相关论文
共 55 条
[1]   C- and N-truncated antimicrobial peptides from LFampin 265 - 284: Biophysical versus microbiology results [J].
Adao, Regina ;
Nazmi, Kamran ;
Bolscher, Jan G. M. ;
Bastos, Margarida .
JOURNAL OF PHARMACY AND BIOALLIED SCIENCES, 2011, 3 (01) :60-69
[2]  
[Anonymous], P 2003 C N AM CHAPT
[3]  
[Anonymous], 2013, CRF YET ANOTHER CRF
[4]  
[Anonymous], 2001, P INT C MACH LEARN I
[5]  
[Anonymous], 2004 CVPR 2004 P 200
[6]  
[Anonymous], PSSPRED MULTIPLE NEU
[7]   MEME SUITE: tools for motif discovery and searching [J].
Bailey, Timothy L. ;
Boden, Mikael ;
Buske, Fabian A. ;
Frith, Martin ;
Grant, Charles E. ;
Clementi, Luca ;
Ren, Jingyuan ;
Li, Wilfred W. ;
Noble, William S. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W202-W208
[8]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[9]   IDENTIFICATION OF THE BACTERICIDAL DOMAIN OF LACTOFERRIN [J].
BELLAMY, W ;
TAKASE, M ;
YAMAUCHI, K ;
WAKABAYASHI, H ;
KAWASE, K ;
TOMITA, M .
BIOCHIMICA ET BIOPHYSICA ACTA, 1992, 1121 (1-2) :130-136
[10]   Analysis and Prediction of Highly Effective Antiviral Peptides Based on Random Forests [J].
Chang, Kuan Y. ;
Yang, Je-Ruei .
PLOS ONE, 2013, 8 (08)