Development of a novel clustering tool for linear peptide sequences

被引:53
|
作者
Dhanda, Sandeep K. [1 ]
Vaughan, Kerrie [1 ]
Schulten, Veronique [1 ]
Grifoni, Alba [1 ]
Weiskopf, Daniela [1 ]
Sidney, John [1 ]
Peters, Bjoern [1 ,2 ]
Sette, Alessandro [1 ,2 ]
机构
[1] La Jolla Inst Allergy & Immunol, Div Vaccine Discovery, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Dept Med, San Diego, CA 92103 USA
基金
美国国家卫生研究院;
关键词
Allergy; Antigens/Peptides/Epitopes; Bioinformatics >; MHC/HLA; Viral; EPITOPE; REACTIVITY; MOLECULES; ALIGNMENT; ALLERGEN; DATABASE;
D O I
10.1111/imm.12984
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Epitopes identified in large-scale screens of overlapping peptides often share significant levels of sequence identity, complicating the analysis of epitope-related data. Clustering algorithms are often used to facilitate these analyses, but available methods are generally insufficient in their capacity to define biologically meaningful epitope clusters in the context of the immune response. To fulfil this need we developed an algorithm that generates epitope clusters based on representative or consensus sequences. This tool allows the user to cluster peptide sequences on the basis of a specified level of identity by selecting among three different method options. These include the 'clique method', in which all members of the cluster must share the same minimal level of identity with each other, and the 'connected graph method', in which all members of a cluster must share a defined level of identity with at least one other member of the cluster. In cases where it is not possible to define a clear consensus sequence with the connected graph method, a third option provides a novel 'cluster-breaking algorithm' for consensus sequence driven sub-clustering. Herein we demonstrate the tool's clustering performance and applicability using (i) a selection of dengue virus epitopes for the 'clique method', (ii) sets of allergen-derived peptides from related species for the 'connected graph method' and (iii) large data sets of eluted ligand, major histocompatibility complex binding and T-cell recognition data captured within the Immune Epitope Database (IEDB) with the newly developed 'cluster-breaking algorithm'. This novel clustering tool is accessible at http://tools.iedb.org/cluster2/.
引用
收藏
页码:331 / 345
页数:15
相关论文
共 50 条
  • [21] MotifCluster: an interactive online tool for clustering and visualizing sequences using shared motifs
    Micah Hamady
    Jeremy Widmann
    Shelley D Copley
    Rob Knight
    Genome Biology, 9
  • [22] CLAGen: A tool for clustering and annotating gene sequences using a suffix tree algorithm
    Han, Sang il
    Lee, Sung Gun
    Kim, Kyung-Hoon
    Choi, Chung Jung
    Kim, Young Han
    Hwang, Kyu Suk
    BIOSYSTEMS, 2006, 84 (03) : 175 - 182
  • [23] A Novel Variable-order Markov Model for Clustering Categorical Sequences
    Xiong, Tengke
    Wang, Shengrui
    Jiang, Qingshan
    Huang, Joshua Zhexue
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (10) : 2339 - 2353
  • [24] Development of novel ligands for peptide GPCRs
    Moran, Brian M.
    McKillop, Aine M.
    O'Harte, Finbarr P. M.
    CURRENT OPINION IN PHARMACOLOGY, 2016, 31 : 57 - 62
  • [25] Development of a Thermal Analysis Tool for Linear Machines
    Eguren, Imanol
    Almandoz, Gaizka
    Egea, Aritz
    Elorza, Leire
    Urdangarin, Ander
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [26] CAVES: A Novel Tool for Comparative Analysis of Variant Epitope Sequences
    Li, Katherine
    Lowey, Connor
    Sandstrom, Paul
    Ji, Hezhao
    VIRUSES-BASEL, 2022, 14 (06):
  • [27] Substrate phage as a tool to identify novel substrate sequences of proteases
    Ohkubo, S
    Miyadera, K
    Sugimoto, Y
    Matsuo, K
    Wierzba, K
    Yamada, Y
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2001, 4 (07) : 573 - 583
  • [28] Development of Antimicrobial Peptide Prediction Tool for Aquaculture Industries
    Aditi Gautam
    Asuda Sharma
    Sarika Jaiswal
    Samar Fatma
    Vasu Arora
    M. A. Iquebal
    S. Nandi
    J. K. Sundaray
    P. Jayasankar
    Anil Rai
    Dinesh Kumar
    Probiotics and Antimicrobial Proteins, 2016, 8 : 141 - 149
  • [29] Development of Antimicrobial Peptide Prediction Tool for Aquaculture Industries
    Gautam, Aditi
    Sharma, Asuda
    Jaiswal, Sarika
    Fatma, Samar
    Arora, Vasu
    Iquebal, M. A.
    Nandi, S.
    Sundaray, J. K.
    Jayasankar, P.
    Rai, Anil
    Kumar, Dinesh
    PROBIOTICS AND ANTIMICROBIAL PROTEINS, 2016, 8 (03) : 141 - 149
  • [30] CLUSTERING CDNA SEQUENCES
    PARSONS, JD
    BRENNER, S
    BISHOP, MJ
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (05): : 461 - 466