CTDGFinder: A Novel Homology-Based Algorithm for Identifying Closely Spaced Clusters of Tandemly Duplicated Genes

被引:3
|
作者
Ortiz, Juan F. [1 ]
Rokas, Antonis [1 ]
机构
[1] Vanderbilt Univ, Dept Biol Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
基金
美国国家科学基金会;
关键词
synteny; tandem gene duplication; genetic linkage; functional divergence; OLFACTORY RECEPTOR GENES; BETA-GLOBIN GENES; HOX GENE; EVOLUTIONARY DYNAMICS; ORGANIZATION; FAMILY; MOUSE; EXPRESSION; SEQUENCE; ORIGIN;
D O I
10.1093/molbev/msw227
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Closely spaced clusters of tandemly duplicated genes (CTDGs) contribute to the diversity of many phenotypes, including chemosensation, snake venom, and animal body plans. CTDGs have traditionally been identified subjectively as genomic neighborhoods containing several gene duplicates in close proximity; however, CTDGs are often highly variable with respect to gene number, intergenic distance, and synteny. This lack of formal definition hampers the study of CTDG evolutionary dynamics and the discovery of novel CTDGs in the exponentially growing body of genomic data. To address this gap, we developed a novel homology-based algorithm, CTDGFinder, which formalizes and automates the identification of CTDGs by examining the physical distribution of individual members of families of duplicated genes across chromosomes. Application of CTDGFinder accurately identified CTDGs for many well-known gene clusters (e.g., Hox and beta-globin gene clusters) in the human, mouse and 20 other mammalian genomes. Differences between previously annotated gene clusters and our inferred CTDGs were due to the exclusion of nonhomologs that have historically been considered parts of specific gene clusters, the inclusion or absence of genes between the CTDGs and their corresponding gene clusters, and the splitting of certain gene clusters into distinct CTDGs. Examination of human genes showing tissue-specific enhancement of their expression by CTDGFinder identified members of several well-known gene clusters (e.g., cytochrome P450s and olfactory receptors) and revealed that they were unequally distributed across tissues. By formalizing and automating CTDG identification, CTDGFinder will facilitate understanding of CTDG evolutionary dynamics, their functional implications, and how they are associated with phenotypic diversity.
引用
收藏
页码:215 / 229
页数:15
相关论文
共 3 条
  • [1] A Novel Measurement Data Classification Algorithm Based on SVM for Tracking Closely Spaced Targets
    Zhao, Zongmin
    Wang, Xiang
    Wang, Tao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2019, 68 (04) : 1089 - 1100
  • [2] A novel fuzzy-based MOPSO algorithm for identifying clusters from complex networks
    Yang, Yue
    Su, Xiaorui
    Zhao, Bowei
    He, Yizhou
    Hu, Lun
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1126 - 1131
  • [3] Identifying novel fruit-related genes in Arabidopsis thaliana based on the random walk with restart algorithm
    Zhang, Yunhua
    Dai, Li
    Liu, Ying
    Zhang, YuHang
    Wang, ShaoPeng
    PLOS ONE, 2017, 12 (05):