PacBio assembly of a Plasmodium knowlesi genome sequence with Hi-C correction and manual annotation of the SICAvar gene family

Cited by: 27
Authors
Lapp, S. A. [1]
Geraldo, J. A. [2, 3]
Chien, J.-T. [1, 4]
Ay, F. [5]
Pakala, S. B. [6, 7]
Batugedara, G. [8]
Humphrey, J. [6, 7]
Debarry, J. D. [6, 7]
Le Roch, K. G. [8]
Galinski, M. R. [1, 4, 10]
Kissinger, J. C. [6, 7, 11]
Affiliations
[1] Emory Univ, Yerkes Natl Primate Res Ctr, Emory Vaccine Ctr, Atlanta, GA 30322 USA
[2] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
[3] Rene Rachou Res Ctr CPqRR FIOCRUZ, Belo Horizonte, MG, Brazil
[4] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
[5] La Jolla Inst Allergy & Immunol, La Jolla, CA 92037 USA
[6] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[7] Univ Georgia, Ctr Trop & Emerging Global Dis, Athens, GA 30602 USA
[8] Univ Calif Riverside, Inst Integrat Genome Biol, Ctr Dis & Vector Res, Dept Cell Biol & Neurosci, Riverside, CA 92521 USA
[9] Malaria Host Pathogen Interact Ctr, Atlanta, GA USA
[10] Emory Univ, Dept Med, Div Infect Dis, Atlanta, GA 30322 USA
[11] Univ Georgia, Dept Genet, Athens, GA 30602 USA
Funding
National Institutes of Health (USA);
Keywords
Plasmodium knowlesi; PacBio; Hi-C; SICAvar; MaHPIC; genome; sequence; annotation; antigenic variation; ANTIGENIC VARIATION; VARIANT ANTIGEN; ERYTHROCYTE-MEMBRANE; ZOONOTIC MALARIA; HUMAN INFECTIONS; EXPRESSION; REVEALS; ORGANIZATION; SOFTWARE; MONKEYS;
DOI
10.1017/S0031182017001329
Chinese Library Classification (CLC) number
R38 [Medical Parasitology]; Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ; 100103 ;
Abstract
Plasmodium knowlesi has risen in importance as a zoonotic parasite that has been causing regular episodes of malaria throughout South East Asia. The P. knowlesi genome sequence generated in 2008 highlighted and confirmed many similarities and differences in Plasmodium species, including a global view of several multigene families, such as the large SICAvar multigene family encoding the variant antigens known as the schizont-infected cell agglutination proteins. However, repetitive DNA sequences are the bane of any genome project, and this and other Plasmodium genome projects have not been immune to the gaps, rearrangements and other pitfalls created by these genomic features. Today, long-read PacBio and chromatin conformation technologies are overcoming such obstacles. Here, based on the use of these technologies, we present a highly refined de novo P. knowlesi genome sequence of the Pk1(A+) clone. This sequence and annotation, referred to as the 'MaHPIC Pk genome sequence', includes manual annotation of the SICAvar gene family with 136 full-length members categorized as type I or II. This sequence provides a framework that will permit a better understanding of the SICAvar repertoire, selective pressures acting on this gene family and mechanisms of antigenic variation in this species and other pathogens.
Pages: 71-84
Page count: 14
Related papers
44 in total
  • [1] Chromosome Genome Assembly and Annotation of the Capitulum mitella With PacBio and Hi-C Sequencing Data
    Chen, Duo
    Zheng, Xuehai
    Huang, Zhen
    Chen, Youqiang
    Xue, Ting
    Li, Ke
    Rao, Xiaozhen
    Lin, Gang
    FRONTIERS IN GENETICS, 2021, 12
  • [2] Chromosome genome assembly and annotation of the yellowbelly pufferfish with PacBio and Hi-C sequencing data
    Yitao Zhou
    Shijun Xiao
    Gang Lin
    Duo Chen
    Wan Cen
    Ting Xue
    Zhiyu Liu
    Jianxing Zhong
    Yanting Chen
    Yijun Xiao
    Jianhua Chen
    Yunhai Guo
    Youqiang Chen
    Yanding Zhang
    Xuefeng Hu
    Zhen Huang
    Scientific Data, 6
  • [3] Chromosome genome assembly and annotation of the yellowbelly pufferfish with PacBio and Hi-C sequencing data
    Zhou, Yitao
    Xiao, Shijun
    Lin, Gang
    Chen, Duo
    Cen, Wan
    Xue, Ting
    Liu, Zhiyu
    Zhong, Jianxing
    Chen, Yanting
    Xiao, Yijun
    Chen, Jianhua
    Guo, Yunhai
    Chen, Youqiang
    Zhang, Yanding
    Hu, Xuefeng
    Huang, Zhen
    SCIENTIFIC DATA, 2019, 6 (1)
  • [4] Chromosome-level genome assembly and annotation of the Yunling cattle with PacBio and Hi-C sequencing data
    Zaichao Wei
    Lilian Zhang
    Lutao Gao
    Jian Chen
    Lin Peng
    Linnan Yang
    Scientific Data, 11
  • [5] Chromosome-level genome assembly and annotation of the Yunling cattle with PacBio and Hi-C sequencing data
    Wei, Zaichao
    Zhang, Lilian
    Gao, Lutao
    Chen, Jian
    Peng, Lin
    Yang, Linnan
    SCIENTIFIC DATA, 2024, 11 (01)
  • [6] The sequence and de novo assembly of Takifugu bimaculatus genome using PacBio and Hi-C technologies
    Zhixiong Zhou
    Bo Liu
    Baohua Chen
    Yue Shi
    Fei Pu
    Huaqiang Bai
    Leibin Li
    Peng Xu
    Scientific Data, 6
  • [7] The sequence and de novo assembly of Takifugu bimaculatus genome using PacBio and Hi-C technologies
    Zhou, Zhixiong
    Liu, Bo
    Chen, Baohua
    Shi, Yue
    Pu, Fei
    Bai, Huaqiang
    Li, Leibin
    Xu, Peng
    SCIENTIFIC DATA, 2019, 6 (1)
  • [8] Chromosome genome assembly of the Camphora longepaniculata (Gamble) with PacBio and Hi-C sequencing data
    Yan, Kuan
    Zhu, Hui
    Cao, Guiling
    Meng, Lina
    Li, Junqiang
    Zhang, Jian
    Liu, Sicen
    Wang, Yujie
    Feng, Ruizhang
    Soaud, Salma A.
    Abd Elhamid, Mohamed A.
    Heakel, Rania M. Y.
    Wei, Qin
    El-Sappah, Ahmed H.
    Ru, Dafu
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [9] Chromosome-Level Genome Assembly of Cerasus humilis Using PacBio and Hi-C Technologies
    Wang, Pengfei
    Yi, Shaokui
    Mu, Xiaopeng
    Zhang, Jiancheng
    Du, Junjie
    FRONTIERS IN GENETICS, 2020, 11
  • [10] The sequencing and de novo assembly of the Larimichthys crocea genome using PacBio and Hi-C technologies
    Baohua Chen
    Zhixiong Zhou
    Qiaozhen Ke
    Yidi Wu
    Huaqiang Bai
    Fei Pu
    Peng Xu
    Scientific Data, 6