The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

被引:224
作者
Ahn, Sung-Min [2 ,5 ]
Kim, Tae-Hyung [1 ]
Lee, Sunghoon [1 ]
Kim, Deokhoon [2 ]
Ghang, Ho [1 ]
Kim, Dae-Soo [1 ]
Kim, Byoung-Chul [1 ]
Kim, Sang-Yoon [1 ]
Kim, Woo-Yeon [1 ]
Kim, Chulhong [1 ]
Park, Daeui [1 ]
Lee, Yong Seok [1 ]
Kim, Sangsoo [3 ]
Reja, Rohit [1 ]
Jho, Sungwoong [1 ]
Kim, Chang Geun [6 ]
Cha, Ji-Young [2 ]
Kim, Kyung-Hee [4 ]
Lee, Bonghee [2 ]
Bhak, Jong [1 ]
Kim, Seong-Jin [2 ]
机构
[1] KRIBB, KOBIC, Taejon 305806, South Korea
[2] Gachon Univ Med & Sci, Lee Gil Ya Canc & Diabet Inst, Inchon 460799, South Korea
[3] Soongsil Univ, Dept Bioinformat & Life Sci, Seoul 156743, South Korea
[4] Gachon Univ, Dept Lab Med, Gil Hosp, Inchon 405760, South Korea
[5] Gachon Univ, Dept Translat Med, Gil Hosp, Inchon 405760, South Korea
[6] Korea Res Inst Stand & Sci, Natl Ctr Stand Reference Data, Taejon 305340, South Korea
关键词
DUAL ORIGINS; DNA; DATABASE;
D O I
10.1101/gr.092197.109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present the first Korean individual genome sequence (SJK) and analysis results. The diploid genome of a Korean male was sequenced to 28.95-fold redundancy using the Illumina paired-end sequencing method. SJK covered 99.9% of the NCBI human reference genome. We identified 420,083 novel single nucleotide polymorphisms (SNPs) that are not in the dbSNP database. Despite a close similarity, significant differences were observed between the Chinese genome (YH), the only other Asian genome available, and SJK: (1) 39.87% ( 1,371,239 out of 3,439,107) SNPs were SJK-specific (49.51% against Venter's, 46.94% against Watson's, and 44.17% against the Yoruba genomes); (2) 99.5% ( 22,495 out of 22,605) of short indels (<4 bp) discovered on the same loci had the same size and type as YH; and (3) 11.3% ( 331 out of 2920) deletion structural variants were SJK-specific. Even after attempting to map unmapped reads of SJK to unanchored NCBI scaffolds, HGSV, and available personal genomes, there were still 5.77% SJK reads that could not be mapped. All these findings indicate that the overall genetic differences among individuals from closely related ethnic groups may be significant. Hence, constructing reference genomes for minor socio-ethnic groups will be useful for massive individual genome sequencing.
引用
收藏
页码:1622 / 1629
页数:8
相关论文
共 34 条
[1]   SEQUENCE AND ORGANIZATION OF THE HUMAN MITOCHONDRIAL GENOME [J].
ANDERSON, S ;
BANKIER, AT ;
BARRELL, BG ;
DEBRUIJN, MHL ;
COULSON, AR ;
DROUIN, J ;
EPERON, IC ;
NIERLICH, DP ;
ROE, BA ;
SANGER, F ;
SCHREIER, PH ;
SMITH, AJH ;
STADEN, R ;
YOUNG, IG .
NATURE, 1981, 290 (5806) :457-465
[2]   Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA [J].
Andrews, RM ;
Kubacka, I ;
Chinnery, PF ;
Lightowlers, RN ;
Turnbull, DM ;
Howell, N .
NATURE GENETICS, 1999, 23 (02) :147-147
[3]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[4]   HIGH-RESOLUTION OF HUMAN EVOLUTIONARY TREES WITH POLYMORPHIC MICROSATELLITES [J].
BOWCOCK, AM ;
RUIZLINARES, A ;
TOMFOHRDE, J ;
MINCH, E ;
KIDD, JR ;
CAVALLISFORZA, LL .
NATURE, 1994, 368 (6470) :455-457
[5]  
Brandon MC, 2005, NUCLEIC ACIDS RES, V33, pD611
[6]   The Personal Genome Project [J].
Church, G. M. .
MOLECULAR SYSTEMS BIOLOGY, 2005, 1 (1)
[7]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945
[8]   Dual origins of the Japanese: common ground for hunter-gatherer and farmer Y chromosomes [J].
Hammer, MF ;
Karafet, TM ;
Park, H ;
Omoto, K ;
Harihara, S ;
Stoneking, M ;
Horai, S .
JOURNAL OF HUMAN GENETICS, 2006, 51 (01) :47-58
[9]   Detection of large-scale variation in the human genome [J].
Iafrate, AJ ;
Feuk, L ;
Rivera, MN ;
Listewnik, ML ;
Donahoe, PK ;
Qi, Y ;
Scherer, SW ;
Lee, C .
NATURE GENETICS, 2004, 36 (09) :949-951
[10]   The Peopling of Korea Revealed by Analyses of Mitochondrial DNA and Y-Chromosomal Markers [J].
Jin, Han-Jun ;
Tyler-Smith, Chris ;
Kim, Wook .
PLOS ONE, 2009, 4 (01)