Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

Cited by: 6
Authors
Hung, Che-Lun [1]
Chen, Wen-Pei [2]
Hua, Guan-Jie [3]
Zheng, Huiru [4]
Tsai, Suh-Jen Jane [2]
Lin, Yaw-Ling [2, 5]
Affiliations
[1] Providence Univ, Dept Comp Sci & Commun Engn, Taichung 43301, Taiwan
[2] Providence Univ, Dept Appl Chem, Taichung 43301, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30013, Taiwan
[4] Univ Ulster, Sch Comp & Math, Newtownabbey BT37 0QB, Northern Ireland
[5] Providence Univ, Dept Comp Sci & Informat Engn, Taichung 43301, Taiwan
Source
Keywords
SNPs; haplotype; cloud computing; parallel processing; MapReduce; DYNAMIC-PROGRAMMING ALGORITHM; LINKAGE DISEQUILIBRIUM; HAPLOTYPE BLOCKS; RECOMBINATION; MAPREDUCE; HISTORY; DISEASE; GENES;
DOI
10.3390/ijms16011096
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010; 081704;
Abstract
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-parallelized combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
Pages: 1096-1110
Page count: 15
Related papers
50 in total
  • [1] CloudTSS: A TagSNP Selection Approach on Cloud Computing
    Hung, Che-Lun
    Lin, Yaw-Ling
    Hua, Guan-Jie
    Hu, Yu-Chen
    GRID AND DISTRIBUTED COMPUTING, 2011, 261 : 525 - +
  • [2] Cloud computing-based parallel genetic algorithm for gene selection in cancer classification
    Keco, Dino
    Subasi, Abdulhamit
    Kevric, Jasmin
    NEURAL COMPUTING & APPLICATIONS, 2018, 30 (05): : 1601 - 1610
  • [3] Cloud computing-based parallel genetic algorithm for gene selection in cancer classification
    Dino Kečo
    Abdulhamit Subasi
    Jasmin Kevric
    Neural Computing and Applications, 2018, 30 : 1601 - 1610
  • [4] Cloud computing-based big data processing and intelligent analytics
    Dong, Fang
    Wu, Chenshu
    Gao, Shangce
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (24):
  • [5] The Impact of Cloud Computing-Based Big Data Platform on IE Education
    Wang, Ziyan
    Wan, Yijie
    Liang, Hua
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [6] An Autonomic Computing-based Architecture for Cloud Computing Elasticity
    Coutinho, Emanuel Ferreira
    Gomes, Danielo Goncalves
    de Souza, Jose Neuman
    LANOMS 2015 8TH LATIN AMERICAN NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2015, : 111 - 112
  • [7] Cloud computing-based map-matching for transportation data center
    Huang, Jian
    Qie, Jinhui
    Liu, Chunwei
    Li, Siyang
    Weng, Jingnong
    Lv, Weifeng
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2015, 14 (06) : 431 - 443
  • [8] Weight-Based Data Center Selection Algorithm in Cloud Computing Environment
    Nandwani, Sunny
    Achhra, Mohit
    Shah, Raveena
    Tamrakar, Aditi
    Joshi, Kiran
    Raksha, Sowmiya
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2015, 2016, 394 : 515 - 525
  • [9] CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data
    Wang, Jun
    Guo, Mao-zu
    Wang, Chun-yu
    BMC BIOINFORMATICS, 2009, 10
  • [10] A hybrid clustering and graph based algorithm for tagSNP selection
    Guo, Mao-Zu
    Wang, Jun
    Wang, Chun-yu
    Liu, Yang
    SOFT COMPUTING, 2009, 13 (12) : 1143 - 1151