Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

被引:6
|
作者
Hung, Che-Lun [1 ]
Chen, Wen-Pei [2 ]
Hua, Guan-Jie [3 ]
Zheng, Huiru [4 ]
Tsai, Suh-Jen Jane [2 ]
Lin, Yaw-Ling [2 ,5 ]
机构
[1] Providence Univ, Dept Comp Sci & Commun Engn, Taichung 43301, Taiwan
[2] Providence Univ, Dept Appl Chem, Taichung 43301, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30013, Taiwan
[4] Univ Ulster, Sch Comp & Math, Newtownabbey BT37 0QB, North Ireland
[5] Providence Univ, Dept Comp Sci & Informat Engn, Taichung 43301, Taiwan
来源
关键词
SNPs; haplotype; cloud computing; parallel processing; MapReduce; DYNAMIC-PROGRAMMING ALGORITHM; LINKAGE DISEQUILIBRIUM; HAPLOTYPE BLOCKS; RECOMBINATION; MAPREDUCE; HISTORY; DISEASE; GENES;
D O I
10.3390/ijms16011096
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
引用
收藏
页码:1096 / 1110
页数:15
相关论文
共 50 条
  • [21] Cloud Computing-Based Marketplace for Collaborative Design and Manufacturing
    Banerjee, Ashis Gopal
    Beckmann, Benjamin
    Carbone, John
    DeRose, Lynn
    Giani, Annarita
    Koudal, Peter
    Mackenzie, Patricia
    Salvo, Joseph
    Yang, Dan
    Yund, Walter
    INTERNET OF THINGS: IOT INFRASTRUCTURES, PT I, 2016, 169 : 409 - 418
  • [22] An Iterative Algorithm for tagSNP Selection Based on Information Entropy Analysis
    Yeh, Chia-Hung
    Jheng, Jing-Wun
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 64 (02): : 233 - 239
  • [23] Soft computing-based preference selection index method for human resource management
    Vahdani, Behnam
    Mousavi, S. Meysam
    Ebrahimnejad, S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (01) : 393 - 403
  • [24] DNA computing-based algorithm for assignment problems
    Department of Mathematics and Physics, Wuhan Polytechnic University, Wuhan 430023, China
    不详
    Huazhong Ligong Daxue Xuebao, 2008, 2 (35-38):
  • [25] MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses
    Hu, Yingwei
    Schnaubelt, Michael
    Chen, Li
    Zhang, Bai
    Hoang, Trung
    Lih, T. Mamie
    Zhang, Zhen
    Zhang, Hui
    ANALYTICAL CHEMISTRY, 2024, 96 (25) : 10145 - 10151
  • [26] Cloud Computing-based Rehabilitation Services Information System
    Chen Xiao-hua
    Ma Yun-long
    Zhang Guo-feng
    Liu Bo
    AUTOMATIC CONTROL AND MECHATRONIC ENGINEERING II, 2013, 415 : 389 - +
  • [27] A trust-based resource selection algorithm in Cloud Computing
    Liu, Xinran, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
  • [28] Cloud Computing-based Product Collaborative Design and Simulation
    Bin, He
    Tao, Cao Jin
    Lin, He Xiao
    Feng, Lv Hai
    2010 ETP/IITA CONFERENCE ON SYSTEM SCIENCE AND SIMULATION IN ENGINEERING (SSSE 2010), 2010, : 242 - 245
  • [29] A Cloud Computing-Based Approach for Efficient Processing of Massive Machine Tool Diagnosis Data
    Li, Heng
    Zhang, Xiaoyang
    Tao, Shuyin
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (16)
  • [30] An Algorithm for Computing Efficiently in Cloud Based Data Centers
    Shah, Idris Afzal
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 397 - 400