Hadoop-based Genome Comparisons

Authors
Heinzlreiter, Paul [1]
Krieger, Michael T. [1]
Leitner, Iris [1]
Affiliations
[1] RISC Software GmbH, A-4232 Hagenberg, Austria
Keywords
BigData application; bioinformatics; genome comparison; Hadoop; HBase; MapReduce
DOI
10.1109/CGC.2012.83
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
Due to the ever-increasing amounts of data in application areas relevant to both business and research, the requirements for data handling have increased significantly in recent years, often exceeding the capabilities of the standard software used in specific application areas. To face these challenges, software needs to be adapted or rewritten to integrate novel big data handling techniques. This paper focuses on the implementation of a genome sequence comparison application from the domain of bioinformatics that runs on top of Hadoop, relying on HBase for data management and MapReduce jobs for computation.
Pages: 695-701
Number of pages: 7