Delivering Bioinformatics MapReduce Applications in the Cloud

被引:0
|
作者
Forer, Lukas [1 ]
Lipic, Tomislav [2 ]
Schoenherr, Sebastian [1 ]
Weissensteiner, Hansi [1 ]
Davidovic, Davor [2 ]
Kronenberg, Florian [1 ]
Afgan, Enis [2 ]
机构
[1] Med Univ Innsbruck, Div Genet Epidemiol, A-6020 Innsbruck, Austria
[2] Rudjer Boskovic Inst, Ctr Informat & Comp, Zagreb, Croatia
来源
2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO) | 2014年
关键词
SEQUENCING DATA; FRAMEWORK; PLATFORM; TOOLKIT;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The ever-increasing data production and availability in the field of bioinformatics demands a paradigm shift towards the utilization of novel solutions for efficient data storage and processing, such as the MapReduce data parallel programming model and the corresponding Apache Hadoop framework. Despite the evident potential of this model and existence of already available algorithms and applications, especially for batch processing of large data sets as in the Next Generation Sequencing analysis, bioinformatics MapReduce applications are yet to become widely adopted in the bioinformatics data analysis. We identify two prerequisites for their adaptation and utilization: (1) the ability to compose complex workflows from multiple bioinformatics MapReduce tools that will abstract technical details of how those tools are combined and executed allowing bioinformatics domain experts to focus on the analysis, and (2) the availability of accessible and flexible computing infrastructure for this type of data processing. This paper presents integration of two existing systems: Cloudgene, a bioinformatics MapReduce workflow framework, and CloudMan, a cloud manager for delivering application execution environments. Together, they enable delivery of bioinformatics MapReduce applications in the Cloud.
引用
收藏
页码:373 / 377
页数:5
相关论文
共 50 条
  • [1] Cloud Computing using Bioinformatics MapReduce Applications
    Kumar, Vineet
    2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [2] An Advanced MapReduce: Cloud MapReduce, Enhancements and Applications
    Dahiphale, Devendra
    Karve, Rutvik
    Vasilakos, Athanasios V.
    Liu, Huan
    Yu, Zhiwei
    Chhajer, Amit
    Wang, Jianmin
    Wang, Chaokun
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2014, 11 (01): : 101 - 115
  • [3] A case study of tuning MapReduce for efficient Bioinformatics in the cloud
    Shi, Lizhen
    Wang, Zhong
    Yu, Weikuan
    Meng, Xiandong
    PARALLEL COMPUTING, 2017, 61 : 83 - 95
  • [4] Cloud Technologies for Bioinformatics Applications
    Ekanayake, Jaliya
    Gunarathne, Thilina
    Qiu, Judy
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (06) : 998 - 1011
  • [5] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Humphrey, Marty
    2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 74 - 82
  • [6] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Kim, In Kee
    Humphrey, Marty
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 669 - 677
  • [7] Enabling Reliable MapReduce Applications in Dynamic Cloud Infrastructures
    Marozzo, Fabrizio
    Talia, Domenico
    Trunfio, Paolo
    ERCIM NEWS, 2010, (83): : 44 - 45
  • [8] An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics
    Taylor, Ronald C.
    BMC BIOINFORMATICS, 2010, 11
  • [9] An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics
    Ronald C Taylor
    BMC Bioinformatics, 11
  • [10] BioNimbuZ: a federated cloud platform for bioinformatics applications
    Araujo, Aleteia
    Rosa, Michel
    Moura, Breno
    Vergara, Guilherme
    Holanda, Maristela
    Walter, Maria Emilia
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (02) : 144 - 163