On the role of message broker middleware for many-task computing on a big-data platform

被引:4
|
作者
Cao Ngoc Nguyen
Jaehwan Lee
Soonwook Hwang
Jik-Soo Kim
机构
[1] University of Science & Technology,Korea Institute of Science and Technology Information
[2] Korea Aerospace University,School of Electronics and Information Engineering
[3] Myongji University,Department of Computer Engineering
来源
Cluster Computing | 2019年 / 22卷
关键词
Many-task computing; Message broker middleware; Hadoop; YARN; ActiveMQ; Kafka; MOHA; Load balancing;
D O I
暂无
中图分类号
学科分类号
摘要
We have designed and implemented a new data processing framework called “Many-task computing On HAdoop” (MOHA) which aims to effectively support fine-grained many-task applications that can show another type of data-intensive workloads in the YARN-based Hadoop 2.0 platform. MOHA is developed as one of Hadoop YARN applications so that it can transparently co-host existing many-task computing (MTC) applications with other data processing workflows such as MapReduce in a single Hadoop cluster. In this paper, we investigate main characteristics of two well-known open-source message broker middleware systems (Apache ActiveMQ and Kafka) and their implications on a many-task management scheme in our MOHA framework. Through our extensive experiments with a real MTC application, we demonstrate and discuss trade-offs between parallelism and load balancing of data access patterns in message broker middleware systems for Many-Task Computing on Hadoop.
引用
收藏
页码:2527 / 2540
页数:13
相关论文
共 50 条
  • [1] On the role of message broker middleware for many-task computing on a big-data platform
    Cao Ngoc Nguyen
    Lee, Jaehwan
    Hwang, Soonwook
    Kim, Jik-Soo
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 2527 - 2540
  • [2] MOHA: Many-Task Computing meets the Big Data Platform
    Kim, Jik-Soo
    Nguyen, Cao
    Hwang, Soonwook
    PROCEEDINGS OF THE 2016 IEEE 12TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2016, : 193 - 202
  • [3] Middleware support for many-task computing
    Raicu, Ioan
    Foster, Ian
    Wilde, Mike
    Zhang, Zhao
    Iskra, Kamil
    Beckman, Peter
    Zhao, Yong
    Szalay, Alex
    Choudhary, Alok
    Little, Philip
    Moretti, Christopher
    Chaudhary, Amitabh
    Thain, Douglas
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2010, 13 (03): : 291 - 314
  • [4] Middleware support for many-task computing
    Ioan Raicu
    Ian Foster
    Mike Wilde
    Zhao Zhang
    Kamil Iskra
    Peter Beckman
    Yong Zhao
    Alex Szalay
    Alok Choudhary
    Philip Little
    Christopher Moretti
    Amitabh Chaudhary
    Douglas Thain
    Cluster Computing, 2010, 13 : 291 - 314
  • [5] Many-Task Computing for Grids and Supercomputers
    Raicu, Ioan
    Foster, Ian T.
    Zhao, Yong
    2008 WORKSHOP ON MANY-TASK COMPUTING ON GRIDS AND SUPERCOMPUTERS, 2008, : 57 - +
  • [6] MANY-TASK COMPUTING ON MANY-CORE ARCHITECTURES
    Valero-Lara, Pedro
    Nookala, Poornima
    Pelayo, Fernando L.
    Jansson, Johan
    Dimitropoulos, Serapheim
    Raicu, Ioan
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2016, 17 (01): : 33 - 46
  • [7] Fog computing: a platform for big-data marketing analytics
    Hornik, Jacob
    Rachamim, Matti
    Graguer, Sergei
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [8] A Middleware for Managing Big-Data Flows
    Gupta, Rajeev
    Gupta, Himanshu
    Gupta, Sanjeev
    Padmanabhan, Sriram
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 410 - 424
  • [9] Platform and Co-runner Affinities for Many-Task Applications in Distributed Computing Platforms
    Kim, Seontae
    Hwang, Eunji
    Yoo, Tae-kyung
    Kim, Jik-Soo
    Hwang, Soonwook
    Choi, Young-ri
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 667 - 676
  • [10] A Data Throughput Prediction and Optimization Service for Widely Distributed Many-Task Computing
    Yin, Dengpan
    Yildirim, Esma
    Kulasekaran, Sivakumar
    Ross, Brandon
    Kosar, Tevfik
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (06) : 899 - 909