A Framework for Scheduling and Managing Big Data Applications in a Distributed Infrastructure

被引:0
|
作者
Govindarajan, Kannan [1 ]
Somasundaram, Thamarai Selvi [2 ]
Boulanger, David [1 ]
Kumar, Vivekanandan Suresh [1 ]
Kinshuk [1 ]
机构
[1] Athabasca Univ, Edmonton, AB, Canada
[2] Anna Univ, Madras, Tamil Nadu, India
关键词
big data; grid computing; cloud computing; cluster computing; software defined networking; distributed processing; Hadoop Distributed File System;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, big data has received attention from researchers, business industries, education, and scientific communities. Big data analytics has to deal with large scale data that consist of both structured and unstructured data. These data are to be handled properly, that is extracting, processing, and analyzing those data to obtain meaningful information from them in a limited time. To yield insightful information, the processing of big data analytics requires high performance computing system, storage, and network resources. Hence, it is essential to design a high performance computing infrastructure with sufficient bandwidth which is capable to handle the big data processing in an efficient manner. However, the current network architectures in those infrastructures, with predefined network policies, do not allow for just-in-time reconfiguration of the networking infrastructure as demanded by big data analytics. In addressing these limitations, Software-Defined Networking (SDN) offers the means to dynamically configure the network parameters, dynamically provision the networks, and the network itself can be sliced in an on-demand manner. This research aims to characterize SDN with respect to the demands of big data analytics in Cluster, Grid, and Cloud Computing resources. The main motivation behind this research study is to design and develop an intelligent framework named as Big Data Analytics Management System (BDAMS) for collectively managing the compute, storage, and network resources in Cluster, Grid, and Cloud infrastructure for big data analytics.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Scheduling of big data applications on distributed cloud based on QoS parameters
    Rajinder Sandhu
    Sandeep K. Sood
    Cluster Computing, 2015, 18 : 817 - 828
  • [2] Scheduling of big data applications on distributed cloud based on QoS parameters
    Sandhu, Rajinder
    Sood, Sandeep K.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 817 - 828
  • [3] Spark Based Distributed Deep Learning Framework For Big Data Applications
    Khumoyun, Akhmedov
    Cui, Yun
    Hanku, Lee
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMMUNICATIONS TECHNOLOGIES (ICISCT), 2016,
  • [4] Big Data Applications for Managing Roadways
    Mathew, Jijo K.
    Desai, Jairaj C.
    Sakhare, Rahul Suryakant
    Kim, Woosung
    Li, Howell
    Bullock, Darcy M.
    ITE JOURNAL-INSTITUTE OF TRANSPORTATION ENGINEERS, 2021, 91 (02): : 28 - 35
  • [5] Scheduling big data applications within advance reservation framework in optical grids
    Abouelela, Mohamed
    El-Darieby, Mohamed
    APPLIED SOFT COMPUTING, 2016, 38 : 1049 - 1059
  • [6] Task Scheduling for Big Data Management in Fog Infrastructure
    Islam, Tajul
    Hashem, M. M. A.
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [7] A Distributed Framework for Carbon and Cost Aware Geographical Job Scheduling in a Hybrid Data Center Infrastructure
    Mahmud, A. Hasan
    Iyengar, S. S.
    2016 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING (ICAC), 2016, : 75 - 84
  • [8] A Hybrid Cloud Infrastructure for Big Data Applications
    Loreti, Daniela
    Ciampolini, Anna
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1713 - 1718
  • [9] Adaptive cache policy scheduling for big data applications on distributed tiered storage system
    Gu, Rong
    Li, Chongjie
    Shu, Peng
    Yuan, Chunfeng
    Huang, Yihua
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (15):
  • [10] Scheduling distributed applications: The SimGrid simulation framework
    Legrand, A
    Marchal, L
    Casanova, H
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 138 - 145