A Framework for Scheduling and Managing Big Data Applications in a Distributed Infrastructure

被引:0
|
作者
Govindarajan, Kannan [1 ]
Somasundaram, Thamarai Selvi [2 ]
Boulanger, David [1 ]
Kumar, Vivekanandan Suresh [1 ]
Kinshuk [1 ]
机构
[1] Athabasca Univ, Edmonton, AB, Canada
[2] Anna Univ, Madras, Tamil Nadu, India
关键词
big data; grid computing; cloud computing; cluster computing; software defined networking; distributed processing; Hadoop Distributed File System;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, big data has received attention from researchers, business industries, education, and scientific communities. Big data analytics has to deal with large scale data that consist of both structured and unstructured data. These data are to be handled properly, that is extracting, processing, and analyzing those data to obtain meaningful information from them in a limited time. To yield insightful information, the processing of big data analytics requires high performance computing system, storage, and network resources. Hence, it is essential to design a high performance computing infrastructure with sufficient bandwidth which is capable to handle the big data processing in an efficient manner. However, the current network architectures in those infrastructures, with predefined network policies, do not allow for just-in-time reconfiguration of the networking infrastructure as demanded by big data analytics. In addressing these limitations, Software-Defined Networking (SDN) offers the means to dynamically configure the network parameters, dynamically provision the networks, and the network itself can be sliced in an on-demand manner. This research aims to characterize SDN with respect to the demands of big data analytics in Cluster, Grid, and Cloud Computing resources. The main motivation behind this research study is to design and develop an intelligent framework named as Big Data Analytics Management System (BDAMS) for collectively managing the compute, storage, and network resources in Cluster, Grid, and Cloud infrastructure for big data analytics.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Accelerating Big Data Infrastructure and Applications (Ongoing collaboration)
    Brown, Kevin
    Xu, Tianqi
    Iwabuchi, Keita
    Sato, Kento
    Moody, Adam
    Mohror, Kathryn
    Jain, Nikhil
    Bhatele, Abhinav
    Schulz, Martin
    Pearce, Roger
    Gokhale, Maya
    Matuoka, Satoshi
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2017, : 343 - 347
  • [22] A review of infrastructure applications for visual analytics the big data
    Cao, Xiufeng
    Gao, Shu
    Wang, Yan
    Energy Education Science and Technology Part A: Energy Science and Research, 2014, 32 (05): : 4219 - 4226
  • [23] A Novel Architecture of Scheduling System for Big Data Framework
    Dong, Xiaocen
    Lin, Rongheng
    Zou, Hua
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 1329 - 1334
  • [24] BigDataDIRAC: deploying distributed Big Data applications
    Fernandez, Victor
    Mendez, Victor
    Pena, Tomas F.
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1177 - 1180
  • [25] Parallel and distributed computing for Big Data applications
    Senger, Hermes
    Geyer, Claudio
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (08): : 2412 - 2415
  • [26] ICE: Managing Cold State for Big Data Applications
    Chandramouli, Badrish
    Levandoski, Justin
    Cortez, Eli
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 457 - 468
  • [27] Communication infrastructure in distributed scheduling
    Yen, BPC
    COMPUTERS & INDUSTRIAL ENGINEERING, 2002, 42 (2-4) : 149 - 161
  • [28] BigDL: A Distributed Deep Learning Framework for Big Data
    Dai, Jason
    Wang, Yiheng
    Qiu, Xin
    Ding, Ding
    Zhang, Yao
    Wang, Yanzhang
    Jia, Xianyan
    Zhang, Cherry
    Wan, Yan
    Li, Zhichao
    Wang, Jiao
    Huang, Shengsheng
    Wu, Zhongyuan
    Wang, Yang
    Yang, Yuhao
    She, Bowen
    Shi, Dongjie
    Lu, Qi
    Huang, Kai
    Song, Guoqiong
    PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, : 50 - 60
  • [29] A Distributed Randomization Framework for Privacy Preservation in Big Data
    Shukla, Samiksha
    Sadashivappa, G.
    2014 CONFERENCE ON IT IN BUSINESS, INDUSTRY AND GOVERNMENT (CSIBIG), 2014,
  • [30] A Hierarchical Distributed Processing Framework for Big Image Data
    Dong, Le
    Lin, Zhiyu
    Liang, Yan
    He, Ling
    Zhang, Ning
    Chen, Qi
    Cao, Xiaochun
    Izquierdo, Ebroul
    IEEE Transactions on Big Data, 2016, 2 (04): : 297 - 309