A Framework for Scheduling and Managing Big Data Applications in a Distributed Infrastructure

被引:0
|
作者
Govindarajan, Kannan [1 ]
Somasundaram, Thamarai Selvi [2 ]
Boulanger, David [1 ]
Kumar, Vivekanandan Suresh [1 ]
Kinshuk [1 ]
机构
[1] Athabasca Univ, Edmonton, AB, Canada
[2] Anna Univ, Madras, Tamil Nadu, India
关键词
big data; grid computing; cloud computing; cluster computing; software defined networking; distributed processing; Hadoop Distributed File System;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, big data has received attention from researchers, business industries, education, and scientific communities. Big data analytics has to deal with large scale data that consist of both structured and unstructured data. These data are to be handled properly, that is extracting, processing, and analyzing those data to obtain meaningful information from them in a limited time. To yield insightful information, the processing of big data analytics requires high performance computing system, storage, and network resources. Hence, it is essential to design a high performance computing infrastructure with sufficient bandwidth which is capable to handle the big data processing in an efficient manner. However, the current network architectures in those infrastructures, with predefined network policies, do not allow for just-in-time reconfiguration of the networking infrastructure as demanded by big data analytics. In addressing these limitations, Software-Defined Networking (SDN) offers the means to dynamically configure the network parameters, dynamically provision the networks, and the network itself can be sliced in an on-demand manner. This research aims to characterize SDN with respect to the demands of big data analytics in Cluster, Grid, and Cloud Computing resources. The main motivation behind this research study is to design and develop an intelligent framework named as Big Data Analytics Management System (BDAMS) for collectively managing the compute, storage, and network resources in Cluster, Grid, and Cloud infrastructure for big data analytics.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Framework Based Ontology for Heterogenous Big Data Correlation in Cloud Infrastructure
    Izhar, Tengku Adil Tengku
    Apduhan, Bernady O.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
  • [42] TARDIS: Distributed Indexing Framework for Big Time Series Data
    Zhang, Liang
    Alghamdi, Noura
    Eltabakh, Mohamed Y.
    Rundensteiner, Elke A.
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1202 - 1213
  • [43] Effective and efficient distributed management of big clinical data: a framework
    Cuzzocrea, Alfredo
    Grasso, Giorgio Mario
    Nolich, Massimiliano
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2019, 11 (03) : 284 - 313
  • [44] ASTOR - a compute framework for Scalable Distributed Big Data Processing
    Prathapan, Smriti
    Golpayegani, Navid
    Wyatt, Bryan
    Halem, Milton
    Dorband, John
    Trantham, Jon D.
    Markey, Chris A.
    BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [45] Managing big data
    Tracy H. Schloemer
    Nature Energy, 2022, 7 : 122 - 123
  • [46] Parallel and distributed clustering framework for big spatial data mining
    Bendechache, Malika
    Tari, A-Kamel
    Kechadi, M-Tahar
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689
  • [47] Distributed Framework for Big Data Processing: a Goal Driven Approach
    Sliman, Layth
    Charroux, Benoit
    Stroppa, Yvan
    SMART DIGITAL FUTURES 2014, 2014, 262 : 385 - 391
  • [48] Big data analysis for distributed computing job scheduling and reliability evaluation
    Wang, Shiow-Luan
    Hou, Yung-Tsung
    MICROELECTRONICS RELIABILITY, 2019, 94 : 41 - 45
  • [49] Managing big data
    Schloemer, Tracy H.
    NATURE ENERGY, 2022, 7 (02) : 122 - 123
  • [50] Distributed scheduling model for infrastructure networks
    Hegazy, T
    Elhakeem, A
    Elbeltagi, E
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2004, 130 (02) : 160 - 167