A data placement strategy for big data based on DCC in cloud computing systems

被引:2
|
作者
Wang, Tao [1 ,2 ]
Yao, Shihong [2 ]
Xu, Zhengquan [1 ,2 ]
Jia, Shan [2 ]
Xu, Qiang [2 ]
机构
[1] Collaborat Innovat Ctr Geospatial Technol, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
基金
美国国家科学基金会;
关键词
Big Data; data placement; data scheduling; dynamic computation correlation;
D O I
10.1109/SmartCity.2015.139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In complex and data-intensive applications, data scheduling between data centers must occur when multiple datasets stored in distributed data centers are processed by one computation. To store massive datasets effectively and reduce data scheduling between data centers during the execution of computations, a mathematical model of data scheduling between data centers in cloud computing is built and dynamic computation correlation (DCC) between datasets is defined. Then a data placement strategy for big data based on DCC is proposed. Datasets with high DCC are placed into the same data center, and new datasets are dynamically distributed into the most appropriate data center. Comprehensive experiments show that the proposed strategy can effectively reduce the number of data scheduling between data centers and has a considerably low and almost constant computational complexity when the number of data centers increases and the datasets are massive. It can be expected that the proposed strategy will be applicable to the practical large-scale distributed storage systems for big data management.
引用
收藏
页码:623 / 630
页数:8
相关论文
共 50 条
  • [31] Data Cleaning Mechanism for Big Data and Cloud Computing
    Rahul, Kumar
    Banyal, R. K.
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 195 - 198
  • [32] DATA FUSION IN CLOUD COMPUTING:BIG DATA APPROACH
    Szuster, Piotr
    Molina, Jose M.
    Garcia-Herrero, Jesus
    Kolodziej, Joanna
    PROCEEDINGS - 31ST EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2017, 2017, : 569 - 575
  • [33] Energy Efficient Strategy for Cloud based Big Data
    Solanki, Neha
    Kachhwaha, Rajendra
    2017 6TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2017, : 519 - 522
  • [34] Service Innovation of Insurance Data Based on Cloud Computing in the Era of Big Data
    Yang, Wei
    Zhou, Junkai
    COMPLEXITY, 2021, 2021
  • [35] Data Transfer Scheduling for Maximizing Throughput of Big-Data Computing in Cloud Systems
    Xie, Ruitao
    Jia, Xiaohua
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (01) : 87 - 98
  • [36] IoT-Based Big Data Storage Systems in Cloud Computing: Perspectives and Challenges
    Cai, Hongming
    Xu, Boyi
    Jiang, Lihong
    Vasilakos, Athanasios V.
    IEEE INTERNET OF THINGS JOURNAL, 2017, 4 (01): : 75 - 87
  • [37] Construction of Big Data Mining Platform Based on Cloud Computing
    Sun, Mali
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, 2015, 17 : 375 - 378
  • [38] Big Data Mining Analysis Method based on Cloud Computing
    Cai, QingQiu
    Cui, HongGang
    Tang, Hao
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [39] The Modeling of Big Traffic Data Processing Based on Cloud Computing
    Zhang, Dongbo
    YanfangShou
    Xu, Jianmin
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2394 - 2399
  • [40] Study and Application of Big Data Mining Based on Cloud Computing
    Shao, Jie
    PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 34 - 38