A data placement strategy for big data based on DCC in cloud computing systems

被引:2
|
作者
Wang, Tao [1 ,2 ]
Yao, Shihong [2 ]
Xu, Zhengquan [1 ,2 ]
Jia, Shan [2 ]
Xu, Qiang [2 ]
机构
[1] Collaborat Innovat Ctr Geospatial Technol, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
基金
美国国家科学基金会;
关键词
Big Data; data placement; data scheduling; dynamic computation correlation;
D O I
10.1109/SmartCity.2015.139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In complex and data-intensive applications, data scheduling between data centers must occur when multiple datasets stored in distributed data centers are processed by one computation. To store massive datasets effectively and reduce data scheduling between data centers during the execution of computations, a mathematical model of data scheduling between data centers in cloud computing is built and dynamic computation correlation (DCC) between datasets is defined. Then a data placement strategy for big data based on DCC is proposed. Datasets with high DCC are placed into the same data center, and new datasets are dynamically distributed into the most appropriate data center. Comprehensive experiments show that the proposed strategy can effectively reduce the number of data scheduling between data centers and has a considerably low and almost constant computational complexity when the number of data centers increases and the datasets are massive. It can be expected that the proposed strategy will be applicable to the practical large-scale distributed storage systems for big data management.
引用
收藏
页码:623 / 630
页数:8
相关论文
共 50 条
  • [21] Multi-objective Optimization for Data Placement Strategy in Cloud Computing
    Guo, Lizheng
    He, Zongyao
    Zhao, Shuguang
    Zhang, Na
    Wang, Junhao
    Jiang, Changyun
    INFORMATION COMPUTING AND APPLICATIONS, PT 2, 2012, 308 : 119 - 126
  • [22] Cognitive systems and operations research in big data and cloud computing
    Marek R. Ogiela
    Hoon Ko
    Annals of Operations Research, 2018, 265 : 183 - 186
  • [23] Algorithms for Big Data in Advanced Communication Systems and Cloud Computing
    Stergiou, Christos
    Psannis, Kostas E.
    2017 IEEE 19TH CONFERENCE ON BUSINESS INFORMATICS (CBI), VOL 1, 2017, 1 : 196 - 201
  • [24] A Security-aware Data Placement Mechanism for Big Data Cloud Storage Systems
    Kang, Seungmin
    Veeravalli, Bharadwaj
    Aung, Khin Mi Mi
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC), AND IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2016, : 327 - 332
  • [25] Geospatial cloud computing and big data
    Yang, Chaowei Phil
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2017, 61 : 119 - 119
  • [26] Survey on Big Data and Cloud Computing
    Prabha, M. Surya
    Sarojini, B.
    2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 119 - 122
  • [27] Advances in cloud and big data computing
    Bellatreche, Ladjel
    Leung, Carson
    Xia, Yinglong
    El Baz, Didier
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (02):
  • [28] Cloud Computing for Big Data Processing
    Li, Xiaofang
    Zhuang, Yanbin
    Yang, Simon X.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2017, 23 (04): : 545 - 546
  • [29] Cloud Computing for Big Data Analysis
    Marozzo, Fabrizio
    Belcastro, Loris
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [30] An Integration of Big Data and Cloud Computing
    Thingom, Chintureena
    Yeon, Guydeuk
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 729 - 737