The Dynamic Replication Mechanism of HDFS Hot File based on Cloud Storage

Cited by: 2
Authors
Li, Mingyong [1]
Ma, Yan [1]
Chen, Meilian [1]
Affiliations
[1] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
Keywords
Cloud storage; HDFS; hot files; dynamic replication
DOI
10.14257/ijsia.2015.9.8.39
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As an open-source cloud storage scheme, HDFS is used by a growing number of large enterprises and researchers, and it is deployed in many cloud computing systems to handle very large volumes of data. HDFS has many advantages, but it also has known problems, including the NameNode single point of failure, the small-file problem, and hot spots. To address hot spots in HDFS, this paper proposes a dynamic replication mechanism for HDFS hot files based on cloud storage (HDFS-DRM). The mechanism has two parts: a dynamic adjustment mechanism for the number of replicas, and a node selection mechanism for adding and deleting replicas. By adding parameters to the NameNode and BlockMap, it records the number of read requests for each file over a given period and uses that count to decide whether to increase or decrease the number of replicas. The mechanism also presents a replica placement method based on per-stage historical information and node load, which selects an appropriate node on which to add or delete replicas of a file and thereby improves the utilization of DataNode storage space. Experimental results show that, for hot files, HDFS-DRM significantly reduces access latency compared with native HDFS, and that it successfully resolves the hot-spot problem.
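The two parts described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not use any HDFS API: the class name `HotFileReplicaPlanner`, the function `pick_node`, the window length, the hot and cold thresholds, and the replica bounds are all assumptions made for illustration. The placement step here considers node load only, whereas the paper also uses per-stage historical information.

```python
from collections import defaultdict, deque


class HotFileReplicaPlanner:
    """Toy model: count reads per file in a time window, suggest a replica count.

    All thresholds are hypothetical; the abstract does not state concrete values.
    """

    def __init__(self, window_s=60.0, hot_reads=100, cold_reads=10,
                 min_replicas=3, max_replicas=10):
        self.window_s = window_s          # length of the counting period, in seconds
        self.hot_reads = hot_reads        # at or above this count, add a replica
        self.cold_reads = cold_reads      # at or below this count, drop a replica
        self.min_replicas = min_replicas
        self.max_replicas = max_replicas
        self._reads = defaultdict(deque)  # file path -> timestamps of recent reads

    def record_read(self, path, now):
        """Record one read request for `path` at time `now` (seconds)."""
        self._reads[path].append(now)

    def reads_in_window(self, path, now):
        """Discard reads older than the window and return the remaining count."""
        q = self._reads[path]
        while q and q[0] <= now - self.window_s:
            q.popleft()
        return len(q)

    def target_replicas(self, path, current, now):
        """Return the suggested replica count, changing by at most one step."""
        n = self.reads_in_window(path, now)
        if n >= self.hot_reads:
            return min(current + 1, self.max_replicas)
        if n <= self.cold_reads:
            return max(current - 1, self.min_replicas)
        return current


def pick_node(node_load, holders, add):
    """Choose a node for adding or deleting a replica, by load only.

    add=True:  least-loaded node that does not yet hold the file.
    add=False: most-loaded node that currently holds the file.
    Returns None when no candidate exists.
    """
    if add:
        candidates = {n: l for n, l in node_load.items() if n not in holders}
        return min(candidates, key=candidates.get) if candidates else None
    candidates = {n: l for n, l in node_load.items() if n in holders}
    return max(candidates, key=candidates.get) if candidates else None
```

In this sketch a file that crosses the hot threshold gains one replica on the least-loaded node that lacks it, and a file that falls to the cold threshold loses one replica from its most-loaded holder, never going below the minimum.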
Pages: 439-448
Page count: 10