The Dynamic Replication Mechanism of HDFS Hot File based on Cloud Storage

被引:2
|
作者
Li, Mingyong [1 ]
Ma, Yan [1 ]
Chen, Meilian [1 ]
机构
[1] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
关键词
Cloud storage; HDFS; hot files; dynamic Replication;
D O I
10.14257/ijsia.2015.9.8.39
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an open source cloud storage scheme, HDFS is used by more and more large enterprises and researchers, and is actually applied to many cloud computing systems to deal with huge amounts of data. HDFS has many advantages, but there are some problems such as NameNode single point of failure, small file problem, hot issues, etc. For HDFS hot issues, this paper proposes a dynamic Replication mechanism of HDFS hot file based on cloud storage(HDFS-DRM). The mechanism includes a Replication of the dynamic adjustment mechanism and adding, deleting duplicate node selection mechanism in two parts, by increasing the NameNode, BlockMap parameters, it records the number of reading requests of each file in a certain period of time to decide whether to increase or decrease the number of copies. The mechanism presents a replica placement method based on stage historical information and node load and selects the appropriate node to add or delete copies of documents to improve the utilization efficiency of the data node storage space effectively. Experimental results show that, HDFS - DRM in hot files case, compared to native HDFS file system access latency is significantly reduced, HDFS-DRM can solve the hot issues successfully.
引用
收藏
页码:439 / 448
页数:10
相关论文
共 50 条
  • [41] Cost-effective data replication mechanism modelling for cloud storage
    Zaman, Khalid
    Hussain, Altaf
    Imran, Muhammad
    Sohail, Muhammad
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2022, 13 (06) : 652 - 669
  • [42] Enhancing Cloud Object Storage Performance using Dynamic Replication Approach
    Jindarak, Kanatorn
    Uthayopas, Putchong
    PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 800 - 803
  • [43] Profit-Based File Replication in Data Intensive Cloud Data Centers
    Alghamdi, Muhannad
    Tang, Bin
    Chen, Yutian
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [44] Fault-tolerant mechanism combined with replication and error correcting code for cloud file systems
    Yang, Dongri
    Wang, Ying
    Liu, Peng
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2014, 54 (01): : 137 - 144
  • [45] A File Synchronization Framework Based on Rsync Protocol for Cloud Storage Services
    Lim M.
    Transactions of the Korean Institute of Electrical Engineers, 2022, 71 (08): : 1164 - 1175
  • [46] Cloud Based Storage System using Secure Deduplication and File Compression
    Sukruti, Gajare B.
    Rubeena, Khan A.
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [47] Securing Cloud-Based File Storage System via Homomorphism
    Tahir, Adnan
    Khan, M. N. A.
    Mughal, Sheeraz
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN COMPUTER SYSTEMS, 2016, 38 : 13 - 22
  • [48] A Metadata Management Mechanism Based on HDFS
    Chen, Xiaofeng
    Lou, Yuansheng
    Hu, Dongmei
    Applied Decisions in Area of Mechanical Engineering and Industrial Manufacturing, 2014, 577 : 1026 - 1029
  • [49] An Ensemble of Replication and Erasure Codes for Cloud File Systems
    Ma, Yadi
    Nandagopal, Thyaga
    Puttaswamy, Krishna P. N.
    Banerjee, Suman
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 1276 - 1284
  • [50] Placement Scheduling for Replication in HDFS Based on Probabilistic Approach
    Bui, Dinh-Mao
    Lee, Sungyoung
    INCLUSIVE SMART CITIES AND DIGITAL HEALTH, 2016, 9677 : 314 - 320