Integrated document caching and prefetching in storage hierarchies based on Markov-chain predictions

被引:18
|
作者
Kraiss, A [1 ]
Weikum, G [1 ]
机构
[1] Univ Saarland, Dept Comp Sci, D-66041 Saarbrucken, Germany
来源
VLDB JOURNAL | 1998年 / 7卷 / 03期
关键词
performance; caching; prefetching; scheduling; tertiary storage; stochastic modeling; Markov chains;
D O I
10.1007/s007780050060
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large multimedia document archives may hold a major fraction of their data in tertiary storage libraries for cost reasons. This paper develops an integrated approach to the vertical data migration between the tertiary, secondary, and primary storage in that it reconciles speculative prefetching, to mask the high latency of the tertiary storage, with the replacement policy of the document caches at the secondary and primary storage level, and also considers the interaction of these policies with the tertiary and secondary storage request scheduling. The integrated migration policy is based on a continuous-time Markov chain model for predicting the expected number of accesses to a document within a specified time horizon. Prefetching is initiated only if that expectation is higher than those of the documents that need to be dropped from secondary storage to free up the necessary space. In addition, the possible resource contention at the tertiary and secondary storage is taken into account by dynamically assessing the response-time benefit of prefetching a document versus the penalty that it would incur on the response time of the pending document requests. The parameters of the continuous-time Markov chain model, the probabilities of co-accessing certain documents and the interaction times between successive accesses, are dynamically estimated and adjusted to evolving workload patterns by keeping online statistics. The integrated policy for vertical data migration has been implemented in a prototype system. The system makes profitable use of the Markov chain model also for the scheduling of volume exchanges in the tertiary storage library. Detailed simulation experiments with Web-server-like synthetic workloads indicate significant gains in terms of client response time. The experiments also show that the overhead of the statistical bookkeeping and the computations for the access predictions is affordable.
引用
收藏
页码:141 / 162
页数:22
相关论文
共 50 条
  • [1] Integrated document caching and prefetching in storage hierarchies based on Markov-chain predictions
    Achim Kraiss
    Gerhard Weikum
    The VLDB Journal, 1998, 7 : 141 - 162
  • [2] CONTROL MARKOV-CHAIN MODELS FOR BIOLOGICAL HIERARCHIES
    NICOLIS, JS
    PROTONOTARIOS, EN
    VOULODEMOU, I
    JOURNAL OF THEORETICAL BIOLOGY, 1977, 68 (04) : 563 - 581
  • [3] Vertical data migration in large near-line document archives based on Markov-chain predictions
    Kraiss, A
    Weikum, G
    PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, 1997, : 246 - 255
  • [5] Scheduling tasks with Markov-chain based constraints
    Liu, DL
    Hu, XS
    Lemmon, MD
    Ling, Q
    17TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS, PROCEEDINGS, 2005, : 157 - 166
  • [6] An SPN-Based Integrated Model for Web Prefetching and Caching
    Lei Shi
    Ying-Jie Han
    Xiao-Guang Ding
    Lin Wei
    Zhi-Min Gu
    Journal of Computer Science and Technology, 2006, 21 : 482 - 489
  • [7] An SPN-based integrated model for web prefetching and caching
    Shi, Lei
    Han, Ying-Jie
    Ding, Xiao-Guang
    Wei, Lin
    Gu, Zhi-Min
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21 (04) : 482 - 489
  • [8] Markov-chain based reliability analysis for distributed systems
    Wang, JL
    COMPUTERS & ELECTRICAL ENGINEERING, 2004, 30 (03) : 183 - 205
  • [9] A Markov-chain Based Model for a Bike-Sharing System
    Crisostomi, Emanuele
    Faizrahnemoon, Mahsa
    Schlote, Arieh
    Shorten, Robert
    2015 INTERNATIONAL CONFERENCE ON CONNECTED VEHICLES AND EXPO (ICCVE), 2015, : 367 - 372
  • [10] Automated Markov-chain based Analysis for Large State Spaces
    Smith, Kaitlin N.
    Taylor, Michael A.
    Carroll, Anna A.
    Manikas, Theodore W.
    Thornton, Mitchell A.
    2017 11TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2017, : 306 - 313