A Distributed Fault-Tolerant Design for Multiple-Server VOD Systems

Authors
Ing-Jye Shyu
Shiuh-Pyng Shieh
Affiliation
[1] National Chiao-Tung University, Department of Computer Science and Information Engineering
Keywords
fault tolerance; fault recovery; distributed algorithms; multimedia systems
DOI
Not available
Abstract
Fault tolerance is an important design criterion for reliable and robust video-on-demand systems. Conventional fault-tolerant designs use either a primary-backup or an active-replication method to provide system fault tolerance. However, these approaches suffer from low utilization of the backup or replication system. In this paper we propose two playback-recovery schemes for distributed video-on-demand systems: the forward playback-recovery scheme and the backward playback-recovery scheme. Unlike conventional fault-tolerant designs, our schemes use existing playback resources to recover faulty playbacks without allocating new resources, significantly reducing recovery overhead. To use the schemes effectively, we developed a distributed algorithm that determines the order and gap information between the playbacks on the distributed video-on-demand servers, so that the overhead of recovering from a server failure can be minimized. This algorithm achieves N − 1 fault-tolerant resiliency for N-server video-on-demand systems. Three server-recovery policies are also presented to guide surviving servers in applying the proper scheme to recover faulty playbacks, thus reducing overall recovery costs. Simulation results show that the proposed recovery schemes are effective and useful in designing fault-tolerant multiple-server video-on-demand systems.
Pages: 219-247
Number of pages: 28